Debian Bug report logs - #430876
python-extractor when extracting keywords from file.


Package: python-extractor; Maintainer for python-extractor is Bertrand Marc <bmarc@debian.org>; Source for python-extractor is src:libextractor-python.

Reported by: romain@mezimail.com

Date: Wed, 27 Jun 2007 21:54:03 UTC

Severity: important

Fixed in version libextractor/0.5.18a-1

Done: Daniel Baumann <daniel@debian.org>

Bug is archived. No further changes may be made.



Report forwarded to debian-bugs-dist@lists.debian.org, Daniel Baumann <daniel@debian.org>:
Bug#430876; Package python-extractor.


Acknowledgement sent to romain@mezimail.com:
New Bug report received and forwarded. Copy sent to Daniel Baumann <daniel@debian.org>.


Message #5 received at submit@bugs.debian.org:

From: romain@mezimail.com
To: Debian Bug Tracking System <submit@bugs.debian.org>
Subject: python-extractor when extracting keywords from file.
Date: Wed, 27 Jun 2007 23:40:41 +0200
Package: python-extractor
Version: 1:0.5-2
Severity: grave
Justification: renders package unusable

Hi !

I see very strange behaviour when I try to extract keywords from a file.
Here is a pydb trace of running the extract.py example on a PDF file from
the texlive-pstricks documentation:
    $ pydb ./extract.py /usr/share/doc/texlive-pstricks/generic/pstricks/voss/bsp02.pdf
    (/usr/share/doc/python-extractor/examples/extract.py:24): 
    24 """
    (Pydb) n
    (/usr/share/doc/python-extractor/examples/extract.py:25): 
    25 import extractor
    (Pydb) n
    (/usr/share/doc/python-extractor/examples/extract.py:26): 
    26 import sys
    (Pydb) n
    (/usr/share/doc/python-extractor/examples/extract.py:28): 
    28 xtract = extractor.Extractor()
    (Pydb) 
    (/usr/share/doc/python-extractor/examples/extract.py:29): 
    29 for arg in sys.argv[1:]:
    (Pydb) 
    (/usr/share/doc/python-extractor/examples/extract.py:30): 
    30     print "Keywords from %s:" % arg
    (Pydb) 
    Keywords from /usr/share/doc/texlive-pstricks/generic/pstricks/voss/bsp02.pdf:
    (/usr/share/doc/python-extractor/examples/extract.py:31): 
    31     keys = xtract.extract(arg)
    (Pydb) s
    --Call level 0
    (/var/lib/python-support/python2.4/extractor.py:172):  extract
    172     def extract(self, filename=None, data=None, size=None):
    (Pydb) n
    (/var/lib/python-support/python2.4/extractor.py:187):  extract
    187     if not filename and not (data and size):
    (Pydb) n
    (/var/lib/python-support/python2.4/extractor.py:189):  extract
    189     elif filename:
    (Pydb) n
    (/var/lib/python-support/python2.4/extractor.py:190):  extract
    190         return self.extractFromFile(filename)
    (Pydb) s
    ----Call level 1
    (/var/lib/python-support/python2.4/extractor.py:194):  extractFromFile
    194     def extractFromFile(self, filename):
    (Pydb) n
    (/var/lib/python-support/python2.4/extractor.py:229):  extractFromFile
    229     self.keywords_p = libextractor.EXTRACTOR_getKeywords(self.extractors, filename)
    (Pydb) s

    ** (process:23365): CRITICAL **: gsf_input_memory_get_type: assertion
    `gsf_input_memory_type != 0' failed

    (process:23365): GLib-GObject-CRITICAL **: g_object_new: assertion
    `G_TYPE_IS_OBJECT (object_type)' failed

    ** (process:23365): CRITICAL **: gsf_shared_memory_get_type: assertion
    `gsf_shared_memory_type != 0' failed

    (process:23365): GLib-GObject-CRITICAL **: g_object_new: assertion
    `G_TYPE_IS_OBJECT (object_type)' failed
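The assertion failures at the end of this trace are printed by native libraries, not by Python, so a fatal error in that native code would terminate the Python interpreter as well. A minimal sketch, assuming Python 3 on a POSIX system, of running such a step in a child process so that the parent can report which signal ended it. `run_isolated` is a hypothetical helper, and the child command below only simulates a crash by sending itself SIGSEGV; it does not call libextractor.

```python
import signal
import subprocess
import sys


def run_isolated(argv):
    """Run argv in a child process; return (return_code, signal_name_or_None)."""
    proc = subprocess.run(argv, stdout=subprocess.PIPE, stderr=subprocess.PIPE)
    if proc.returncode < 0:
        # On POSIX, a negative return code means the child was killed by a signal.
        return proc.returncode, signal.Signals(-proc.returncode).name
    return proc.returncode, None


if __name__ == "__main__":
    # Stand-in for "python extract.py FILE": the child sends itself SIGSEGV.
    crashing_child = [
        sys.executable,
        "-c",
        "import os, signal; os.kill(os.getpid(), signal.SIGSEGV)",
    ]
    print(run_isolated(crashing_child))
```

In a real reproduction the child command would be the interpreter running extract.py on the problem file, which keeps the debugging session itself alive.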

Another strange thing is that I tested the extraction on another computer and
experienced no crash.

So I hesitated between "normal" severity (because it does not crash on the other computer) and "grave". I chose the latter because on my computer it crashes and renders the package unusable ;-)

I hope we will be able to find the origin of this problem.

Cheers,
Romain.

-- System Information:
Debian Release: lenny/sid
  APT prefers unstable
  APT policy: (500, 'unstable'), (500, 'stable'), (1, 'experimental')
Architecture: i386 (i686)

Kernel: Linux 2.6.21 (SMP w/2 CPU cores)
Locale: LANG=fr_FR.UTF-8, LC_CTYPE=fr_FR.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages python-extractor depends on:
ii  libextractor1c2a              0.5.18-3   extracts meta-data from files of a
ii  python                        2.4.4-6    An interactive high-level object-o
ii  python-ctypes                 1.0.1-1    Python package to create and manip
ii  python-support                0.6.4      automated rebuilding support for p

Versions of packages python-extractor recommends:
ii  libextractor-plugins          0.5.18-3   extracts meta-data from files of a

-- no debconf information



Information forwarded to debian-bugs-dist@lists.debian.org, Daniel Baumann <daniel@debian.org>:
Bug#430876; Package python-extractor.


Acknowledgement sent to Romain Chantereau <romain@mezimail.com>:
Extra info received and forwarded to list. Copy sent to Daniel Baumann <daniel@debian.org>.


Message #10 received at 430876@bugs.debian.org:

From: Romain Chantereau <romain@mezimail.com>
To: 430876@bugs.debian.org
Subject: Re: Bug#430876: Acknowledgement (python-extractor when extracting keywords from file.)
Date: Thu, 28 Jun 2007 21:53:53 +0200

Hi !

It seems to be related to libextractor itself. Here are a gdb backtrace and an
strace of an invocation of the extract utility:

GDB :

** (process:3683): CRITICAL **: gsf_input_memory_get_type: assertion
`gsf_input_memory_type != 0' failed

(process:3683): GLib-GObject-CRITICAL **: g_object_new: assertion
`G_TYPE_IS_OBJECT (object_type)' failed

** (process:3683): CRITICAL **: gsf_shared_memory_get_type: assertion
`gsf_shared_memory_type != 0' failed

(process:3683): GLib-GObject-CRITICAL **: g_object_new: assertion
`G_TYPE_IS_OBJECT (object_type)' failed

Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread -1212966144 (LWP 3683)]
0xb7d74978 in gsf_shared_memory_new () from /usr/lib/libgsf-1.so.114
(gdb) back
#0  0xb7d74978 in gsf_shared_memory_new () from /usr/lib/libgsf-1.so.114
#1  0xb7d6543d in gsf_input_memory_new () from /usr/lib/libgsf-1.so.114
#2  0xb7f00076 in libextractor_ole2_extract ()
   from /usr/lib/libextractor/libextractor_ole2.so
#3  0xb7ef74ca in ?? () from /usr/lib/libextractor.so.1
#4  0xbfa88964 in ?? ()
#5  0xb77c9000 in ?? ()
#6  0x00004d55 in ?? ()
#7  0x0805c910 in ?? ()
#8  0x00000000 in ?? ()
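Read from the top, the two innermost frames of this backtrace are in libgsf and the first frame outside it is the ole2 plugin. A minimal sketch in Python 3 of pulling the function and library out of each resolved frame. The regular expression and the `parse_frames` helper are illustrative assumptions made for this log's layout, not part of gdb, and the sample text is an excerpt of the frames shown above.

```python
import re

BACKTRACE = """\
#0  0xb7d74978 in gsf_shared_memory_new () from /usr/lib/libgsf-1.so.114
#1  0xb7d6543d in gsf_input_memory_new () from /usr/lib/libgsf-1.so.114
#2  0xb7f00076 in libextractor_ole2_extract ()
   from /usr/lib/libextractor/libextractor_ole2.so
#3  0xb7ef74ca in ?? () from /usr/lib/libextractor.so.1
#4  0xbfa88964 in ?? ()
"""

# Matches frames of the form "#N  0xADDR in FUNC () [from LIB]".
FRAME_RE = re.compile(
    r"^#(?P<num>\d+)\s+0x[0-9a-f]+ in (?P<func>\S+) \(\)(?: from (?P<lib>\S+))?$"
)


def parse_frames(text):
    """Return a list of (frame_number, function, library_or_None) tuples."""
    # Re-join frames that were wrapped onto an indented continuation line.
    joined = []
    for line in text.splitlines():
        if line[:1].isspace() and joined:
            joined[-1] += " " + line.strip()
        elif line.strip():
            joined.append(line.strip())
    frames = []
    for line in joined:
        match = FRAME_RE.match(line)
        if match:
            frames.append(
                (int(match.group("num")), match.group("func"), match.group("lib"))
            )
    return frames


if __name__ == "__main__":
    for frame in parse_frames(BACKTRACE):
        print(frame)
```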


STRACE :

open("/usr/lib/libextractor/libextractor_nsfe.la", O_RDONLY) = 3
fstat64(3, {st_mode=S_IFREG|0644, st_size=996, ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1,
0) = 0xb7a2f000
read(3, "# libextractor_nsfe.la - a libto"..., 4096) = 996
read(3, "", 4096)                       = 0
close(3)                                = 0
munmap(0xb7a2f000, 4096)                = 0
open("/usr/lib/libextractor/libextractor_nsfe.so", O_RDONLY) = 3
read(3, "\177ELF\1\1\1\0\0\0\0\0\0\0\0\0\3\0\3\0\1\0\0\0\220\4\0"...,
512) = 512
fstat64(3, {st_mode=S_IFREG|0644, st_size=4368, ...}) = 0
mmap2(NULL, 7336, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0)
= 0xb780a000
mmap2(0xb780b000, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|
MAP_DENYWRITE, 3, 0) = 0xb780b000
close(3)                                = 0
open("/usr/lib/libextractor/libextractor_nsf.la", O_RDONLY) = 3
fstat64(3, {st_mode=S_IFREG|0644, st_size=990, ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1,
0) = 0xb7a2f000
read(3, "# libextractor_nsf.la - a libtoo"..., 4096) = 990
read(3, "", 4096)                       = 0
close(3)                                = 0
munmap(0xb7a2f000, 4096)                = 0
open("/usr/lib/libextractor/libextractor_nsf.so", O_RDONLY) = 3
read(3, "\177ELF\1\1\1\0\0\0\0\0\0\0\0\0\3\0\3\0\1\0\0\0\360\3\0"...,
512) = 512
fstat64(3, {st_mode=S_IFREG|0644, st_size=3832, ...}) = 0
mmap2(NULL, 6800, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0)
= 0xb7808000
mmap2(0xb7809000, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|
MAP_DENYWRITE, 3, 0) = 0xb7809000
close(3)                                = 0
stat64("/usr/share/doc/texlive-pstricks/generic/pstricks/voss/bsp02.pdf", {st_mode=S_IFREG|0644, st_size=19797, ...}) = 0
open("/usr/share/doc/texlive-pstricks/generic/pstricks/voss/bsp02.pdf",
O_RDONLY|O_LARGEFILE) = 3
mmap2(NULL, 19797, PROT_READ, MAP_PRIVATE, 3, 0) = 0xb7803000
open("/usr/lib/charset.alias", O_RDONLY|O_LARGEFILE) = -1 ENOENT (No
such file or directory)
write(2, "\n** (process:4429): CRITICAL **:"..., 106
** (process:4429): CRITICAL **: gsf_input_memory_get_type: assertion
`gsf_input_memory_type != 0' failed
) = 106
write(2, "\n(process:4429): GLib-GObject-CR"..., 107
(process:4429): GLib-GObject-CRITICAL **: g_object_new: assertion
`G_TYPE_IS_OBJECT (object_type)' failed
) = 107
write(2, "\n** (process:4429): CRITICAL **:"..., 108
** (process:4429): CRITICAL **: gsf_shared_memory_get_type: assertion
`gsf_shared_memory_type != 0' failed
) = 108
write(2, "\n(process:4429): GLib-GObject-CR"..., 107
(process:4429): GLib-GObject-CRITICAL **: g_object_new: assertion
`G_TYPE_IS_OBJECT (object_type)' failed
) = 107
--- SIGSEGV (Segmentation fault) @ 0 (0) ---
+++ killed by SIGSEGV +++
Process 4429 detached
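The strace output above shows each plugin being opened before the input file is mapped, and ends with the signal that killed the process. A minimal sketch in Python 3 of summarising such a log: which plugin `.so` files were opened successfully, and which signal ended the run. The `summarize` helper and both regular expressions are assumptions made for this illustration, and the sample is an abbreviated, unwrapped excerpt of the lines above.

```python
import re

STRACE = '''\
open("/usr/lib/libextractor/libextractor_nsfe.la", O_RDONLY) = 3
open("/usr/lib/libextractor/libextractor_nsfe.so", O_RDONLY) = 3
open("/usr/lib/libextractor/libextractor_nsf.so", O_RDONLY) = 3
open("/usr/lib/charset.alias", O_RDONLY|O_LARGEFILE) = -1 ENOENT (No such file or directory)
+++ killed by SIGSEGV +++
'''

# open("PATH", FLAGS) = RET
OPEN_RE = re.compile(r'^open\("(?P<path>[^"]+)", [^)]*\)\s+= (?P<ret>-?\d+)')
# +++ killed by SIGNAME +++
KILLED_RE = re.compile(r"^\+\+\+ killed by (?P<sig>\w+) \+\+\+$")


def summarize(text):
    """Return (list of successfully opened .so paths, killing signal or None)."""
    loaded, killed_by = [], None
    for line in text.splitlines():
        match = OPEN_RE.match(line)
        if match:
            # Keep only successful opens of shared objects.
            if int(match.group("ret")) >= 0 and match.group("path").endswith(".so"):
                loaded.append(match.group("path"))
            continue
        match = KILLED_RE.match(line)
        if match:
            killed_by = match.group("sig")
    return loaded, killed_by


if __name__ == "__main__":
    print(summarize(STRACE))
```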


As I said in my previous message, I see this only on my computer. :-/

Cheers,
Romain




Information forwarded to debian-bugs-dist@lists.debian.org, Daniel Baumann <daniel@debian.org>:
Bug#430876; Package python-extractor.


Acknowledgement sent to daniel@debian.org:
Extra info received and forwarded to list. Copy sent to Daniel Baumann <daniel@debian.org>.


Message #15 received at 430876@bugs.debian.org:

From: Daniel Baumann <daniel@debian.org>
To: Romain Chantereau <romain@mezimail.com>, 430876@bugs.debian.org, control@bugs.debian.org
Subject: Re: Bug#430876: Acknowledgement (python-extractor when extracting keywords from file.)
Date: Thu, 28 Jun 2007 22:14:35 +0200

severity 430876 important
thanks

Romain Chantereau wrote:
> As in my previous message I am suffering this only on my computer. :-/

I've tried it on sid/i386 (like you) and didn't encounter the problem on
the first run:

(sid_i386)user@debian:/usr/share/doc/python-extractor/examples$ ./extract.py /usr/share/doc/texlive-pstricks/generic/pstricks/voss/bsp02.pdf
Keywords from /usr/share/doc/texlive-pstricks/generic/pstricks/voss/bsp02.pdf:
format - PDF 1.2
mimetype - application/pdf
(sid_i386)user@debian:/usr/share/doc/python-extractor/examples$

Then I installed libextractor-plugins and could reproduce the problem.
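That the crash only appears once libextractor-plugins is installed suggests checking which plugins are actually present on a machine before comparing results between systems. A minimal sketch in Python 3, assuming plugins are files named `libextractor_<name>.so` in a single directory, as the paths in the gdb and strace output earlier in this log suggest. `installed_plugins` is a hypothetical helper, and the demo uses a temporary directory with empty stand-in files so that it runs anywhere.

```python
import os
import tempfile


def installed_plugins(plugin_dir="/usr/lib/libextractor"):
    """Return sorted plugin names found as libextractor_<name>.so in plugin_dir."""
    prefix, suffix = "libextractor_", ".so"
    try:
        entries = os.listdir(plugin_dir)
    except OSError:
        # A missing or unreadable directory is treated as "no plugins installed".
        return []
    return sorted(
        entry[len(prefix):-len(suffix)]
        for entry in entries
        if entry.startswith(prefix) and entry.endswith(suffix)
    )


if __name__ == "__main__":
    # Demo with stand-in files; on a real system, call installed_plugins() directly.
    with tempfile.TemporaryDirectory() as demo_dir:
        for name in ("libextractor_ole2.so", "libextractor_pdf.so"):
            open(os.path.join(demo_dir, name), "w").close()
        names = installed_plugins(demo_dir)
        print("plugins:", names)
        print("ole2 plugin present:", "ole2" in names)
```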

-- 
Address:        Daniel Baumann, Burgunderstrasse 3, CH-4562 Biberist
Email:          daniel.baumann@panthera-systems.net
Internet:       http://people.panthera-systems.net/~daniel-baumann/



Severity set to `important' from `grave' Request was from Daniel Baumann <daniel@debian.org> to control@bugs.debian.org. (Thu, 28 Jun 2007 20:15:02 GMT)


Information forwarded to debian-bugs-dist@lists.debian.org, Daniel Baumann <daniel@debian.org>:
Bug#430876; Package python-extractor.


Acknowledgement sent to Romain Chantereau <romain@mezimail.com>:
Extra info received and forwarded to list. Copy sent to Daniel Baumann <daniel@debian.org>.


Message #22 received at 430876@bugs.debian.org:

From: Romain Chantereau <romain@mezimail.com>
To: 430876@bugs.debian.org
Subject: Re: Bug#430876: Info received (Bug#430876: Acknowledgement (python-extractor when extracting keywords from file.))
Date: Thu, 28 Jun 2007 22:14:56 +0200

I submitted a bug to the libextractor Mantis tracker:
https://gnunet.org/mantis/view.php?id=1241

Romain.




Information forwarded to debian-bugs-dist@lists.debian.org, Daniel Baumann <daniel@debian.org>:
Bug#430876; Package python-extractor.


Acknowledgement sent to Romain Chantereau <romain@mezimail.com>:
Extra info received and forwarded to list. Copy sent to Daniel Baumann <daniel@debian.org>.


Message #27 received at 430876@bugs.debian.org:

From: Romain Chantereau <romain@mezimail.com>
To: 430876@bugs.debian.org
Subject: You can close this bug, it is libgsf related.
Date: Fri, 29 Jun 2007 21:27:46 +0200

Hi !

Christian Grothoff suspected libgsf of being the origin of the bug, and
he was right.

I downgraded libgsf from version 1.4.4 to 1.4.3, and the segfault
disappeared.

I reported this as bug #431104.

Thanks,
Romain.




Reply sent to Daniel Baumann <daniel@debian.org>:
You have taken responsibility.


Notification sent to romain@mezimail.com:
Bug acknowledged by developer.


Message #32 received at 430876-close@bugs.debian.org:

From: Daniel Baumann <daniel@debian.org>
To: 430876-close@bugs.debian.org
Subject: Bug#430876: fixed in libextractor 0.5.18a-1
Date: Thu, 05 Jul 2007 12:17:07 +0000

Source: libextractor
Source-Version: 0.5.18a-1

We believe that the bug you reported is fixed in the latest version of
libextractor, which is due to be installed in the Debian FTP archive:

extract_0.5.18a-1_i386.deb
  to pool/main/libe/libextractor/extract_0.5.18a-1_i386.deb
libextractor-dev_0.5.18a-1_i386.deb
  to pool/main/libe/libextractor/libextractor-dev_0.5.18a-1_i386.deb
libextractor-plugins_0.5.18a-1_i386.deb
  to pool/main/libe/libextractor/libextractor-plugins_0.5.18a-1_i386.deb
libextractor1c2a_0.5.18a-1_i386.deb
  to pool/main/libe/libextractor/libextractor1c2a_0.5.18a-1_i386.deb
libextractor_0.5.18a-1.diff.gz
  to pool/main/libe/libextractor/libextractor_0.5.18a-1.diff.gz
libextractor_0.5.18a-1.dsc
  to pool/main/libe/libextractor/libextractor_0.5.18a-1.dsc
libextractor_0.5.18a.orig.tar.gz
  to pool/main/libe/libextractor/libextractor_0.5.18a.orig.tar.gz



A summary of the changes between this version and the previous one is
attached.

Thank you for reporting the bug, which will now be closed.  If you
have further comments please address them to 430876@bugs.debian.org,
and the maintainer will reopen the bug report if appropriate.

Debian distribution maintenance software
pp.
Daniel Baumann <daniel@debian.org> (supplier of updated libextractor package)

(This message was generated automatically at their request; if you
believe that there is a problem with it please contact the archive
administrators by mailing ftpmaster@debian.org)


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Format: 1.7
Date: Thu,  5 Jul 2007 13:52:00 +0200
Source: libextractor
Binary: libextractor-plugins extract libextractor1c2a libextractor-dev
Architecture: source i386
Version: 0.5.18a-1
Distribution: unstable
Urgency: medium
Maintainer: Daniel Baumann <daniel@debian.org>
Changed-By: Daniel Baumann <daniel@debian.org>
Description: 
 extract    - displays meta-data from files of arbitrary type
 libextractor-dev - extracts meta-data from files of arbitrary type (development)
 libextractor-plugins - extracts meta-data from files of arbitrary type (plugins)
 libextractor1c2a - extracts meta-data from files of arbitrary type (library)
Closes: 430876
Changes: 
 libextractor (0.5.18a-1) unstable; urgency=medium
 .
   * New upstream release:
     - Fixes a problem with libgsf (Closes: #430876).
     - Dropped 01-include.dpatch, included in upstream.
Files: 
 e34fe3c9ef5252dd93739f37b7a2c5a8 834 devel optional libextractor_0.5.18a-1.dsc
 1623ee0670c2da8ce12d68a7c98dc1e0 7915435 devel optional libextractor_0.5.18a.orig.tar.gz
 b755f0509e6308f519a618cf7404900c 5621 devel optional libextractor_0.5.18a-1.diff.gz
 098666fb0a01b6858aa45f94a29ccade 7603772 libs optional libextractor1c2a_0.5.18a-1_i386.deb
 ebcbbdcc3b3b2bfd054cbbe10f1493b7 217562 libs optional libextractor-plugins_0.5.18a-1_i386.deb
 226b47d8e1897e758f69d8ad0731eedb 24076 libdevel optional libextractor-dev_0.5.18a-1_i386.deb
 99132e8034e0bee2295fa5864ac008b8 53382 utils optional extract_0.5.18a-1_i386.deb

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)

iD8DBQFGjN6V+C5cwEsrK54RAtwwAJ9HMVuL12K2Xa5kv+uO+osTd+aIQwCeJ+9i
PixoEk1aamhvNRCgpqFPEGk=
=wtGv
-----END PGP SIGNATURE-----




No longer marked as found in versions libextractor-python/1:0.5-2. Request was from Andreas Beckmann <anbe@debian.org> to control@bugs.debian.org. (Sat, 02 Nov 2013 15:57:34 GMT)


Bug archived. Request was from Debbugs Internal Request <owner@bugs.debian.org> to internal_control@bugs.debian.org. (Sun, 01 Dec 2013 07:41:47 GMT)




Debian bug tracking system administrator <owner@bugs.debian.org>. Last modified: Wed Oct 11 23:39:56 2017; Machine Name: buxtehude
