Debian Bug report logs - #430876
python-extractor when extracting keywords from file.
Reported by: romain@mezimail.com
Date: Wed, 27 Jun 2007 21:54:03 UTC
Severity: important
Fixed in version libextractor/0.5.18a-1
Done: Daniel Baumann <daniel@debian.org>
Bug is archived. No further changes may be made.
Report forwarded to debian-bugs-dist@lists.debian.org, Daniel Baumann <daniel@debian.org>:
Bug#430876; Package python-extractor.
Acknowledgement sent to romain@mezimail.com:
New Bug report received and forwarded. Copy sent to Daniel Baumann <daniel@debian.org>.
Message #5 received at submit@bugs.debian.org:
Package: python-extractor
Version: 1:0.5-2
Severity: grave
Justification: renders package unusable
Hi!
I see very strange behaviour when I try to extract keywords from a file.
Here is a pydb trace of running the extract.py example on a PDF file from the
texlive-pstricks documentation:
$ pydb ./extract.py /usr/share/doc/texlive-pstricks/generic/pstricks/voss/bsp02.pdf
(/usr/share/doc/python-extractor/examples/extract.py:24):
24 """
(Pydb) n
(/usr/share/doc/python-extractor/examples/extract.py:25):
25 import extractor
(Pydb) n
(/usr/share/doc/python-extractor/examples/extract.py:26):
26 import sys
(Pydb) n
(/usr/share/doc/python-extractor/examples/extract.py:28):
28 xtract = extractor.Extractor()
(Pydb)
(/usr/share/doc/python-extractor/examples/extract.py:29):
29 for arg in sys.argv[1:]:
(Pydb)
(/usr/share/doc/python-extractor/examples/extract.py:30):
30 print "Keywords from %s:" % arg
(Pydb)
Keywords from /usr/share/doc/texlive-pstricks/generic/pstricks/voss/bsp02.pdf:
(/usr/share/doc/python-extractor/examples/extract.py:31):
31 keys = xtract.extract(arg)
(Pydb) s
--Call level 0
(/var/lib/python-support/python2.4/extractor.py:172): extract
172 def extract(self, filename=None, data=None, size=None):
(Pydb) n
(/var/lib/python-support/python2.4/extractor.py:187): extract
187 if not filename and not (data and size):
(Pydb) n
(/var/lib/python-support/python2.4/extractor.py:189): extract
189 elif filename:
(Pydb) n
(/var/lib/python-support/python2.4/extractor.py:190): extract
190 return self.extractFromFile(filename)
(Pydb) s
----Call level 1
(/var/lib/python-support/python2.4/extractor.py:194): extractFromFile
194 def extractFromFile(self, filename):
(Pydb) n
(/var/lib/python-support/python2.4/extractor.py:229): extractFromFile
229 self.keywords_p = libextractor.EXTRACTOR_getKeywords(self.extractors, filename)
(Pydb) s
** (process:23365): CRITICAL **: gsf_input_memory_get_type: assertion `gsf_input_memory_type != 0' failed
(process:23365): GLib-GObject-CRITICAL **: g_object_new: assertion `G_TYPE_IS_OBJECT (object_type)' failed
** (process:23365): CRITICAL **: gsf_shared_memory_get_type: assertion `gsf_shared_memory_type != 0' failed
(process:23365): GLib-GObject-CRITICAL **: g_object_new: assertion `G_TYPE_IS_OBJECT (object_type)' failed
Another strange thing: I ran the same extraction on another computer and it
did not crash.
So I hesitated between "normal" severity (because it does not crash on the other computer) and "grave". I chose the latter because on my computer it crashes and renders the package unusable ;-)
I hope we will be able to find the origin of this problem.
Cheers,
Romain.
-- System Information:
Debian Release: lenny/sid
APT prefers unstable
APT policy: (500, 'unstable'), (500, 'stable'), (1, 'experimental')
Architecture: i386 (i686)
Kernel: Linux 2.6.21 (SMP w/2 CPU cores)
Locale: LANG=fr_FR.UTF-8, LC_CTYPE=fr_FR.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash
Versions of packages python-extractor depends on:
ii libextractor1c2a 0.5.18-3 extracts meta-data from files of a
ii python 2.4.4-6 An interactive high-level object-o
ii python-ctypes 1.0.1-1 Python package to create and manip
ii python-support 0.6.4 automated rebuilding support for p
Versions of packages python-extractor recommends:
ii libextractor-plugins 0.5.18-3 extracts meta-data from files of a
-- no debconf information
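The trace in this message stops inside the native call libextractor.EXTRACTOR_getKeywords, and the reporter says the run crashes there. A crash in native code cannot be caught with try/except in the calling interpreter, so one workaround is to run the extraction in a child process and inspect how it ended. The following is only a sketch under stated assumptions: it is written for Python 3 (not the Python 2.4 used in the report), and run_python_isolated is a hypothetical helper name, not part of python-extractor.

```python
import signal
import subprocess
import sys


def run_python_isolated(code, *args):
    """Run a Python snippet in a child interpreter and report how it ended.

    Hypothetical helper for illustration; not part of python-extractor.
    """
    proc = subprocess.run(
        [sys.executable, "-c", code, *args],
        capture_output=True,
        text=True,
    )
    killed_by = None
    if proc.returncode < 0:
        # On POSIX, a negative return code means the child died from a signal.
        killed_by = signal.Signals(-proc.returncode).name
    return {
        "ok": proc.returncode == 0,
        "signal": killed_by,
        "stdout": proc.stdout,
        "stderr": proc.stderr,
    }
```

Assuming an extractor module is importable by the child interpreter, a caller could pass a snippet such as "import sys, extractor; extractor.Extractor().extract(sys.argv[1])" together with the file path, and treat a non-empty "signal" entry as a crash in the native library rather than a Python error.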
Information forwarded to debian-bugs-dist@lists.debian.org, Daniel Baumann <daniel@debian.org>:
Bug#430876; Package python-extractor.
Acknowledgement sent to Romain Chantereau <romain@mezimail.com>:
Extra info received and forwarded to list. Copy sent to Daniel Baumann <daniel@debian.org>.
Message #10 received at 430876@bugs.debian.org:
Hi!
It seems to be related to libextractor itself. Here are a gdb backtrace and an
strace of an invocation of the extract utility:
GDB:
** (process:3683): CRITICAL **: gsf_input_memory_get_type: assertion `gsf_input_memory_type != 0' failed
(process:3683): GLib-GObject-CRITICAL **: g_object_new: assertion `G_TYPE_IS_OBJECT (object_type)' failed
** (process:3683): CRITICAL **: gsf_shared_memory_get_type: assertion `gsf_shared_memory_type != 0' failed
(process:3683): GLib-GObject-CRITICAL **: g_object_new: assertion `G_TYPE_IS_OBJECT (object_type)' failed
Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread -1212966144 (LWP 3683)]
0xb7d74978 in gsf_shared_memory_new () from /usr/lib/libgsf-1.so.114
(gdb) back
#0 0xb7d74978 in gsf_shared_memory_new () from /usr/lib/libgsf-1.so.114
#1 0xb7d6543d in gsf_input_memory_new () from /usr/lib/libgsf-1.so.114
#2 0xb7f00076 in libextractor_ole2_extract ()
from /usr/lib/libextractor/libextractor_ole2.so
#3 0xb7ef74ca in ?? () from /usr/lib/libextractor.so.1
#4 0xbfa88964 in ?? ()
#5 0xb77c9000 in ?? ()
#6 0x00004d55 in ?? ()
#7 0x0805c910 in ?? ()
#8 0x00000000 in ?? ()
STRACE:
open("/usr/lib/libextractor/libextractor_nsfe.la", O_RDONLY) = 3
fstat64(3, {st_mode=S_IFREG|0644, st_size=996, ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0xb7a2f000
read(3, "# libextractor_nsfe.la - a libto"..., 4096) = 996
read(3, "", 4096) = 0
close(3) = 0
munmap(0xb7a2f000, 4096) = 0
open("/usr/lib/libextractor/libextractor_nsfe.so", O_RDONLY) = 3
read(3, "\177ELF\1\1\1\0\0\0\0\0\0\0\0\0\3\0\3\0\1\0\0\0\220\4\0"..., 512) = 512
fstat64(3, {st_mode=S_IFREG|0644, st_size=4368, ...}) = 0
mmap2(NULL, 7336, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0xb780a000
mmap2(0xb780b000, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, 3, 0) = 0xb780b000
close(3) = 0
open("/usr/lib/libextractor/libextractor_nsf.la", O_RDONLY) = 3
fstat64(3, {st_mode=S_IFREG|0644, st_size=990, ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0xb7a2f000
read(3, "# libextractor_nsf.la - a libtoo"..., 4096) = 990
read(3, "", 4096) = 0
close(3) = 0
munmap(0xb7a2f000, 4096) = 0
open("/usr/lib/libextractor/libextractor_nsf.so", O_RDONLY) = 3
read(3, "\177ELF\1\1\1\0\0\0\0\0\0\0\0\0\3\0\3\0\1\0\0\0\360\3\0"..., 512) = 512
fstat64(3, {st_mode=S_IFREG|0644, st_size=3832, ...}) = 0
mmap2(NULL, 6800, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0xb7808000
mmap2(0xb7809000, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, 3, 0) = 0xb7809000
close(3) = 0
stat64("/usr/share/doc/texlive-pstricks/generic/pstricks/voss/bsp02.pdf", {st_mode=S_IFREG|0644, st_size=19797, ...}) = 0
open("/usr/share/doc/texlive-pstricks/generic/pstricks/voss/bsp02.pdf", O_RDONLY|O_LARGEFILE) = 3
mmap2(NULL, 19797, PROT_READ, MAP_PRIVATE, 3, 0) = 0xb7803000
open("/usr/lib/charset.alias", O_RDONLY|O_LARGEFILE) = -1 ENOENT (No such file or directory)
write(2, "\n** (process:4429): CRITICAL **:"..., 106
** (process:4429): CRITICAL **: gsf_input_memory_get_type: assertion
`gsf_input_memory_type != 0' failed
) = 106
write(2, "\n(process:4429): GLib-GObject-CR"..., 107
(process:4429): GLib-GObject-CRITICAL **: g_object_new: assertion
`G_TYPE_IS_OBJECT (object_type)' failed
) = 107
write(2, "\n** (process:4429): CRITICAL **:"..., 108
** (process:4429): CRITICAL **: gsf_shared_memory_get_type: assertion
`gsf_shared_memory_type != 0' failed
) = 108
write(2, "\n(process:4429): GLib-GObject-CR"..., 107
(process:4429): GLib-GObject-CRITICAL **: g_object_new: assertion
`G_TYPE_IS_OBJECT (object_type)' failed
) = 107
--- SIGSEGV (Segmentation fault) @ 0 (0) ---
+++ killed by SIGSEGV +++
Process 4429 detached
As in my previous message I am suffering this only on my computer. :-/
Cheers,
Romain
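The backtrace in this message already hints at where the fault lies: the innermost frames are in libgsf, reached from the ole2 plugin. A small script can make that reading mechanical by listing the shared libraries in frame order. This is a rough sketch for Python 3 that only understands the simple "#N 0xADDR in FUNC () from LIB" frame shape seen here; the function names are made up for illustration, and it tolerates a frame whose "from ..." part continues on the next line.

```python
import re

# Matches frames shaped like: "#0 0xb7d74978 in some_function () from /path/lib.so"
FRAME_RE = re.compile(
    r"#(?P<index>\d+)\s+0x[0-9a-f]+\s+in\s+(?P<func>\S+)\s+\(\)"
    r"(?:\s+from\s+(?P<lib>\S+))?"
)


def parse_backtrace(text):
    """Return (index, function, library) tuples; library is None if unknown."""
    # Rejoin continuation lines, i.e. lines that do not start a new "#N" frame.
    joined = re.sub(r"\n(?!\s*#\d)", " ", text.strip())
    frames = []
    for line in joined.splitlines():
        match = FRAME_RE.match(line.strip())
        if match:
            frames.append((int(match["index"]), match["func"], match["lib"]))
    return frames


def libraries_in_order(frames):
    """List each library once, innermost frame first."""
    seen = []
    for _, _, lib in frames:
        if lib and lib not in seen:
            seen.append(lib)
    return seen
```

Fed the backtrace from this message, the first library listed is /usr/lib/libgsf-1.so.114, followed by the ole2 plugin and then libextractor itself.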
Information forwarded to debian-bugs-dist@lists.debian.org, Daniel Baumann <daniel@debian.org>:
Bug#430876; Package python-extractor.
Acknowledgement sent to daniel@debian.org:
Extra info received and forwarded to list. Copy sent to Daniel Baumann <daniel@debian.org>.
Message #15 received at 430876@bugs.debian.org:
severity 430876 important
thanks
Romain Chantereau wrote:
> As in my previous message I am suffering this only on my computer. :-/
I tried it on sid/i386 (like you) and did not hit the problem on the
first run:
(sid_i386)user@debian:/usr/share/doc/python-extractor/examples$ ./extract.py /usr/share/doc/texlive-pstricks/generic/pstricks/voss/bsp02.pdf
Keywords from /usr/share/doc/texlive-pstricks/generic/pstricks/voss/bsp02.pdf:
format - PDF 1.2
mimetype - application/pdf
(sid_i386)user@debian:/usr/share/doc/python-extractor/examples$
Then I installed libextractor-plugins and could reproduce the problem.
--
Address: Daniel Baumann, Burgunderstrasse 3, CH-4562 Biberist
Email: daniel.baumann@panthera-systems.net
Internet: http://people.panthera-systems.net/~daniel-baumann/
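Since the problem only shows up once libextractor-plugins is installed, it can help to check which plugin libraries are present on a given machine. The sketch below (Python 3) simply lists files named libextractor_<name>.so in a directory. It does not reproduce libextractor's own plugin discovery, which this report does not describe, and list_plugins is a hypothetical helper name.

```python
from pathlib import Path


def list_plugins(plugin_dir):
    """Return plugin names found as libextractor_<name>.so in plugin_dir."""
    prefix = "libextractor_"
    suffix = ".so"
    names = []
    for path in sorted(Path(plugin_dir).glob(prefix + "*" + suffix)):
        # Strip the common prefix and the ".so" suffix to get the plugin name.
        names.append(path.name[len(prefix):-len(suffix)])
    return names
```

On a system like the reporter's, one would expect "ole2" to appear in list_plugins("/usr/lib/libextractor"), since the backtrace earlier in this log shows libextractor_ole2.so loaded from that directory.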
Severity set to `important' from `grave'
Request was from Daniel Baumann <daniel@debian.org>
to control@bugs.debian.org.
(Thu, 28 Jun 2007 20:15:02 GMT)
Information forwarded to debian-bugs-dist@lists.debian.org, Daniel Baumann <daniel@debian.org>:
Bug#430876; Package python-extractor.
Acknowledgement sent to Romain Chantereau <romain@mezimail.com>:
Extra info received and forwarded to list. Copy sent to Daniel Baumann <daniel@debian.org>.
Message #22 received at 430876@bugs.debian.org:
I submitted a bug to the libextractor Mantis tracker:
https://gnunet.org/mantis/view.php?id=1241
Romain.
Information forwarded to debian-bugs-dist@lists.debian.org, Daniel Baumann <daniel@debian.org>:
Bug#430876; Package python-extractor.
Acknowledgement sent to Romain Chantereau <romain@mezimail.com>:
Extra info received and forwarded to list. Copy sent to Daniel Baumann <daniel@debian.org>.
Message #27 received at 430876@bugs.debian.org:
Hi!
Christian Grothoff suspected libgsf as the origin of the bug, and
he was right.
I downgraded libgsf from 1.4.4 to 1.4.3, and the segfault
disappeared.
I reported this as bug #431104.
Thanks,
Romain.
Reply sent to Daniel Baumann <daniel@debian.org>:
You have taken responsibility.
Notification sent to romain@mezimail.com:
Bug acknowledged by developer.
Message #32 received at 430876-close@bugs.debian.org:
Source: libextractor
Source-Version: 0.5.18a-1
We believe that the bug you reported is fixed in the latest version of
libextractor, which is due to be installed in the Debian FTP archive:
extract_0.5.18a-1_i386.deb
to pool/main/libe/libextractor/extract_0.5.18a-1_i386.deb
libextractor-dev_0.5.18a-1_i386.deb
to pool/main/libe/libextractor/libextractor-dev_0.5.18a-1_i386.deb
libextractor-plugins_0.5.18a-1_i386.deb
to pool/main/libe/libextractor/libextractor-plugins_0.5.18a-1_i386.deb
libextractor1c2a_0.5.18a-1_i386.deb
to pool/main/libe/libextractor/libextractor1c2a_0.5.18a-1_i386.deb
libextractor_0.5.18a-1.diff.gz
to pool/main/libe/libextractor/libextractor_0.5.18a-1.diff.gz
libextractor_0.5.18a-1.dsc
to pool/main/libe/libextractor/libextractor_0.5.18a-1.dsc
libextractor_0.5.18a.orig.tar.gz
to pool/main/libe/libextractor/libextractor_0.5.18a.orig.tar.gz
A summary of the changes between this version and the previous one is
attached.
Thank you for reporting the bug, which will now be closed. If you
have further comments please address them to 430876@bugs.debian.org,
and the maintainer will reopen the bug report if appropriate.
Debian distribution maintenance software
pp.
Daniel Baumann <daniel@debian.org> (supplier of updated libextractor package)
(This message was generated automatically at their request; if you
believe that there is a problem with it please contact the archive
administrators by mailing ftpmaster@debian.org)
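To tell whether an installed package already contains the fix, its version has to be compared with 0.5.18a-1 using Debian's ordering rather than plain string comparison. The following is a simplified Python 3 sketch of that ordering as described in Debian Policy (epoch, then upstream version, then revision, with "~" sorting before everything). It is meant as an illustration only; "dpkg --compare-versions" remains the authoritative tool, and the function names here are invented.

```python
def _order(ch):
    # "~" sorts before everything, letters sort before other characters.
    if ch == "~":
        return -1
    if ch.isalpha():
        return ord(ch)
    return ord(ch) + 256


def _compare_fragment(a, b):
    """Compare two upstream-version or revision strings; return -1, 0 or 1."""
    i = j = 0
    while i < len(a) or j < len(b):
        # Compare the non-digit runs character by character.
        while (i < len(a) and not a[i].isdigit()) or (j < len(b) and not b[j].isdigit()):
            ca = _order(a[i]) if i < len(a) and not a[i].isdigit() else 0
            cb = _order(b[j]) if j < len(b) and not b[j].isdigit() else 0
            if ca != cb:
                return -1 if ca < cb else 1
            i += 1
            j += 1
        # Compare the following digit runs numerically (empty run counts as 0).
        start_i, start_j = i, j
        while i < len(a) and a[i].isdigit():
            i += 1
        while j < len(b) and b[j].isdigit():
            j += 1
        na = int(a[start_i:i] or "0")
        nb = int(b[start_j:j] or "0")
        if na != nb:
            return -1 if na < nb else 1
    return 0


def split_version(version):
    """Split "[epoch:]upstream[-revision]" into (epoch, upstream, revision)."""
    epoch, sep, rest = version.partition(":")
    if not sep:
        epoch, rest = "0", version
    upstream, sep, revision = rest.rpartition("-")
    if not sep:
        upstream, revision = rest, "0"
    return int(epoch), upstream, revision


def compare_versions(a, b):
    """Return -1, 0 or 1 if version a is older than, equal to or newer than b."""
    epoch_a, upstream_a, revision_a = split_version(a)
    epoch_b, upstream_b, revision_b = split_version(b)
    if epoch_a != epoch_b:
        return -1 if epoch_a < epoch_b else 1
    return _compare_fragment(upstream_a, upstream_b) or _compare_fragment(revision_a, revision_b)
```

With the versions from this log, compare_versions("0.5.18-3", "0.5.18a-1") is negative, so the libextractor1c2a 0.5.18-3 listed in the original report predates the fixed upload.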
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Format: 1.7
Date: Thu, 5 Jul 2007 13:52:00 +0200
Source: libextractor
Binary: libextractor-plugins extract libextractor1c2a libextractor-dev
Architecture: source i386
Version: 0.5.18a-1
Distribution: unstable
Urgency: medium
Maintainer: Daniel Baumann <daniel@debian.org>
Changed-By: Daniel Baumann <daniel@debian.org>
Description:
extract - displays meta-data from files of arbitrary type
libextractor-dev - extracts meta-data from files of arbitrary type (development)
libextractor-plugins - extracts meta-data from files of arbitrary type (plugins)
libextractor1c2a - extracts meta-data from files of arbitrary type (library)
Closes: 430876
Changes:
libextractor (0.5.18a-1) unstable; urgency=medium
.
* New upstream release:
- Fixes a problem with libgsf (Closes: #430876).
- Dropped 01-include.dpatch, included in upstream.
Files:
e34fe3c9ef5252dd93739f37b7a2c5a8 834 devel optional libextractor_0.5.18a-1.dsc
1623ee0670c2da8ce12d68a7c98dc1e0 7915435 devel optional libextractor_0.5.18a.orig.tar.gz
b755f0509e6308f519a618cf7404900c 5621 devel optional libextractor_0.5.18a-1.diff.gz
098666fb0a01b6858aa45f94a29ccade 7603772 libs optional libextractor1c2a_0.5.18a-1_i386.deb
ebcbbdcc3b3b2bfd054cbbe10f1493b7 217562 libs optional libextractor-plugins_0.5.18a-1_i386.deb
226b47d8e1897e758f69d8ad0731eedb 24076 libdevel optional libextractor-dev_0.5.18a-1_i386.deb
99132e8034e0bee2295fa5864ac008b8 53382 utils optional extract_0.5.18a-1_i386.deb
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)
iD8DBQFGjN6V+C5cwEsrK54RAtwwAJ9HMVuL12K2Xa5kv+uO+osTd+aIQwCeJ+9i
PixoEk1aamhvNRCgpqFPEGk=
=wtGv
-----END PGP SIGNATURE-----
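The bug was closed automatically because the changelog entry in the upload contains "Closes: #430876". As a rough illustration of how such references can be picked out of changelog text, here is a Python 3 sketch with a pattern modelled on the one Debian Policy gives for changelog entries. The function name is invented, and the real archive software may differ in detail.

```python
import re

# Modelled on the "closes:" pattern from Debian Policy; matching is case-insensitive.
CLOSES_RE = re.compile(
    r"closes:\s*(?:bug)?\#?\s?\d+(?:,\s*(?:bug)?\#?\s?\d+)*",
    re.IGNORECASE,
)


def closed_bugs(changelog_text):
    """Return the bug numbers referenced by "Closes:" statements, in order."""
    bugs = []
    for match in CLOSES_RE.finditer(changelog_text):
        # Only digits inside a matched "Closes: ..." statement are bug numbers.
        bugs.extend(int(number) for number in re.findall(r"\d+", match.group(0)))
    return bugs
```

Applied to the Changes section of this upload, the function returns the single bug number 430876; the "01" in "01-include.dpatch" is ignored because it is not part of a "Closes:" statement.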
No longer marked as found in versions libextractor-python/1:0.5-2.
Request was from Andreas Beckmann <anbe@debian.org>
to control@bugs.debian.org.
(Sat, 02 Nov 2013 15:57:34 GMT)
Bug archived.
Request was from Debbugs Internal Request <owner@bugs.debian.org>
to internal_control@bugs.debian.org.
(Sun, 01 Dec 2013 07:41:47 GMT)
Debian bug tracking system administrator <owner@bugs.debian.org>.
Last modified:
Wed Oct 11 23:39:56 2017;