Debian Bug report logs - #441934
w3m: misinterprets &shy; by drawing hyphens everywhere

Package: w3m; Maintainer for w3m is Tatsuya Kinoshita <tats@debian.org>; Source for w3m is src:w3m.

Reported by: Helmut Grohne <helmut@subdivi.de>

Date: Tue, 11 Sep 2007 23:51:01 UTC

Severity: minor

Tags: patch

Found in version w3m/0.5.1-5.1

Fixed in version w3m/0.5.3-10

Done: Tatsuya Kinoshita <tats@debian.org>

Bug is archived. No further changes may be made.


Report forwarded to debian-bugs-dist@lists.debian.org, Fumitoshi UKAI <ukai@debian.or.jp>:
Bug#441934; Package w3m.


Acknowledgement sent to Helmut Grohne <helmut@subdivi.de>:
New Bug report received and forwarded. Copy sent to Fumitoshi UKAI <ukai@debian.or.jp>.


Message #5 received at submit@bugs.debian.org:

From: Helmut Grohne <helmut@subdivi.de>
To: Debian Bug Tracking System <submit@bugs.debian.org>
Subject: w3m: misinterprets &shy; by drawing hyphens everywhere
Date: Wed, 12 Sep 2007 01:48:21 +0200
Package: w3m
Version: 0.5.1-5.1+b1
Severity: normal

w3m seems to handle &shy; worse than the Gecko family. Rather than
ignoring &shy;, which would result in readable text, w3m expands every
&shy; to a hyphen. The correct behaviour would be to remove &shy; and
insert a hyphen only where a line break at that point improves text
flow.

Similarly, w3m expands &zwnj; to something that displays as a question
mark on my xterm. As w3m requires monospace fonts, character joining is a
non-issue, and &zwnj; and &zwj; should also expand to nothing.

One idea for a patch: modify conv_entity (entity.c:66) to return an
empty string for some special integer value (to be chosen), and then
modify entity.h to map these entities to that special value.
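
A minimal sketch of the idea described above, written as a standalone C
function rather than against the real w3m sources. It checks the code
points directly instead of introducing the special integer value suggested
above. The name sketch_conv_entity and the convention of returning NULL
for "not handled here" are assumptions made for illustration; only the
characters themselves (SOFT HYPHEN, ZWNJ, ZWJ) come from this report.

```c
#include <stddef.h>

/*
 * Hypothetical stand-in for an entity conversion hook; this is NOT
 * w3m's conv_entity(). It returns an empty string for characters that
 * should produce no output on a monospace terminal, and NULL when the
 * caller should fall back to its normal conversion.
 */
static const char *sketch_conv_entity(unsigned int c)
{
    switch (c) {
    case 0x00ad:            /* SOFT HYPHEN (&shy;) */
    case 0x200c:            /* ZERO WIDTH NON-JOINER (&zwnj;) */
    case 0x200d:            /* ZERO WIDTH JOINER (&zwj;) */
        return "";          /* expand to nothing */
    default:
        return NULL;        /* not handled by this sketch */
    }
}
```

A caller would test the result for NULL first and only then run its usual
Latin-1 or Unicode conversion. Note that this sketch never inserts a
hyphen at a line break; it only covers the "expand to nothing" part.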

Helmut

-- System Information:
Debian Release: lenny/sid
  APT prefers unstable
  APT policy: (500, 'unstable'), (1, 'experimental')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.20.1 (SMP w/2 CPU cores)
Locale: LANG=C, LC_CTYPE=de_DE (charmap=ISO-8859-1)
Shell: /bin/sh linked to /bin/dash

Versions of packages w3m depends on:
ii  libc6                   2.6.1-2          GNU C Library: Shared libraries
ii  libgc1c2                1:6.8-1.1        conservative garbage collector for
ii  libgpmg1                1.19.6-25        General Purpose Mouse - shared lib
ii  libncurses5             5.6+20070908-1   Shared libraries for terminal hand
ii  libssl0.9.8             0.9.8e-8         SSL shared libraries
ii  zlib1g                  1:1.2.3.3.dfsg-5 compression library - runtime

Versions of packages w3m recommends:
ii  ca-certificates               20070303   Common CA Certificates PEM files

-- no debconf information




Added tag(s) patch. Request was from Hilko Bengen <bengen@debian.org> to control@bugs.debian.org. (Tue, 01 Mar 2011 22:57:03 GMT)


Information forwarded to debian-bugs-dist@lists.debian.org, Tatsuya Kinoshita <tats@debian.org>:
Bug#441934; Package w3m. (Tue, 01 Mar 2011 23:24:03 GMT)


Acknowledgement sent to Hilko Bengen <bengen@debian.org>:
Extra info received and forwarded to list. Copy sent to Tatsuya Kinoshita <tats@debian.org>. (Tue, 01 Mar 2011 23:24:03 GMT)


Message #12 received at 441934@bugs.debian.org:

From: Hilko Bengen <bengen@debian.org>
To: 441934@bugs.debian.org
Cc: control@bugs.debian.org
Subject: Patch for &shy;
Date: Tue, 01 Mar 2011 23:54:46 +0100
[Message part 1 (text/plain, inline)]
tag 441934 +patch
thank you

The attached patch should take care of "&shy;". It does not work if the
SOFT HYPHEN character appears verbatim in a properly-advertised Unicode
document.
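
The limitation noted above (a SOFT HYPHEN that appears verbatim in a
Unicode document is not caught by an entity-level check) could in
principle be handled at the byte level. The following is only a sketch
under the assumption that the input is valid UTF-8, where U+00AD is
encoded as the bytes 0xC2 0xAD. The helper strip_soft_hyphens_utf8 is
hypothetical; it is not part of w3m or of the attached patch.

```c
#include <stddef.h>

/*
 * Hypothetical helper, not part of w3m: removes every UTF-8 encoded
 * U+00AD SOFT HYPHEN (bytes 0xC2 0xAD) from a NUL-terminated string,
 * in place, and returns the new length in bytes.
 * Assumes the input is valid UTF-8, so 0xC2 only ever appears as the
 * lead byte of a two-byte sequence.
 */
static size_t strip_soft_hyphens_utf8(char *s)
{
    const unsigned char *src = (const unsigned char *)s;
    unsigned char *dst = (unsigned char *)s;

    while (*src != '\0') {
        if (src[0] == 0xC2 && src[1] == 0xAD) {
            src += 2;       /* skip the two-byte soft hyphen */
            continue;
        }
        *dst++ = *src++;    /* copy every other byte unchanged */
    }
    *dst = '\0';
    return (size_t)(dst - (unsigned char *)s);
}
```

For example, a buffer holding "hy", a soft hyphen, and "phen" would be
reduced to "hyphen". Like the patch, this drops the character outright
and does not attempt hyphenation at line breaks.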

As of 0.5.3-2, the question-marks you reported for "&zwnj;" and "&zwj;"
appear when LC_CTYPE (or LANG) is set to "C". If I leave LC_CTYPE set to
en_US.utf-8, the proper characters are emitted, but neither xterm nor
urxvt render them.

-Hilko

[Message part 2 (text/x-diff, inline)]
diff --git a/entity.c b/entity.c
index fdd8f64..a7c9e99 100644
--- a/entity.c
+++ b/entity.c
@@ -44,6 +44,8 @@ conv_entity(unsigned int c)
 	return " ";
     if (c == 0xa0)
 	return NBSP;
+    if (c == 0xad)		/* SOFT HYPHEN */
+        return "";
     if (c < 0x100) {		/* Latin1 (ISO 8859-1) */
 	if (UseAltEntity)
 	    return alt_latin1[c - 0xa0];

Severity set to 'minor' from 'normal'. Request was from d+deb@vdr.jp to control@bugs.debian.org. (Tue, 15 Nov 2011 01:51:02 GMT)


Information forwarded to debian-bugs-dist@lists.debian.org:
Bug#441934; Package w3m. (Thu, 01 Aug 2013 21:45:04 GMT)


Acknowledgement sent to Tatsuya Kinoshita <tats@debian.org>:
Extra info received and forwarded to list. (Thu, 01 Aug 2013 21:45:04 GMT)


Message #19 received at 441934@bugs.debian.org:

From: Tatsuya Kinoshita <tats@debian.org>
To: bengen@debian.org, 441934@bugs.debian.org
Subject: Re: Bug#441934: Patch for &shy;
Date: Fri, 02 Aug 2013 06:39:32 +0900 (JST)
[Message part 1 (text/plain, inline)]
tags 441934 + pending
thanks

On March 1, 2011 at 11:54PM +0100, bengen (at debian.org) wrote:
> The attached patch should take care of "&shy;". It does not work if the SOFT
> HYPHEN character appears verbatim in a properly-advertised Unicode
> document.

Merged in the development repository.

  http://anonscm.debian.org/gitweb/?p=collab-maint/w3m.git;a=commitdiff;h=9cbe40c99ece3909e0192cd9f62dbf2ca4449802
  http://anonscm.debian.org/gitweb/?p=collab-maint/w3m.git;a=shortlog;h=refs/heads/debian

Thanks,
--
Tatsuya Kinoshita
[Message part 2 (application/pgp-signature, inline)]

Added tag(s) pending. Request was from Tatsuya Kinoshita <tats@debian.org> to control@bugs.debian.org. (Thu, 01 Aug 2013 21:45:07 GMT)


Reply sent to Tatsuya Kinoshita <tats@debian.org>:
You have taken responsibility. (Fri, 02 Aug 2013 15:51:11 GMT)


Notification sent to Helmut Grohne <helmut@subdivi.de>:
Bug acknowledged by developer. (Fri, 02 Aug 2013 15:51:11 GMT)


Message #26 received at 441934-close@bugs.debian.org:

From: Tatsuya Kinoshita <tats@debian.org>
To: 441934-close@bugs.debian.org
Subject: Bug#441934: fixed in w3m 0.5.3-10
Date: Fri, 02 Aug 2013 15:48:21 +0000
Source: w3m
Source-Version: 0.5.3-10

We believe that the bug you reported is fixed in the latest version of
w3m, which is due to be installed in the Debian FTP archive.

A summary of the changes between this version and the previous one is
attached.

Thank you for reporting the bug, which will now be closed.  If you
have further comments please address them to 441934@bugs.debian.org,
and the maintainer will reopen the bug report if appropriate.

Debian distribution maintenance software
pp.
Tatsuya Kinoshita <tats@debian.org> (supplier of updated w3m package)

(This message was generated automatically at their request; if you
believe that there is a problem with it please contact the archive
administrators by mailing ftpmaster@ftp-master.debian.org)


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

Format: 1.8
Date: Sat, 03 Aug 2013 00:07:00 +0900
Source: w3m
Binary: w3m w3m-img
Architecture: source amd64
Version: 0.5.3-10
Distribution: unstable
Urgency: low
Maintainer: Tatsuya Kinoshita <tats@debian.org>
Changed-By: Tatsuya Kinoshita <tats@debian.org>
Description: 
 w3m        - WWW browsable pager with excellent tables/frames support
 w3m-img    - inline image extension support utilities for w3m
Closes: 441934 647385 657666 696209
Changes: 
 w3m (0.5.3-10) unstable; urgency=low
 .
   * New patch 140_sort-dump-links.patch to sort anchors in -dump
     (closes: #657666)
   * New patch 150_contact-list.patch to update README (closes: #696209)
   * New patch 160_ignore-shy.patch to ignore SOFT HYPHEN (closes: #441934)
   * New patch 170_w3mman2html-utf8.patch (LP: #680202)
   * Update README.Debian to mention package development
   * Drop unneeded menu dependency (closes: #647385)
   * New patch 050_autotools-config.patch to update config.*
   * Drop autotools-dev from Build-Depends
   * Use cmigemo instead of migemo
   * Set show_cookie to 0 in debian/w3mconfig
   * Typo fix for debian/w3mconfig
   * Backup and restore entity.h on build time
   * Remove DM-Upload-Allowed
   * Update and correct debian/copyright
   * Update 015_debian-version.patch to 0.5.3+debian-10
Checksums-Sha1: 
 b6de5abd23c1a23d9272f6bf9e6259c0d92d4a87 1990 w3m_0.5.3-10.dsc
 46b9830c5da663c57eff01c0bc66099d111b08fd 76305 w3m_0.5.3-10.debian.tar.gz
 ec4078bc3cfec7abd194036bd902faeb38783afa 1264180 w3m_0.5.3-10_amd64.deb
 9ef7db35a907e425663723a478911692bd7c8110 114532 w3m-img_0.5.3-10_amd64.deb
Checksums-Sha256: 
 636b5e09b3d3a3e70e1080c377397a19f8b81245c1e5acf6b654d90a40a1c5f6 1990 w3m_0.5.3-10.dsc
 b393d866cf57c1ff81b0dc7d4516e37970923d8bf2e0d4b3e7fecc7683415875 76305 w3m_0.5.3-10.debian.tar.gz
 e634db3dd6ba8bdfea896d04b92d137a093bac15599dcfff4464b3713f39c54c 1264180 w3m_0.5.3-10_amd64.deb
 24b4bb958c94c3f079aa9122b031c5064b76b096033385830949b5ce2edecde4 114532 w3m-img_0.5.3-10_amd64.deb
Files: 
 d368bd610849ff772383d765a1dca66c 1990 web standard w3m_0.5.3-10.dsc
 0873ba9cfdaf0db434b0a089e4a1f0f7 76305 web standard w3m_0.5.3-10.debian.tar.gz
 648f25fb9d48f455d3ba9b289f37bebb 1264180 web standard w3m_0.5.3-10_amd64.deb
 80ae4dff722b333f678f7cc45819ec62 114532 web optional w3m-img_0.5.3-10_amd64.deb

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.14 (GNU/Linux)

iQIcBAEBCAAGBQJR+9J6AAoJEOXvq5AIDqY8ChIP/2twQ06C2bcgZRx7JS36b1EL
63ew4HF3A8dvgoDX9YYZHaEreRH21pYC5AHh9V39/Kk9oC91jG9qqeOiTsfCieOO
NZABil5bm457OnoP1ObUlB0Wza0ozwzmFagu74maLdhMMoq2cMwl2cR7epJpDHq6
fpuv+AWyS7wvL5j2XfVOiLn1L4Sfw4aTH+mDEz1H/XnBxDneODKVwf711huCRMbz
ba6y21f+aECV6n/hpvMjCr6uUauXbk05TAY6lFMbFWjkgWQEgqMbEM3MerwhfFGO
kRrH269PT+qu9dU8UStxI1/ISbzPat3V7tqrXY3vE+TFlFreyf9Iq1GyEZ2Lo3QH
PvxQfkTfC8sS7fPFqJUaRRSvwehP0QSG8JZOhlFKkf+AnHczCxVnyY3MVjjYWS66
b9/+ao8vfGkmIy9eJgPkA7j2yx1/pzACZYNflwaQStPTIIaHrJ/iJZzsMkJfufcv
h8tY+V/IQaVjr5GDXzsA1SrgKLKqm9Cal8brLVVQNaresIykU7Xxg6619ONJLYxh
cGKyQAyy98f2sOtw7CQgqvCi5C9OEhhRbd16sAPdoybdqNe1oahfiWNLo+CaTIRq
fAIdrXpyraepfs9nQ2ianwlCqgQmmLjuBQ80TbOkfS2GODscf8f6ID0BZ9dkj9+w
ciE8X7heAwuc7Ns6yJG5
=E8/Q
-----END PGP SIGNATURE-----




Bug archived. Request was from Debbugs Internal Request <owner@bugs.debian.org> to internal_control@bugs.debian.org. (Sun, 01 Sep 2013 07:28:43 GMT)



Debian bug tracking system administrator <owner@bugs.debian.org>. Last modified: Mon Jun 5 03:10:04 2023; Machine Name: buxtehude