Debian Bug report logs - #3846
lynx misdisplays multiple <dt> in <dl>

version graph

Package: lynx; Maintainer for lynx is Atsuhito KOHDA <kohda@debian.org>; Source for lynx is src:lynx-cur.

Reported by: ian@chiark.chu.cam.ac.uk (Ian Jackson)

Date: Sat, 20 Jul 1996 20:41:31 UTC

Severity: normal

Found in version 2.4-FM-960316-1

Done: "H. Nanosecond" <aldomel@ix.netcom.com>

Bug is archived. No further changes may be made.

Forwarded to Lynx Development <lynx-dev@sig.net>

Toggle useless messages

View this report as an mbox folder, status mbox, maintainer mbox


Report forwarded to debian-devel@lists.debian.org, ian@chiark.chu.cam.ac.uk (Ian Jackson), Andrew Howell <andrew@it.com.au>:
Bug#3846; Package lynx. Full text and rfc822 format available.

Acknowledgement sent to ian@chiark.chu.cam.ac.uk (Ian Jackson):
New bug report received and forwarded. Copy sent to ian@chiark.chu.cam.ac.uk (Ian Jackson) Andrew Howell <andrew@it.com.au>. Full text and rfc822 format available.

Message #5 received at submit@bugs.debian.org (full text, mbox):

From: ian@chiark.chu.cam.ac.uk (Ian Jackson)
To: Debian bugs submission address <submit@bugs.debian.org>
Subject: lynx misdisplays multiple <dt> in <dl>
Date: Sat, 20 Jul 96 21:20 BST
Package: lynx
Version: 2.4-FM-960316-1

Below is a copy of an HTML file which is also available at
<URL:http://chiark.chu.cam.ac.uk/~ijackson/test-deflist.html>.

There is a bug in the parsing produced by lynx, as shown below.  The
line marked >>>>> should not appear.

(Interestingly Mosaic 2.7b4 and Netscape 2.01 each have a bug here
too, but their bugs are different from lynx's even though they're the
same as each other's (-:.)

Ian.

    > chiark:~> lynx -dump http://chiark.chu.cam.ac.uk/~ijackson/test-deflist.html   
    > 
    >                                      test
    >                                        
    >    keyword1
>>>>>           
    >    keyword2
    >           paragraph
    >           
    >           second paragraph
    >           
    >    keyphrase for third para
    >           third paragraph
    >           
    >                                test with compact
    >                                        
    >    keyword1
    >    keyword2
    >           paragraph
    >           
    >           second paragraph
    >    keyphrase for third para
    >           third paragraph
    > chiark:~> 

<html><head>
<title>Test of definition lists</title>
</head><body>
<h1>test</h1>

<dl>
<dt>keyword1
<dt>keyword2
<dd>
paragraph
<p>
second paragraph

<dt>keyphrase for third para
<dd>
third paragraph

</dl>

<h1>test with compact</h1>

<dl compact>
<dt>keyword1
<dt>keyword2
<dd>
paragraph
<p>
second paragraph

<dt>keyphrase for third para
<dd>
third paragraph

</dl>

</body></html>


Acknowledgement sent to Christian Hudon <chrish@libertel.montreal.qc.ca>:
Extra info received and filed, but not forwarded. Full text and rfc822 format available.

Message #8 received at 3846-quiet@bugs.debian.org (full text, mbox):

From: Christian Hudon <chrish@libertel.montreal.qc.ca>
To: Ian Jackson <ian@chiark.chu.cam.ac.uk>
Cc: 3846-quiet@bugs.debian.org
Subject: Question about 'lynx misdisplays multiple <dt> in <dl>' bug
Date: Sat, 28 Sep 1996 22:53:31 -0700 (PDT)
Greetings Ian...

I'm the new maintainer for lynx and I'm currently cleaning up its bug
reports...

Could you please argue your case a bit/clue me in about the "lynx
misdisplays multiple <dt> in <dl>" bug you filed? Why can't it just be a
'cosmetic' choice from the lynx developers? (i.e. "When there's no
'compact', we'll separate the definition terms by a blank line.")

(Yes, lynx 2.6 still puts a newline there.)

  Christian




Information forwarded to Christian Hudon <chrish@debian.org>:
Bug#3846; Package lynx. Full text and rfc822 format available.

Acknowledgement sent to remco@blaakmeer.student.utwente.nl (Remco Blaakmeer):
Extra info received and forwarded to maintainer. Copy sent to Christian Hudon <chrish@debian.org>. Full text and rfc822 format available.

Message #13 received at 3846-maintonly@bugs.debian.org (full text, mbox):

From: remco@blaakmeer.student.utwente.nl (Remco Blaakmeer)
To: 3846-maintonly@bugs.debian.org
Subject: Old bugs need to be looked at
Date: Sun, 1 Feb 1998 12:00:36 +0100 (CET)
This is an automated message sent to all bugs older than one year.

This bug is very old. Please take a look at it and see if you can fix it.
If it has already been fixed, please close it.

If you have problems fixing it or if you don't have the time to fix it,
please ask the people on debian-devel@lists.debian.org for help, so that
at least the oldest bugs can be solved before Debian 2.0 is released.

Remco Blaakmeer


Information forwarded to debian-bugs-dist@lists.debian.org, Christian Hudon <chrish@debian.org>:
Bug#3846; Package lynx. Full text and rfc822 format available.

Acknowledgement sent to Antti-Juhani Kaijanaho <gaia@iki.fi>:
Extra info received and forwarded to list. Copy sent to Christian Hudon <chrish@debian.org>. Full text and rfc822 format available.

Message #18 received at 3846@bugs.debian.org (full text, mbox):

From: Antti-Juhani Kaijanaho <gaia@iki.fi>
To: debian-qa@lists.debian.org
Cc: 3846@bugs.debian.org
Subject: The old lynx bug (was: Re: Bugs older than two years)
Date: Tue, 30 Mar 1999 21:19:03 +0300
On Wed, Mar 31, 1999 at 12:14:42AM +1000, Anthony Towns wrote:
>    Package: lynx
>    Maintainer: Christian Hudon <chrish@debian.org>
>    3846 lynx misdisplays multiple <dt> in <dl>
>    [HELP] Is this really a bug?

RFC 1866 (HTML 2.0) says:

   The content of a <DL> element is a sequence of <DT> elements and/or
   <DD> elements, usually in pairs. Multiple <DT> may be paired with a
   single <DD> element. Documents should not contain multiple
   consecutive <DD> elements.

   [...]

   Unless the COMPACT attribute is present, an HTML user agent may leave
   white space between successive DT, DD pairs. The COMPACT attribute
   may also reduce the width of the left-hand (DT) column.

Thus, it looks like a bug, since (by natural implication) space may be
left only between successive DT DD pairs, and multiple DT's are allowed
for a DD.  The newer HTML specifications do not address this.

This should be forwarded upstream.

-- 
%%% Antti-Juhani Kaijanaho % gaia@iki.fi % http://www.iki.fi/gaia/ %%%

                      Good Times are back again!
                  http://www.iki.fi/gaia/zangelding/


Reply sent to Martin Schulze <joey@infodrom.north.de>:
You have marked bug as forwarded. Full text and rfc822 format available.

Message #21 received at 3846-forwarded@bugs.debian.org (full text, mbox):

From: Martin Schulze <joey@finlandia.Infodrom.North.DE>
To: Lynx Development <lynx-dev@sig.net>
Cc: 3846-forwarded@bugs.debian.org, debian-qa@lists.debian.org
Subject: Re: The old lynx bug (was: Re: Bugs older than two years)
Date: Fri, 2 Apr 1999 13:05:08 +0200
Dear Lynx developers,

we have received the follwoing report through the bug tracking system
of Debian at http://www.debian.org/Bugs/.

The bug is against version 2.4-FM-960316-1 but still present.
The entire report can be viewed at http://www.infodrom.north.de/Debian/Bugs/db/38/3846.html

Caveat: The URL as given in the bug report is not available anymore.

Please keep 3846@bugs.debian.org in your replies in order
to record it in that bug tracking system.

There is a bug in the parsing produced by lynx, as shown below.  The
line marked >>>>> should not appear.

(Interestingly Mosaic 2.7b4 and Netscape 2.01 each have a bug here
too, but their bugs are different from lynx's even though they're the
same as each other's (-:.)

Ian.

    > chiark:~> lynx -dump http://chiark.chu.cam.ac.uk/~ijackson/test-deflist.html
    >
    >                                      test
    >
    >    keyword1
>>>>>
    >    keyword2
    >           paragraph
    >
    >           second paragraph
    >
    >    keyphrase for third para
    >           third paragraph
    >
    >                                test with compact
    >
    >    keyword1
    >    keyword2
    >           paragraph
    >
    >           second paragraph
    >    keyphrase for third para
    >           third paragraph
    > chiark:~>

<html><head>
<title>Test of definition lists</title>
</head><body>
<h1>test</h1>

<dl>
<dt>keyword1
<dt>keyword2
<dd>
paragraph
<p>
second paragraph

<dt>keyphrase for third para
<dd>
third paragraph

</dl>

<h1>test with compact</h1>

<dl compact>
<dt>keyword1
<dt>keyword2
<dd>
paragraph
<p>
second paragraph

<dt>keyphrase for third para
<dd>
third paragraph

</dl>

</body></html>


Antti-Juhani Kaijanaho wrote:
> On Wed, Mar 31, 1999 at 12:14:42AM +1000, Anthony Towns wrote:
> >    Package: lynx
> >    Maintainer: Christian Hudon <chrish@debian.org>
> >    3846 lynx misdisplays multiple <dt> in <dl>
> >    [HELP] Is this really a bug?
> 
> RFC 1866 (HTML 2.0) says:
> 
>    The content of a <DL> element is a sequence of <DT> elements and/or
>    <DD> elements, usually in pairs. Multiple <DT> may be paired with a
>    single <DD> element. Documents should not contain multiple
>    consecutive <DD> elements.
> 
>    [...]
> 
>    Unless the COMPACT attribute is present, an HTML user agent may leave
>    white space between successive DT, DD pairs. The COMPACT attribute
>    may also reduce the width of the left-hand (DT) column.
> 
> Thus, it looks like a bug, since (by natural implication) space may be
> left only between successive DT DD pairs, and multiple DT's are allowed
> for a DD.  The newer HTML specifications do not address this.
> 
> This should be forwarded upstream.

Which I've done now.

Regards,

	Joey

-- 
Whenever you meet yourself you're in a time loop or in front of a mirror.

Please always Cc to me when replying to me on the lists.


Information forwarded to debian-bugs-dist@lists.debian.org, Christian Hudon <chrish@debian.org>:
Bug#3846; Package lynx. Full text and rfc822 format available.

Acknowledgement sent to purslow@chass.utoronto.ca:
Extra info received and forwarded to list. Copy sent to Christian Hudon <chrish@debian.org>. Full text and rfc822 format available.

Message #26 received at 3846@bugs.debian.org (full text, mbox):

From: Philip Webb <purslow@chass.utoronto.ca>
To: lynx-dev@sig.net
Cc: 3846@bugs.debian.org
Subject: Re: lynx-dev Re: The old lynx bug
Date: Fri, 2 Apr 1999 10:54:19 -0500 (EST)
990402 Martin Schultz outlined a "bug" in Lynx 2.4-FM-960316-1 ,
involving HTML 2.0 specs for definition lists:
 
others among the international team of volunteers who maintain Lynx
can offer more expert advice about HTML, but otherwise:

first in practice, only the latest release/development versions are supported:
you can get 2-8-1 from  www.slcc.edu/lynx/release
& 2-8-2dev.21 from  sol.slcc.edu/lynx/current ;

second, there is no mention of the requirement in HTML 4.0 (as you say),
nor is it clear what the rather brief account in the RFC for 2.0 means,
but it does seem at least better aesthetically
not to insert a blank line between consecutive <DT><DT>'s ;

third, the described behaviour does indeed persist with 2-8-2dev.19 .

-- 
========================,,============================================
SUPPORT     ___________//___,  Philip Webb : purslow@chass.utoronto.ca
ELECTRIC   /] [] [] [] [] []|  Centre for Urban & Community Studies
TRANSIT    `-O----------O---'  University of Toronto


Information forwarded to debian-bugs-dist@lists.debian.org, Christian Hudon <chrish@debian.org>:
Bug#3846; Package lynx. Full text and rfc822 format available.

Acknowledgement sent to lvirden@cas.org (Larry W. Virden):
Extra info received and forwarded to list. Copy sent to Christian Hudon <chrish@debian.org>. Full text and rfc822 format available.

Message #31 received at 3846@bugs.debian.org (full text, mbox):

From: lvirden@cas.org (Larry W. Virden)
To: lynx-dev@sig.net
Cc: 3846@bugs.debian.org
Subject: Re: lynx-dev Re: The old lynx bug
Date: Fri, 2 Apr 1999 13:04:20 -0500 (EST)
From: Philip Webb <purslow@chass.utoronto.ca>

>third, the described behaviour does indeed persist with 2-8-2dev.19 .

and in fact in 2-8-2dev.21 ...
-- 
Larry W. Virden                 <URL: mailto:lvirden@cas.org>
<URL: http://www.purl.org/NET/lvirden/> <*> O- "No one is what he seems."
Unless explicitly stated to the contrary, nothing in this posting should 
be construed as representing my employer's opinions.


Information forwarded to debian-bugs-dist@lists.debian.org, Christian Hudon <chrish@debian.org>:
Bug#3846; Package lynx. Full text and rfc822 format available.

Acknowledgement sent to lynx-dev@sig.net:
Extra info received and forwarded to list. Copy sent to Christian Hudon <chrish@debian.org>. Full text and rfc822 format available.

Message #36 received at 3846@bugs.debian.org (full text, mbox):

From: Klaus Weide <kweide@tezcat.com>
To: Martin Schulze <joey@finlandia.Infodrom.North.DE>
Cc: Lynx Development <lynx-dev@sig.net>, 3846@bugs.debian.org
Subject: dev.22 patch 1 - old Debian lynx bug #3846
Date: Sat, 17 Apr 1999 05:29:54 -0500 (CDT)
On Fri, 2 Apr 1999, Martin Schulze wrote:
> we have received the follwoing report through the bug tracking system
> of Debian at http://www.debian.org/Bugs/.
> 
> The bug is against version 2.4-FM-960316-1 but still present.
> The entire report can be viewed at http://www.infodrom.north.de/Debian/Bugs/db/38/3846.html
> 
> Caveat: The URL as given in the bug report is not available anymore.
> 
> Please keep 3846@bugs.debian.org in your replies in order
> to record it in that bug tracking system.
[.....]
> Antti-Juhani Kaijanaho wrote:
> > 
> > RFC 1866 (HTML 2.0) says:
> > 
> >    The content of a <DL> element is a sequence of <DT> elements and/or
> >    <DD> elements, usually in pairs. Multiple <DT> may be paired with a
> >    single <DD> element. Documents should not contain multiple
> >    consecutive <DD> elements.
> > 
> >    [...]
> > 
> >    Unless the COMPACT attribute is present, an HTML user agent may leave
> >    white space between successive DT, DD pairs. The COMPACT attribute
> >    may also reduce the width of the left-hand (DT) column.
> > 
> > Thus, it looks like a bug, since (by natural implication) space may be
> > left only between successive DT DD pairs, and multiple DT's are allowed
> > for a DD.  The newer HTML specifications do not address this.

It may be questionable whether this is a bug, but I agree that the
requested behavior makes more sense.  Below is a patch against current
2.8.2dev.22.  Not checked whether it applies cleanly against 2.8.1,
but if it doesn't the only reason would be insignificant formatting
changes, it should be trivial to fix by hand.

  Klaus

* Tweak in HTML_start_element case HTML_DT: prevent generation of empty
  line between multiple simple DT elements without intervening DD elements
  (Debian bug #3846).

--- lynx2-8-2.old/src/GridText.c	Tue Apr 13 04:39:16 1999
+++ lynx2-8-2/src/GridText.c	Sat Apr 17 05:00:48 1999
@@ -6473,6 +6473,15 @@
     return;
 }
 
+PUBLIC BOOL HText_inLineOne ARGS1(
+	HText *,	text)
+{
+    if (text) {
+	return text->in_line_1;
+    }
+    return YES;
+}
+
 /*
  *  This function is for removing the first of two
  *  successive blank lines.  It should be called after
--- lynx2-8-2.old/src/GridText.h	Tue Apr 13 04:39:16 1999
+++ lynx2-8-2/src/GridText.h	Sat Apr 17 05:02:19 1999
@@ -186,6 +186,7 @@
 extern int HText_LastLineSize PARAMS((HText *me, BOOL IgnoreSpaces));
 extern int HText_PreviousLineSize PARAMS((HText *me, BOOL IgnoreSpaces));
 extern void HText_NegateLineOne PARAMS((HText *text));
+extern BOOL HText_inLineOne PARAMS((HText *text));
 extern void HText_RemovePreviousLine PARAMS((HText *text));
 extern int HText_getCurrentColumn PARAMS((HText *text));
 extern int HText_getMaximumColumn PARAMS((HText *text));
--- lynx2-8-2.old/src/HTML.c	Tue Apr 13 04:39:16 1999
+++ lynx2-8-2/src/HTML.c	Sat Apr 17 04:56:35 1999
@@ -2281,7 +2281,23 @@
     case HTML_DT:
 	CHECK_ID(HTML_GEN_ID);
 	if (!me->style_change) {
+	    BOOL in_line_1 = HText_inLineOne(me->text);
+	    HTCoord saved_spaceBefore = me->sp->style->spaceBefore;
+	    HTCoord saved_spaceAfter = me->sp->style->spaceAfter;
+	    /*
+	     *  If there are several DT elements and this is not the first,
+	     *  and the preceding DT element's first (and normally only) line
+	     *  has not yet been ended, suppress intervening blank line by
+	     *  temporarily modifying the paragraph style in place.  Ugly
+	     *  but there's ample precedence. - kw
+	     */
+	    if (in_line_1) {
+		me->sp->style->spaceBefore = 0;	/* temporary change */
+		me->sp->style->spaceAfter = 0;	/* temporary change */
+	    }
 	    HText_appendParagraph(me->text);
+	    me->sp->style->spaceBefore = saved_spaceBefore; /* undo */
+	    me->sp->style->spaceAfter = saved_spaceAfter; /* undo */
 	    me->in_word = NO;
 	    me->sp->style->alignment = HT_LEFT;
 	}



Reply sent to "H. Nanosecond" <aldomel@ix.netcom.com>:
You have taken responsibility. Full text and rfc822 format available.

Notification sent to ian@chiark.chu.cam.ac.uk (Ian Jackson):
Bug acknowledged by developer. Full text and rfc822 format available.

Message #41 received at 3846-done@bugs.debian.org (full text, mbox):

From: "H. Nanosecond" <aldomel@ix.netcom.com>
To: 3846-done@bugs.debian.org, Lynx Development <lynx-dev@sig.net>
Subject: Re: lynx misdisplays multiple <dt> in <dl>
Date: Fri, 7 Jul 2000 16:44:04 -0400 (EDT)
Here is the rendering with lynx 2.8.3rel.1

[fido debian]$ lynx -dump 3846-t.html  | cat --number
     1
     2                                       test
     3
     4     keyword1
     5     keyword2
     6            paragraph
     7
     8            second paragraph
     9
    10     keyphrase for third para
    11            third paragraph
    12
    13                                 test with compact
    14
    15     keyword1
    16     keyword2
    17            paragraph
    18            second paragraph
    19     keyphrase for third para
    20            third paragraph





Send a report that this bug log contains spam.


Debian bug tracking system administrator <owner@bugs.debian.org>. Last modified: Thu Apr 17 07:44:53 2014; Machine Name: buxtehude.debian.org

Debian Bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.