Debian Bug report logs - #655528
awstats: Decoding URL in "Pages-URL" table

Package: awstats; Maintainer for awstats is Sergey B Kirpichev <skirpichev@gmail.com>; Source for awstats is src:awstats.

Reported by: Dmitry Katsubo <dma_k@mail.ru>

Date: Wed, 11 Jan 2012 23:39:01 UTC

Severity: wishlist

Tags: upstream

Found in version awstats/7.0~dfsg-2

Forwarded to http://sourceforge.net/tracker/?func=detail&aid=3473378&group_id=13764&atid=313764



Report forwarded to debian-bugs-dist@lists.debian.org, Sergey B Kirpichev <skirpichev@gmail.com>:
Bug#655528; Package awstats. (Wed, 11 Jan 2012 23:39:04 GMT)

Acknowledgement sent to Dmitry Katsubo <dma_k@mail.ru>:
New Bug report received and forwarded. Copy sent to Sergey B Kirpichev <skirpichev@gmail.com>. (Wed, 11 Jan 2012 23:39:04 GMT)

Message #5 received at submit@bugs.debian.org:

From: Dmitry Katsubo <dma_k@mail.ru>
To: submit@bugs.debian.org
Subject: awstats: Decoding URL in "Pages-URL" table
Date: Thu, 12 Jan 2012 00:28:52 +0100
Package: awstats
Version: 7.0~dfsg-2

Dear awstats maintainer,

I have noticed that awstats does not decode URLs containing, e.g.,
Cyrillic characters, and leaves them URL-encoded:

/w/%D1%81%D0%B5%D0%BC%D1%8C%D1%8F

It would be nice to see this link decoded:

/w/семья

I am attaching a simple patch that solves this problem.

-- 
With best regards,
Dmitry
[awstats.pl_urldecode.diff (text/plain, attachment)]
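
For illustration only, the decoding requested above can be sketched in a few lines of Python. This is not the attached patch (awstats itself is written in Perl); the function name decode_url_path is hypothetical, and the sketch assumes the percent-encoded bytes are UTF-8.

```python
from urllib.parse import unquote


def decode_url_path(raw: str) -> str:
    """Percent-decode a logged URL path, assuming the bytes are UTF-8."""
    # errors="strict" raises UnicodeDecodeError on bytes that are not
    # valid UTF-8 instead of silently substituting U+FFFD.
    return unquote(raw, encoding="utf-8", errors="strict")


print(decode_url_path("/w/%D1%81%D0%B5%D0%BC%D1%8C%D1%8F"))  # prints /w/семья
```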

Added tag(s) moreinfo. Request was from Sergey B Kirpichev <skirpichev@gmail.com> to control@bugs.debian.org. (Thu, 12 Jan 2012 18:51:08 GMT)

Message sent on to Dmitry Katsubo <dma_k@mail.ru>:
Bug#655528. (Thu, 12 Jan 2012 18:51:10 GMT)

Message #10 received at 655528-submitter@bugs.debian.org:

From: Sergey B Kirpichev <skirpichev@gmail.com>
To: 655528-submitter@bugs.debian.org
Cc: control@bugs.debian.org
Subject: Re: Bug#655528: awstats: Decoding URL in "Pages-URL" table
Date: Thu, 12 Jan 2012 22:49:30 +0400
tag 655528 +moreinfo
severity whishlist
thanks

On Thu, Jan 12, 2012 at 12:28:52AM +0100, Dmitry Katsubo wrote:
> I have noticed that awstats does not decode URLs containing, e.g.,
> Cyrillic characters, and leaves them URL-encoded:
> 
> /w/%D1%81%D0%B5%D0%BC%D1%8C%D1%8F
> 
> It would be nice to see this link decoded:
> 
> /w/семья
> 
> I am attaching a simple patch that solves this problem.

I'm not sure this is the correct approach at all.  You assume
that the URL-encoded string is in UTF-8, right?  Why?

Anyway, I'll lower the severity of this bug to wishlist.




Severity set to 'wishlist' from 'normal' Request was from Sergey B Kirpichev <skirpichev@gmail.com> to control@bugs.debian.org. (Thu, 12 Jan 2012 18:57:07 GMT)

Information forwarded to debian-bugs-dist@lists.debian.org, Sergey B Kirpichev <skirpichev@gmail.com>:
Bug#655528; Package awstats. (Thu, 12 Jan 2012 21:24:13 GMT)

Acknowledgement sent to Dmitry Katsubo <dma_k@mail.ru>:
Extra info received and forwarded to list. Copy sent to Sergey B Kirpichev <skirpichev@gmail.com>. (Thu, 12 Jan 2012 21:24:13 GMT)

Message #17 received at 655528@bugs.debian.org:

From: Dmitry Katsubo <dma_k@mail.ru>
To: 655528@bugs.debian.org
Subject: Re: Bug#655528: awstats: Decoding URL in "Pages-URL" table
Date: Thu, 12 Jan 2012 22:03:06 +0100
On 12.01.2012 19:49, Sergey B Kirpichev wrote:
> I'm not sure this is the correct approach at all.  You assume
> that the URL-encoded string is in UTF-8, right?  Why?

I am not an expert in the specifications myself; one should carefully
read RFC 1738 and RFC 3986. I found that HTML 4.0 recommends the use of
UTF-8 to encode the query string, and the same is mentioned here:
http://en.wikipedia.org/wiki/Query_string#URL_encoding

> Anyway, I'll lower the severity of this bug to wishlist.

Yes, wishlist is most appropriate. Hopefully somebody looking for
a solution will find this patch. Decoding of URLs was crucial for me.

-- 
With best regards,
Dmitry




Set Bug forwarded-to-address to 'http://sourceforge.net/tracker/?func=detail&aid=3473378&group_id=13764&atid=313764'. Request was from Sergey B Kirpichev <skirpichev@gmail.com> to control@bugs.debian.org. (Fri, 13 Jan 2012 09:51:36 GMT)

Removed tag(s) moreinfo. Request was from Sergey B Kirpichev <skirpichev@gmail.com> to control@bugs.debian.org. (Fri, 13 Jan 2012 09:51:41 GMT)

Message sent on to Dmitry Katsubo <dma_k@mail.ru>:
Bug#655528. (Fri, 13 Jan 2012 09:51:49 GMT)

Message #24 received at 655528-submitter@bugs.debian.org:

From: Sergey B Kirpichev <skirpichev@gmail.com>
To: 655528-submitter@bugs.debian.org
Cc: control@bugs.debian.org
Subject: Re: Bug#655528: awstats: Decoding URL in "Pages-URL" table
Date: Fri, 13 Jan 2012 13:47:24 +0400
forwarded 655528 http://sourceforge.net/tracker/?func=detail&aid=3473378&group_id=13764&atid=313764
tag 655528 -moreinfo
thanks

On Thu, Jan 12, 2012 at 10:03:06PM +0100, Dmitry Katsubo wrote:
> I am not an expert in the specifications myself; one should carefully
> read RFC 1738 and RFC 3986.

I found nothing there that prohibits using other charsets for
non-ASCII symbols.

> I found that HTML 4.0 recommends the use of UTF-8 to

The HTML standard is not an RFC for URIs.

> encode the query string and the same is mentioned here:
> http://en.wikipedia.org/wiki/Query_string#URL_encoding

Quote:
-->8--
All other characters are encoded as %FF hex representation with any non-ASCII characters first encoded as UTF-8 (or other specified encoding)
-->8--
I don't think this even recommends UTF-8.  It clearly does not prohibit other encodings.
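
The point about "other specified encoding" can be made concrete with a short Python sketch. It is only an illustration; cp1251 (Windows-1251) is picked here as an assumed example of a legacy Cyrillic charset and is not mentioned in the thread.

```python
from urllib.parse import quote, unquote

path = "/w/семья"

# The same path percent-encodes to different byte sequences
# depending on the charset applied before escaping.
as_utf8 = quote(path, encoding="utf-8")
as_cp1251 = quote(path, encoding="cp1251")

print(as_utf8)    # /w/%D1%81%D0%B5%D0%BC%D1%8C%D1%8F
print(as_cp1251)  # /w/%F1%E5%EC%FC%FF

# Decoding recovers the text only if the same charset is used again.
roundtrip = unquote(as_cp1251, encoding="cp1251")
print(roundtrip == path)  # True
```

A log analyzer sees only the escaped form, so it cannot tell from the URL alone which charset the client used.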

> Yes, wishlist is most appropriate. Hopefully somebody looking for
> a solution will find this patch. Decoding of URLs was crucial for me.

In the end, I think this patch should go upstream first.  It would
produce a lot of incompatibilities for old setups.

Anyway, thank you for your contribution!
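
One way to limit the compatibility concern raised above is to decode only when the bytes are valid UTF-8 and otherwise keep the logged string unchanged. The Python sketch below shows that idea under stated assumptions: the name decode_or_keep is hypothetical, and this is neither the attached patch nor anything adopted upstream.

```python
from urllib.parse import unquote_to_bytes


def decode_or_keep(raw: str) -> str:
    """Return the UTF-8 decoded path, or the raw string if it is not UTF-8."""
    try:
        # Strict decoding: any byte sequence that is not well-formed
        # UTF-8 raises UnicodeDecodeError.
        return unquote_to_bytes(raw).decode("utf-8", errors="strict")
    except UnicodeDecodeError:
        # Unknown or legacy charset: leave the URL-encoded form untouched.
        return raw


print(decode_or_keep("/w/%D1%81%D0%B5%D0%BC%D1%8C%D1%8F"))  # /w/семья
print(decode_or_keep("/w/%F1%E5%EC%FC%FF"))                 # /w/%F1%E5%EC%FC%FF
```

Byte sequences from a legacy charset can occasionally also be valid UTF-8, so this check is a heuristic rather than a guarantee.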




Added tag(s) upstream. Request was from Sergey B Kirpichev <skirpichev@gmail.com> to control@bugs.debian.org. (Mon, 23 Jan 2012 13:42:41 GMT)


Debian bug tracking system administrator <owner@bugs.debian.org>. Last modified: Sun Apr 20 08:29:46 2014; Machine Name: beach.debian.org

Debian Bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.