Debian Bug report logs - #518178
kakasi: Doesn't know ティ (ti) katakana combination

Package: kakasi; Maintainer for kakasi is Natural Language Processing (Japanese) <team+pkg-nlp-ja@tracker.debian.org>; Source for kakasi is src:kakasi.

Reported by: Christian Ohm <chr.ohm@gmx.net>

Date: Wed, 4 Mar 2009 16:24:01 UTC

Severity: important

Found in versions 2.3.5-1, kakasi/2.3.4-3.1


Report forwarded to debian-bugs-dist@lists.debian.org, Keita Maehara <maehara@debian.org>:
Bug#518178; Package kakasi. (Wed, 04 Mar 2009 16:24:03 GMT).


Acknowledgement sent to Christian Ohm <chr.ohm@gmx.net>:
New Bug report received and forwarded. Copy sent to Keita Maehara <maehara@debian.org>. (Wed, 04 Mar 2009 16:24:04 GMT).


Message #5 received at submit@bugs.debian.org:

From: Christian Ohm <chr.ohm@gmx.net>
To: Debian Bug Tracking System <submit@bugs.debian.org>
Subject: kakasi: Doesn't know ティ (ti) katakana combination
Date: Wed, 4 Mar 2009 17:22:44 +0100
Package: kakasi
Version: 2.3.4-3.1
Severity: important

Hello,

As the subject says, kakasi doesn't know the ティ katakana combination:

$ echo ティ | iconv -f utf8 -t eucjp | kakasi -Ka
tei

though it knows more uncommon ones like フェ (fe) or ヴァ (va).

Best regards,
Christian Ohm

-- System Information:
Debian Release: 5.0
  APT prefers unstable
  APT policy: (900, 'unstable'), (800, 'experimental'), (500, 'testing')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.28.5 (SMP w/2 CPU cores; PREEMPT)
Locale: LANG=en_GB.UTF-8, LC_CTYPE=en_GB.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages kakasi depends on:
ii  kakasi-dic                    2.3.4-3.1  KAKASI dictionary files
ii  libc6                         2.7-18     GNU C Library: Shared libraries

kakasi recommends no packages.

kakasi suggests no packages.

-- no debconf information




Marked as found in versions 2.3.5-1. Request was from Osamu Aoki <osamu@debian.org> to control@bugs.debian.org. (Sat, 01 Feb 2014 13:15:12 GMT).


Information forwarded to debian-bugs-dist@lists.debian.org, "Natural Language Processing, Japanese" <pkg-nlp-ja-devel@lists.alioth.debian.org>:
Bug#518178; Package kakasi. (Thu, 09 Feb 2017 15:42:05 GMT).


Acknowledgement sent to Osamu Aoki <rx788473@tc4.so-net.ne.jp>:
Extra info received and forwarded to list. Copy sent to "Natural Language Processing, Japanese" <pkg-nlp-ja-devel@lists.alioth.debian.org>. (Thu, 09 Feb 2017 15:42:05 GMT).


Message #12 received at 518178@bugs.debian.org:

From: Osamu Aoki <rx788473@tc4.so-net.ne.jp>
To: 518178@bugs.debian.org
Cc: Christian Ohm <chr.ohm@gmx.net>
Subject: kakasi: Doesn't know ティ (ti) katakana combination
Date: Fri, 10 Feb 2017 00:26:02 +0900

Hi,

Maybe the request should be more like:

Please include an explicit kana <-> romanization conversion table as
part of this package and implement features accordingly.

Currently, kakasi supports two romanization schemes.

 * Kunrei             --- Education ministry supported style
                          Taught in elementary school but rarely used
                          in public.  Some prefer this as a way to input
                          Japanese since it is more concise.
                          "ち" = "ti".
 * Hepburn (or Hebon) --- Foreign ministry supported style
                          Used on public road signs and for names in
                          passports.  It requires extra keys to input
                          some phonemes.
                          "ち" = "chi".

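To make the difference between the two schemes concrete, here is a minimal sketch in Python. The tables are a tiny illustrative subset covering only kana where the schemes disagree; they are not kakasi's actual conversion tables, and the function name `romanize` is made up for this example.

```python
# Illustrative subset only; not kakasi's real conversion tables.
KUNREI = {"し": "si", "ち": "ti", "つ": "tu", "ふ": "hu"}
HEPBURN = {"し": "shi", "ち": "chi", "つ": "tsu", "ふ": "fu"}


def romanize(text, table):
    """Romanize kana one character at a time; unknown characters pass through."""
    return "".join(table.get(ch, ch) for ch in text)


print(romanize("ち", KUNREI))   # ti
print(romanize("ち", HEPBURN))  # chi
```

Because Kunrei already spends "ti" on "ち", that spelling is only free for something else under Hepburn.
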
As for "ティ":

 * Kunrei             --- No official way
                          "ti" is already used for "チ" or "ち"
 * Hepburn (or Hebon) --- "ti" does not conflict, but is not part of
                          the original Hepburn system
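
One plausible reading of the reported "tei" output is that the converter finds no entry for the two-kana combination and falls back to converting テ and ィ separately. The sketch below illustrates that idea with a longest-match lookup in Hepburn style. It is an assumption for illustration, not kakasi's actual code; the names `SINGLE`, `DIGRAPHS` and `romanize` are hypothetical.

```python
# Hypothetical tables for illustration; kakasi's real tables and lookup differ.
SINGLE = {"テ": "te", "ィ": "i", "チ": "chi", "フ": "fu", "ェ": "e"}
DIGRAPHS = {"フェ": "fe", "ヴァ": "va"}  # combinations the report says are handled


def romanize(text, digraphs):
    """Prefer a two-kana combination; otherwise convert one kana at a time."""
    out = []
    i = 0
    while i < len(text):
        pair = text[i:i + 2]
        if pair in digraphs:
            out.append(digraphs[pair])
            i += 2
        else:
            out.append(SINGLE.get(text[i], text[i]))
            i += 1
    return "".join(out)


print(romanize("ティ", DIGRAPHS))                    # tei
print(romanize("ティ", {**DIGRAPHS, "ティ": "ti"}))  # ti
```

In this Hepburn-style table "チ" maps to "chi", so adding "ティ" = "ti" introduces no clash. A Kunrei table would have no such free spelling.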

Then you may wonder how Japanese speakers input Japanese text via romaji input.
Inputting "ti" yields "チ" or "ち".

There are de facto (Ichitaro/VJE/MS IME/...) ways to input it.
"thi", "teli", and "texi" usually work. ("x" and "l" force the next character to
be a small one.) I usually use "thi" since it's the shortest one.
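
The input conventions above can be sketched as a small romaji-to-hiragana lookup. This is a hypothetical fragment modelled on the behaviour described in this message, not the table of any particular IME; `ROMAJI` and `to_kana` are names invented for the example.

```python
# Hypothetical fragment; real IME tables are far larger.
ROMAJI = {
    "thi": "てぃ",
    "ti": "ち",
    "te": "て",
    "li": "ぃ",  # "l" prefix forces the small kana
    "xi": "ぃ",  # "x" prefix does the same
    "i": "い",
}


def to_kana(romaji):
    """Convert romaji to hiragana, trying the longest known chunk first."""
    out = []
    i = 0
    while i < len(romaji):
        for length in (3, 2, 1):
            chunk = romaji[i:i + length]
            if chunk in ROMAJI:
                out.append(ROMAJI[chunk])
                i += len(chunk)
                break
        else:
            # No entry matched: keep the character unchanged.
            out.append(romaji[i])
            i += 1
    return "".join(out)


for keys in ("ti", "thi", "teli", "texi"):
    print(keys, "->", to_kana(keys))
```

With this table "ti" gives ち, while "thi", "teli" and "texi" all give てぃ.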

Tested via ibus/anthy: てぃ てぃ てぃ. It works.

Yes, the lack of "ティ" support lies in the romanization schemes themselves.

This sound only appears in borrowed foreign words.  In romanized text, we
can use the original word.  So I spell "Yokohama City" ;-)

Anyway, supporting these oddball newer kana spellings may be an interesting
extension: ぁぃぅぇぉヵヶゎ

I think adding -l and -x options to enable "teli"- and "texi"-like
output is one idea.  But this behaviour needs to be documented.
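
As a rough sketch of what such output could look like, the Python fragment below prefixes each small kana with a chosen letter. It only illustrates the proposal: the `small_prefix` parameter, the tables and the function name are hypothetical and do not correspond to any existing kakasi option or code.

```python
# Hypothetical illustration of the proposed "teli"/"texi" style output.
SMALL = {"ァ": "a", "ィ": "i", "ゥ": "u", "ェ": "e", "ォ": "o"}
FULL = {"テ": "te", "チ": "chi"}  # tiny Hepburn-style subset


def romanize(text, small_prefix="x"):
    """Romanize katakana, marking small kana with the given prefix letter."""
    out = []
    for ch in text:
        if ch in SMALL:
            out.append(small_prefix + SMALL[ch])
        else:
            out.append(FULL.get(ch, ch))
    return "".join(out)


print(romanize("ティ", small_prefix="l"))  # teli
print(romanize("ティ", small_prefix="x"))  # texi
```

Output of this form matches the "teli"/"texi" input convention described earlier, so it could be typed back into an IME to recover the same kana.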

Osamu





Debian bug tracking system administrator <owner@bugs.debian.org>. Last modified: Sun Jul 2 10:43:08 2023; Machine Name: bembo
