TC 繁 phrase input & TC 繁 as default in en_US.UTF-8 locale?

9 messages Options
Embed this post
Permalink
Jon Babcock

TC 繁 phrase input & TC 繁 as default in en_US.UTF-8 locale?

Reply Threaded More More options
Print post
Permalink
Two questions:

1, Are there plans to extend SCIM's phrase feature to TC?

2. Can TC be set as default even when using a non-TC locale, such as
en_US.UTF-8?

My locale is en_US.UTF-8 so scim-pinyin selects '中' by default. IOW, both
simplified (SC) & traditional (TC) forms of the Chinese script are given in
the candidates list.

When I input a phrase using SC, I get reasonable results. When I input a
phrase using TC, the results are not useful.

For example, in  '中', when I input kanbao, I get 看包  and 看报. If I
switch
SCIM to '繁', when I input kanbao I only get 看包 . I want to get 看報.

These two issues was mentioned on scim-user in 2005. Any progress?

SCIM is a great contribution to open source computing. Thanks!

Jon Babcock


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Scim-user mailing list
[hidden email]
https://lists.sourceforge.net/lists/listinfo/scim-user
ePierre

Re: TC 繁 phrase input & TC 繁 as default in en_US.UTF-8 locale?

Reply Threaded More More options
Print post
Permalink
2007/2/27, Jon Babcock <[hidden email]>:
> When I input a phrase using SC, I get reasonable results. When I input a
> phrase using TC, the results are not useful.
>
> For example, in  '中', when I input kanbao, I get 看包  and 看报. If I
> switch
> SCIM to '繁', when I input kanbao I only get 看包 . I want to get 看報.
>
> These two issues was mentioned on scim-user in 2005. Any progress?

I'm very interested in this information, too, since I'd like to write
using TC, but with the smart pinyin method...

Hope it will be included in a next release!


--
Pierre
http://pierre.equoy.free.fr/
-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Scim-user mailing list
[hidden email]
https://lists.sourceforge.net/lists/listinfo/scim-user
liucougar

Re: TC 繁 phrase input & TC 繁 as default in en_US.UTF-8 locale?

Reply Threaded More More options
Print post
Permalink
there is a filter feature introduced in scim 1.4. one of the builtin
filter is to convert output chinese between simplified style and
tranditional one, give it a try

On 2/28/07, Pierre Equoy <[hidden email]> wrote:

> 2007/2/27, Jon Babcock <[hidden email]>:
> > When I input a phrase using SC, I get reasonable results. When I input a
> > phrase using TC, the results are not useful.
> >
> > For example, in  '中', when I input kanbao, I get 看包  and 看报. If I
> > switch
> > SCIM to '繁', when I input kanbao I only get 看包 . I want to get 看報.
> >
> > These two issues was mentioned on scim-user in 2005. Any progress?
>
> I'm very interested in this information, too, since I'd like to write
> using TC, but with the smart pinyin method...
>
> Hope it will be included in a next release!
>
>
> --
> Pierre
> http://pierre.equoy.free.fr/
> -------------------------------------------------------------------------
> Take Surveys. Earn Cash. Influence the Future of IT
> Join SourceForge.net's Techsay panel and you'll get the chance to share your
> opinions on IT & business topics through brief surveys-and earn cash
> http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
> _______________________________________________
> Scim-user mailing list
> [hidden email]
> https://lists.sourceforge.net/lists/listinfo/scim-user
>


--
http://www.liucougar.net
生于忧患,死于安乐
"People's characters are strengthened through struggle against
difficulties; they are weakened by comfort."
- Old Chinese adage
-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Scim-user mailing list
[hidden email]
https://lists.sourceforge.net/lists/listinfo/scim-user
ePierre

Re: TC 繁 phrase input & TC 繁 as default in en_US.UTF-8 locale?

Reply Threaded More More options
Print post
Permalink
Hello!

2007/2/28, LiuCougar <[hidden email]>:
> there is a filter feature introduced in scim 1.4. one of the builtin
> filter is to convert output chinese between simplified style and
> tranditional one, give it a try

Err, I searched on the website, but there is no explanation for this
filter feature. Could you explain it a bit deeper? How to access it,
how to use the SC -> TC filter you've mentioned, etc. ?

Thanks in advance!

--
Pierre
http://pierre.equoy.free.fr/

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Scim-user mailing list
[hidden email]
https://lists.sourceforge.net/lists/listinfo/scim-user
David Oftedal

Re: TC 繁 phrase input & TC 繁 as default in en_US.UTF-8 locale?

Reply Threaded More More options
Print post
Permalink
In reply to this post by ePierre
I have to chime in here. It's very nice that smart pinyin can output traditional characters, but it's still a bit incomplete as long as it lacks dictionary files in traditional Chinese. Seeing as how simplified characters can sometimes map to several traditional characters (and vice versa, even), it doesn't seem enough to just use a filter either. Perhaps we could "borrow" the dictionary from some traditional Chinese input method and convert it to use with Smart Pinyin?


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Scim-user mailing list
[hidden email]
https://lists.sourceforge.net/lists/listinfo/scim-user
Zhe Su

Re: TC 繁 phrase input & TC 繁 as default in en_US.UTF-8 locale?

Reply Threaded More More options
Print post
Permalink
Hi,
  I had a plan to develop a new version of smart pinyin input method which has both TC and SC dictionary, and I already finished part of the source code. But recently I'm too busy to have time on this project. Hope I can restart developing it in near future.

Regards
James Su

On 2/28/07, David Oftedal <[hidden email]> wrote:
I have to chime in here. It's very nice that smart pinyin can output traditional characters, but it's still a bit incomplete as long as it lacks dictionary files in traditional Chinese. Seeing as how simplified characters can sometimes map to several traditional characters (and vice versa, even), it doesn't seem enough to just use a filter either. Perhaps we could "borrow" the dictionary from some traditional Chinese input method and convert it to use with Smart Pinyin?


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Scim-user mailing list
[hidden email]
https://lists.sourceforge.net/lists/listinfo/scim-user


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Scim-user mailing list
[hidden email]
https://lists.sourceforge.net/lists/listinfo/scim-user
ePierre

Re: TC 繁 phrase input & TC 繁 as default in en_US.UTF-8 locale?

Reply Threaded More More options
Print post
Permalink
2007/2/28, Zhe Su <[hidden email]>:
> Hi,
>   I had a plan to develop a new version of smart pinyin input method which
> has both TC and SC dictionary, and I already finished part of the source
> code. But recently I'm too busy to have time on this project. Hope I can
> restart developing it in near future.

加油!!! :-)

Well, I hope you'll have time to spend on that point later, it could
be a great thing to have a new Smart Pinyin engine using both SC and
TC!

My development skills are close to NULL, but I'd be glad to help in any way!

--
Pierre
http://pierre.equoy.free.fr/
-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Scim-user mailing list
[hidden email]
https://lists.sourceforge.net/lists/listinfo/scim-user
simon_w

Re: TC 繁 phrase input & TC 繁 as default in en_US.UTF-8 locale?

Reply Threaded More More options
Print post
Permalink
Hi James / everyone,

2007/2/28, Zhe Su <james.su@gmail.com>:
> Hi,
>   I had a plan to develop a new version of smart pinyin input method which
> has both TC and SC dictionary, and I already finished part of the source
> code. But recently I'm too busy to have time on this project. Hope I can
> restart developing it in near future.
I wonder if any progress has been made on this?  As an intermediate student of Chinese, I really miss this functionality from the Windows Traditional Chinese IME.  So much so, in fact, that I usually use Windows in a VM if I want to type anything longer than a few sentences in Chinese.

I appreciate that those of us who want to type traditional Chinese in hanyu pinyin are very much in the minority, but the addition of a TC phrase dictionary would make a *huge* difference.  I frequently find myself typing something that I know how to say, but don't know which characters to use for.  Sometimes I will even type the phrase (in Windows) just to learn what the characters should be!

Is any further information available?

Thanks!

Simon
David Oftedal

Re: TC 繁 phrase input & TC 繁 as default in en_US.UTF-8 locale?

Reply Threaded More More options
Print post
Permalink
In reply to this post by David Oftedal
Hello everyone!

I took a look at the source code for scim-pinyin and libchewing to see
if the idea was actually feasible, and the conclusion has to be that it
probably could be, but it might be easier to do it another way.

sicm-pinyin has a library of phrases that looks like this:
形似而实质全然不同      0       ADJ
无产阶级文化大革命      30      N
我国人民        21
我好伤心        2

scim-chewing has one like this:
一個接一個 47 ㄧ ㄍㄜ˙ ㄐㄧㄝ ㄧ ㄍㄜ˙
一個接一個 47 ㄧ ㄍㄜ˙ ㄐㄧㄝ ㄧˊ ㄍㄜ˙

It seems almost like one could get away with removing the Bopomofo and
adding a few extra spaces. Though the grammatical classes such as "ADJ"
and "N" would of course be lost.

Each one also has a table that describes the pronunciation of each
character - libchewing uses some sort of keyboard layout converted back
to QWERTY, but it seems like it could be mapped to Hanyu Pinyin fairly
easily.

However, SCIM-Pinyin also has some extra files... A "special_table" and
a "pinyin_phrase_index", both of which Chewing lacks, and it also has
binary versions of the text files, and I'm not sure what it does with
them or how to generate them.

The only problem with Chewing, as far as I can see, is that it handles
Hanyu Pinyin really clumsily and locks up all the time, probably because
people who use traditional characters tend not to use it. But perhaps a
minor change to the Hanyu Pinyin table in Chewing would be easier than
porting all of its data into scim-pinyin. I might look into that next
time.

- David Oftedal

 On Wed, 28 Feb 2007 16:44:50 +0100
David Oftedal <[hidden email]> wrote:

> I have to chime in here. It's very nice that smart pinyin can output
> traditional characters, but it's still a bit incomplete as long as it
> lacks dictionary files in traditional Chinese. Seeing as how
> simplified characters can sometimes map to several traditional
> characters (and vice versa, even), it doesn't seem enough to just use
> a filter either. Perhaps we could "borrow" the dictionary from some
> traditional Chinese input method and convert it to use with Smart
> Pinyin?

------------------------------------------------------------------------------
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
_______________________________________________
Scim-user mailing list
[hidden email]
https://lists.sourceforge.net/lists/listinfo/scim-user