Login

mvnForum

mvnForum Homepage Welcome Guest
  Search  
  Index  | Recent Threads  | Unanswered Threads  | List Polls  | Public Albums  | Who's Online  | Help


Quick Go »


No member browsing this thread
Thread Status: Active
Total posts in this thread: 5
Post new Thread
Author
Previous Thread This topic has been viewed 2986 times and has 4 replies Next Thread
Male BSJANG
Stranger
Member's Avatar


Joined: Jan 9, 2004
Post Count: 7
Status: Offline
Reply to this Post  Reply with Quote 
한글 검색이 제대로 지원이 안되는 이유 ........

한글 검색이 제대로 안되어서 소스를 살펴보니 mvnForum 이 검색엔진으로 Lucene 을 사용하고 있더군요. Lucene 사이트 가보니 아래와 같은 내용이 있어 최신 버전의 lucene 을 설치 했으나 여전히 한글 검색이 안되더군요

26 December 2003 - Lucene 1.3 Final Released

5. Fix StandardTokenizer's handling of CJK characters (Chinese,
Japanese and Korean ideograms). Previously contiguous sequences
were combined in a single token, which is not very useful. Now
each ideogram generates a separate token, which is more useful.

Lucene 1.3 Final Released 의 소스를 살펴보니 한글 검색 지원 부분이 빠져 있더군요.

Lucene 의 소스 파일 \lucene-1.3-final-src.tar\lucene-1.3-final\src\java\org\apache\lucene\analysis\standard\StandardTokenizer.jj 의 CJK 영역에 unicode의 한글영역을 더 추가하여 테스해 본 결과 90% 정도는 되는거 같은데 아직은 좀 더 살펴봐야 할 듯 ......
----------------------------------------
[Edit 3 times, last edit by BSJANG at Jan 9, 2004 4:49:14 PM]
[Jan 9, 2004 4:18:18 PM] Show Printable Version of Post    View Member Profile    Send Private Message [Link] Report threatening or abusive post: please login first  Go to top 
Male minhnn
mvnForum Developer
Member's Avatar

Vietnam
Joined: Oct 16, 2002
Post Count: 2956
Status: Offline
Reply to this Post  Reply with Quote 
Re: 한글 검색이 제대로 지원이 안되는 이유 ........

I dont understand this post but for your information, RC3-dev has just been update to the Lucene 1.3 Final cool
----------------------------------------
Minh Nguyen
mvnForum Developer
Want a free, open source Java Jsp/Servlet forum, get mvnForum at http://www.mvnForum.com

http://www.DienDanLinux.org
[Jan 10, 2004 11:05:16 AM] Show Printable Version of Post    View Member Profile    Send Private Message    Hidden to Guest    http://www.MyVietnam.net    minhnn_mvn [Link] Report threatening or abusive post: please login first  Go to top 
Male BSJANG
Stranger
Member's Avatar


Joined: Jan 9, 2004
Post Count: 7
Status: Offline
Reply to this Post  Reply with Quote 
Re: 한글 검색이 제대로 지원이 안되는 이유 ........

I dont understand this post but for your information, RC3-dev has just been update to the Lucene 1.3 Final cool


Lucene 1.3 Final still doesn't support Korean search.
Why don't you check 'StandardTokenizer.jj' file ?
You can find unicode missing to support Korean in CJK area.

=====================================================================================
| < CJK: // non-alphabets
[
"\u3040"-"\u318f",
"\u3300"-"\u337f",
"\u3400"-"\u3d2d",
"\u4e00"-"\u9fff",
"\uac00"-"\ud7a3", // add unicode to support korean
"\uf900"-"\ufaff"
]
>
=====================================================================================
[Jan 11, 2004 6:54:02 AM] Show Printable Version of Post    View Member Profile    Send Private Message [Link] Report threatening or abusive post: please login first  Go to top 
Male minhnn
mvnForum Developer
Member's Avatar

Vietnam
Joined: Oct 16, 2002
Post Count: 2956
Status: Offline
Reply to this Post  Reply with Quote 
Re: 한글 검색이 제대로 지원이 안되는 이유 ........

Lucene 1.3 Final still doesn't support Korean search.

I think you should report this bug to Lucene guys, it will not benefit Korean mvnForum's users but all Korean programs using Lucene too. smile
----------------------------------------
Minh Nguyen
mvnForum Developer
Want a free, open source Java Jsp/Servlet forum, get mvnForum at http://www.mvnForum.com

http://www.DienDanLinux.org
[Jan 15, 2004 4:20:02 PM] Show Printable Version of Post    View Member Profile    Send Private Message    Hidden to Guest    http://www.MyVietnam.net    minhnn_mvn [Link] Report threatening or abusive post: please login first  Go to top 
Male kimyohan
Stranger




Joined: Aug 24, 2004
Post Count: 6
Status: Offline
Reply to this Post  Reply with Quote 
confused Re: 한글 검색이 제대로 지원이 안되는 이유 ........

Lucene 1.3 Final Released 의 소스를 살펴보니 한글 검색 지원 부분이 빠져 있더군요.


답글 있는것 처럼 하면 되나요?
소스 코드로 lucene받아서 수정하고 컴파일 하면 되나요??
좀더 쉽게 설명좀부탁 드려요 ㅡㅡ confused
[Aug 26, 2004 11:05:48 AM] Show Printable Version of Post    View Member Profile    Send Private Message    Hidden to Guest [Link] Report threatening or abusive post: please login first  Go to top 
Show Printable Version of Thread  Post new Thread