Post reply

Name:
Email:
Subject:
Tags:

Seperate each tag by a comma
Message icon:

Attach:
(Clear Attachment)
(more attachments)
Allowed file types: apk, doc, docx, gif, jpg, mpg, pdf, png, txt, zip, xls, 3gpp, mp2, mp3, wav, odt, ods, html, mp4, amr, apk, m4a, jpeg
Restrictions: 50 per post, maximum total size 150000KB, maximum individual size 150000KB
Note that any files attached will not be displayed until approved by a moderator.
Anti-spam: complete the task

shortcuts: hit alt+s to submit/post or alt+p to preview


Topic Summary

Posted by: Johann
« on: February 02, 2019, 12:12:07 PM »

OCR pdf into text Khmer

http://rnd.niptict.edu.kh/ocr/index.php#

Quote from: About
Khmer OCR is an Optical Character Recognition system for Khmer font which is one research project of Research and Development Center at National Institute of Posts, Telecoms, and ICT (NIPTICT). OCR is an important software tool for Khmer language in collection and compilation legacy documents most of which had been printed without or lost their digital files, and some were hand written documents. OCR can be used to convert such that documents into computer digital files as archived documents for any purpose in the future.
Many different Khmer fonts have been using everyday, we have developed Khmer OCR and can recognized several fonts such as:
Khmer Angulileka
Khmer S1 or Limon S1
Khmer M1 or Limon R1
Khmer Kep
Khmer OS Battambang
Khmer OS Siemreap

Approach

We used open source Tesseract OCR Engine for training. (https://code.google.com/p/tesseract-ocr/ ). We applied our rule based cropping method for Khmer language

Related Work

"Khmer OCR " is the first published web-based application that developed by Mr. Danh Hong, who has been working hard on Khmer language in computer field such as developed Khmer unicode fonts. Further more, without his guideline, our research team would not had started this project either.

Project Team

Mr. Rapid Sun, Director of Research and Development Center

Mr. Vichet Chea, Chief office of Research and Development
Mr. Nan Mech, Project assistant
Mr. Nan Mech, Project assistant
Mr. Reaksa Tep, Project assistant
Mr. Kea Sorn, Project assistant
Miss. Sreyhuy Leng, Project assistant
Mr. Vanna Chuon, Project assistant
Mr. Chheang Chorng Loem, Project assistant

Experiment and Evaluation

We used ISRI toolkit (http://isri-ocr-evaluation-tools.googlecode.com/files/ftk-1.0.tar.gz ) to evaluate the accuracy of each OCR model.
Here are close test evaluation result:
NoFontAccuracy (%)1Khmer Angulileka ()76.492
Khmer S1 or Limon S1 ()93.333
Khmer Kep ()96.574
Khmer M1 or Limon R1 ()94.305
Khmer OS Battambang ()93.966
Khmer OS Siemreap ()89.25

Contact us

Mr. Vichet CheaResearcher at Research and Development Center, NIPTICTTel: (+855) 77-657-007Email: Website: www.niptict.edu.

(it seems to work fine for certain fonts, not all)
Posted by: Johann
« on: February 02, 2019, 11:57:22 AM »

Alternative to the use of google for Dhamma translations Khmer. Since the Royal Government of Cambodia and it's ministeries are by national edict devoted to the tipple Gems, it can be assumed that it can be taken in trust that the giver rejoices by making use of it for the Sangha for Dhamma purposes:

http://rnd.niptict.edu.kh/smt/

(of course it's would be possible good if clear expressed, requested, to erase all doubts and dangers of faults and remorse)

Quote from: Info
National Institute Of Posts, Telecoms & Ict (Niptict)
    Email: rnd-info@niptict.edu.kh
    Tel:069 657 007
    Address:Khtor Village, Chriychangva Commune, Chroychangva District,Phnom Penh (West of bride #2, National Road 6A
Posted by: Johann
« on: May 18, 2015, 11:48:21 AM »

Atma hat sich erlaubt eine Anfrage und Einladung an das Team des Online Dictionaries auf http://dictionary.tovnah.com/contacts zu senden:

Quote from: Absende Antwort im Kontaktformular, 18.05.2015
Thank you for your suggestion/comments.

Sender: johann.brucker@sangham.net
Subject: Khmer Dictionary: Request and invitation
Content:

Valued Khmer-Dictionary team,
first Atma (me) would like to congratulate you to this great website and support. Sadhu!

Currently there is a lay mans project to develop a "Vitual Vihara" (Buddhist monastery) and some Cambodian voluntary students of German language would like to support the translation of the language files into Khmer language so that it is more accessible for Cambodian people. Atma, who is currently helping to develop and give advices on sangham.net, would therefore kindly ask for the use of your web site as translation tool and furthermore invite you to join this work and other translation into Khmer language. As internet and computer are a fast growing matter in Cambodia but the used language on net has not really found proper development, its maybe also a interesting topic for your works here. Maybe worthy a sub-directory.

Atma sees forward to read your reply and urges you not to feel pressured by the request and invitation while pardon in advanced for the uninvited request via your contact page.

metta & mudita
Samana Johann
(Forest Monk dwelling in Cambodia)
Here also current topics in regard of the translation works: http://forum.sangham.net/index.php/topic,1135.0.html http://forum.sangham.net/index.php/topic,1782.0.html
Posted by: Johann
« on: June 30, 2013, 01:22:52 AM »


dict.cc: English-German

Quote from: Paul Hemetsberger
Gemeinschafts-Wörterbuch dict.cc

Mein derzeitiges Hauptprojekt stellt den Versuch dar, ein Online-Wörterbuch für Deutsch/Englisch-Übersetzungen unter Mithilfe von Benutzern aus der ganzen Welt aufzubauen und zu verbessern. Der Grundgedanke ist dabei ähnlich der Wikipedia , von der Umsetzung her gibt es naturgemäß Unterschiede. Ich lade jeden Besucher meiner Seiten ein, das System unter dict.cc/contribute auszuprobieren und freue mich über konstruktive Rückmeldungen. Übrigens:
Mittlerweile ist dict.cc bereits umfangreicher als LEO !

Neben der Kontrolle von Übersetzungen und anderen administrativen Tätigkeiten wie etwa Benutzerbetreuung und Serverwartung entwickle ich derzeit das Konzept basierend auf Benutzerfeedback weiter, um Wörterbuch , Vokabeltrainer und Übersetzungsforum in Zukunft auf beliebige Sprachkombinationen ausdehnen zu können.

What dict.cc is all about?



Wörterbücher zum herunterladen und offline arbeiten:

Freelang German-English dictionary

LingoPad

Webtools:

Notepad++ is a free  source code editor and Notepad replacement that supports several languages. Running in the MS Windows environment, its use is governed by GPL License.

Christian Simon hat freundlicherweise seine kompakte Wörterbuch-Software zum offline lesen und arbeiten der dict.cc Daten für sangham.net gegeben. Siehe: Offline Wörterbuch software - "elcombri"



Da Programm läßt sich leicht installieren. Nach der Installation müssen Sie das Datenfile (Wortliste von dict.cc, zur Verfügung gestellt von Paul Hemetsberger) von seiner Webseite laden (Achtung! nur das file elcombri verwenden, die uft-8 files funktionieren nicht) und im Programm als Wörterbuch laden.

Habe das Programm gerade heruntergeladen und installiert und funktioniert alles einwandfrei und simpel. Hier auch noch einmal die Gabe mit den Angaben der Downloadlinks:

Quote from:
Am 2013-06-29 17:01, schrieb elcombri.de:
> Hallo Herr Brucker,
>
> schön, dass Ihnen der elcombri Translator gefällt und Sie diesen gern nutzen.
>
> Sie können den elcombri Translator gerne auf Ihrer Webseite zum Download anbieten. Schön wäre jedoch, wenn Sie die Dateien nicht auf Ihren Server laden, sondern einen Link auf die unter elcombri.de gehosteten Dateien setzen. Dies wären:
>
>     http://download.elcombri.de/download.php?file=translator/translator11cSetup.exe
>     http://download.elcombri.de/download.php?file=translator/translator11c.zip
>     http://download.elcombri.de/download.php?file=translator/translator11c.jar
>
> Damit ist sichergestellt, dass meine Statistiken die korrekte Anzahl an Downloads enthält. Und ich kann gegebenenfalls auch eine aktuelle Version unter dem Link bereitstellen. Sie hätten dann keine Nachteile durch veraltete Dateien.

Mehr Infos und Möglichkeiten für Rückfragen, wie auch die Möglichkeit sich erkenntlich zu zeigen, wenn man das wünscht, finden Sie auf: www.elcombri.de

 *sgift*
Posted by: Administration
« on: May 01, 2013, 12:17:46 AM »

Pali Keyboard

Windows Keyboards for Typing with Unicode Latin-script Pali Fonts

Original source: http://fsnow.com/pali/keyboard/



Pali Keyboard

Windows Keyboards for Typing with Unicode Latin-script Pali Fonts

In order to type Pali, you need a tool to map keystrokes to Pali characters, preferably one that works with commonly used applications. The Microsoft Keyboard Layout Creator (MSKLC) is an easy way to create Windows keyboards for typing languages that are not directly supported by Windows.

Attached for keyboards:
German
English (UK)
English (US)


You may download it for your Dhamma work here (you will find the installation explaining here as well:



Download: http://forum.sangham.net/index.php?action=tpmod;dl=item75

Posted by: Administration
« on: April 30, 2013, 09:04:37 PM »

Quote from: www.cambodia.org
Fonts | Khmer Fonts | Cambodian Fonts | Khmer Unicode

As computer and internet industry gain influence and market in Cambodia, several types of Khmer fonts have been developed as well, such as Khek font, Limon font, Zero-Space font, and many others just to name a few. Most of them were not developed by using Unicode or meet the guideline of the Unicode Standards .
 However, all of these fonts have been widely utilized with word
processing, such as Word in Microsoft Office. Because many of these fonts were neither developed using Unicode Standards nor adopted by makers of World Wide Web (WWW) browsers, many Khmer fonts were not readable without special library drivers.

Khmer Unicode

Khmer Unicode For Window Vista:
  Microsoft Window Vista (32bit and 64 bit) comes with Khmer Unicode built-in, but required you to set it up in order to read Web page using Khmer Unicode or to write in Khmer Unicode properly. The keyboard layout is a little bit different from keyboard layout developed by NIDA. Example, to type, kra-bey (in khmer), firstly type "K", then press "Space" to reserve space for Jerng (or Chherng) and press "R". To space between character, hold "Shift" and press "Space".
Now, you should have kra. Download Khmer OS fonts from the right side and you will enjoy and have fun with all the fonts style and types.
Khmer  Software Initiative (KhmerOS), National  Information Communications Technology Development Authority (NIDA), and Open  Institute joined to create a project for developing OpenSource software  that can accommodate Khmer Unicode-based fonts. Khmer Unicode is a part of  their project, but it has not yet widely utilized or built-in as part internet  browsers or software applications. It is, however, gradually becoming popular  among users/developers in Cambodia.  Khmer Unicode has been developed to use in platforms such as:

  • OpenOffice (Word Processing),
  • OpenSUSE (Linux based Operation System),
  • Khmer Email Application (Thunderbird-based email application),
  • Mekhala (FireFox-based Internet Browser)

Khmer Unicode For Window XP:

For MS Window XP, Khmer Unicode Keyboard (NIDA 1.0) driver is required. KhmerUnicode2.0.0.exe (developed by KhmerOS and NIDA) has both Khmer Unicode software and Khmer Unicode Keyboard (NIDA 1.0). Please follow the below instruction to download and install it. If you install the Khmer Unicode in your computer system correctly, you should be able at least to view the web site in Khmer via Mozilla FireFox, MS IE, Opera, and Safari. After installing it and you would like to see if you can read/view the page in Khmer Unicode, open your FireFox browser, and go to all these website http://www.cambodia.org/kh/buzz/ , Radio Free Asia (http://www.rfa.org/khmer/ ), http://www.google.com.kh/ (only in FireFox), http://km.wikipedia.org . To type in khmer, you are recommended to read the Instruction, "Documents How to Write " and follow the Keyboard Layout .

If your MS Window XP has Service Pack 2 installed, you can view the pages of Radio Free Asia (http://www.rfa.org/khmer/ ) in Khmer via Internet Exploer 6.0 or higher. In this case, RFA utilizes WEFT to have the pages viewed in Khmer even without Khmer Unicode installed.

How to install Khmer Unicode (KhmerUnicode2.0.0.exe) on Your Window XP and Vista 32-bit (Click Khmer Unicode for Microsoft Window Vist 64-bit (x86) )

  • Download KhmerUnicode2.0.0.zip (version 2.0.0)
     
  • Use a Zip softwares to Extract the KhmerUnicode2.0.0.zip
  • Installation:


    •       Click on this KhmerUnicode2.0.0.exe icon

       


    •       Click "Next" as indicating by the arrow

       


    •       Click "Next" as indecating by the arrow

       


    •       It may take minutes to wait...

       


    •         Click "Finish"

       


    • At the bottom-corner of your computer screen, you should see this image that allow you to select either CA: Catalan or EN:English (United States) for Writing (Typing). For writing in Khmer, you need to select CA:Catalan.

How to type Khmer Unicode in English (PDF)

How to type Khmer Unicode in Khmer (PDF)
KkhmerOS Download Page

Khmer Fonts Using TrueType

If a computer system and/or software uses TrueType fonts, then Khek font as described below works perfectly.

               
  • Khek font is developed by Khek Brothers , one of the earliest groups designing high quality Khmer fonts.
    Khek font was primarily made for use with Microsoft products running on Windows platform such as the various Windows versions from 3.x all the way to the current Vista. It also runs on Apple computers including Macintosh and the current Family of iMacs.
    Khek font is the most popular among users in the United States and other oversea countries. Learn more
     
  • Limon font and ABC Zero-Space font are traditional fonts developed using “Legacy Encodings ”,
     which is not part Unicode Standards. These two fonts are free and can be downloaded on this page under download section.
Khmer OpenType by Microsoft

Microsoft created an OpenType font and has been supporting it as standard, while Apple created ATT. In 2004  the OpenType font was adopted and supported by Adobe. Font developers creating Khmer fonts  can use OpenType standard. Learn more

Notes: This
 page does not focus on the technical parts of how  Khmer fonts were
created or the fundamentals of Khmer Unicode. But it does show  how to
utilize Khmer fonts and where to get Khmer fonts.

References and Khmer Fonts Resources:

http://projects.thedanielmay.com/khmerfonts/unicode.htm
http://www.microsoft.com/typography/otfntdev/khmerot/default.htm
http://www.wazu.jp/gallery/Fonts_Khmer.html

Keyboard layout (pdf): KeyboardLayout_NIDA


Intern Download please follow here:


Download (and info): http://forum.sangham.net/index.php?action=tpmod;dl=item72 (you also find the Keyboard Layout there)
Posted by: Johann
« on: April 04, 2013, 12:58:24 AM »


www.libreoffice.org

Das ist ein freies Office-Paket (fast das gleiche wie OpenOffice), mit dem man auch MS-Word-Dokumente und so bearbeiten kann. Wenn man damit arbeitet, ist es aber vernünftiger, Dinge im "hauseigenen" .odt-Format (open document trallala) zu speichern. - Wie die .doc- und .docx-Formate funktionieren, das wird ja nicht von MS öffentlich gemacht, und das haben die dann natürlich durch probieren selbst ausgetüftelt. Zumindest vorm Speichern im docx-Format wird daher ausdrücklich gewarnt, auch wenn alles größtenteils ziemlich reibungslos zu funktionieren scheint. Das .doc-Format scheint aber wohl gut durchschaut und gefahrlos nutzbar zu sein.
Posted by: Johann
« on: February 11, 2013, 11:57:31 AM »

Hier können Sie hilfreiche Werkzeuge zur Verfügung stellen und teilen.

Konverter:

Um html Texte leichter mit allen Formaten Links und Schriften hier im Forum einzubetten, hier ein "Cooles" Werkzeug:

Cool HTML to BBCode Converter

Wenn Sie html texte haben, geben Sie die texte (inkl codes) hier ein und konvertieren Sie sie dirket in SMF (unserer Software) hier.

Online Wörterbücher:

LEO An Online Service by LEO GmbH

Cambridge Dictionaries Online

Oxford Dictionaries

dict.cc: English-German

Quote from: Paul Hemetsberger
Gemeinschafts-Wörterbuch dict.cc

Mein derzeitiges Hauptprojekt stellt den Versuch dar, ein Online-Wörterbuch für Deutsch/Englisch-Übersetzungen unter Mithilfe von Benutzern aus der ganzen Welt aufzubauen und zu verbessern. Der Grundgedanke ist dabei ähnlich der Wikipedia , von der Umsetzung her gibt es naturgemäß Unterschiede. Ich lade jeden Besucher meiner Seiten ein, das System unter dict.cc/contribute auszuprobieren und freue mich über konstruktive Rückmeldungen. Übrigens:
Mittlerweile ist dict.cc bereits umfangreicher als LEO !

Neben der Kontrolle von Übersetzungen und anderen administrativen Tätigkeiten wie etwa Benutzerbetreuung und Serverwartung entwickle ich derzeit das Konzept basierend auf Benutzerfeedback weiter, um Wörterbuch , Vokabeltrainer und Übersetzungsforum in Zukunft auf beliebige Sprachkombinationen ausdehnen zu können.

What dict.cc is all about?



Wörterbücher zum herunterladen und offline arbeiten:

Freelang German-English dictionary

LingoPad

Webtools:

Notepad++ is a free  source code editor and Notepad replacement that supports several languages. Running in the MS Windows environment, its use is governed by GPL License.