Development of Technology and Tools
Standardization of Bangla Fonts and Codes
In order to make computer applications, such as Document Processing, E-mailing, Calculations, possible in Bangla, in a way that is interoperable across different computing platforms, it was necessary to establish a standard for encoding the data. SNLTR has established that Unicode 5.0 and above as the standard that can be adopted for different e-governance applications and is also in parity with the international practice and standard. This is in direct contrast with the plethora of non-standard Bangla software proliferating the market that hinders interoperability across different computers. Based on these standardizations some font encoding converters have been developed that are discussed in the next section.
In order to facilitate data and document entry in Bangla, a number of keyboard layouts have been designed and implemented to suit the needs of different people habituated with different typing practices. The suite of such keyboards is being increasingly upgraded. The salient feature of all these keyboards is that they are all Unicode 6.3 compatible and hence any document entered through them is acceptable across any standard platform and can be visualized through any Unicode compliant Bangla fonts. SNLTR has designed a new Bangla keyboard named “Baishakhi Keyboard”, which has a 3-layer keyboard structure, with Normal, Shift and Right Alt modes, designed to accommodate all the Bangla alphabetical signs and symbols. The keyboard layout is mostly phonetic in nature.
SNLTR has also customized two other popular keyboard layouts- Inscript and Gitanjali to make them fully UNICODE 6.3 compatible and named them as ‘Baishakhi Inscript’ and ‘Uni- Gitanjali’.
In order to conform with the existing practice of the government employees, another keyboard layout, named "Webel" has also been designed. This retains the existing "Webel" layout as used by the employees with the minor modifications as required for making it Unicode compliant.
Baishakhi Bangla Keyboard for Windows : Baishakhi Bangla keyboard is compatible with Windows (XP SP2 onwards) environment. Those interested to use this may, please, download from here.
Baishakhi Bangla Keyboard for Linux : Baishakhi Spirit is now spread globally to all Linux distribution. The Baishakhi keyboard is now accepted by all leading Linux distributions as a favorite and easy Bangla Keyboard Layout developed by SNLTR. Ubuntu comes with Baishakhi Keyboard by default, no need to add extra packages.
Baishakhi Bangla Keyboard for Mac : UNICODE Compliant Baishakhi Keyboard for Bangla is now ready for use in the MAC OSX (10.4 onwards) environment. Those interested to use this may, please, download from here.
Baishakhi Bangla Keyboard for Android : To download the Android version of Baishakhi Bangla Keyboard please, click here.
To view the Bangla properly Unicode fonts should be available in the system. Installing this tool two Unicode Bangla Font 'Vidya' and 'Bangla Akademi' will be installed in the system.
This project aims at developing a robust spell checker for Bengali that will be useful for various Bengali applications. The technology is suitable for use with editors and office applications.
A Bangla spell checker to minimize the spelling mistakes has long been a desideratum. SNLTR has developed a robust spell checker which can be used in UNICODE compliant editors like MSOffice, Open Office. Performance wise it is the most efficient Bangla spell checker available in the market. It also provides suggestion list for the user.
For Microsoft Office : This spell checker Add-ln functions through the user interface of Microsoft Office Word 2007, 2010 & 2013. This spell checker engine is dependent on Microsoft Office Word and cannot be used by other NLP applications. The lexical database used here is capable of storing and processing large wordlist. This spell checker add-in can only be installed in Windows environment.
For OpenOffice.Org : This spell checker Add-On is an external UNO component providing functions through the user interface of OpenOfﬁce.org Writer. This spell checker engine is dependent on OpenOffice.Org Writer and cannot be used by other NLP applications. The lexical database used here is capable of storing and processing large wordlist. This spell checker add-on can be installed in both Windows and Linux platforms.
For Online Use : This Spell Checker may be used online through this URL : http://banglabanan.nltr.org. This interactive online version also provides suggestion list for the user.
Bangla Optical Character Recognizer (OCR)
Based on the standards defined by SNLTR, a Bilingual version of the Linux Operating System has been developed and it has been named as বৈশাখী লিনাক্স (Baishakhi Linux). This operating system is open source and is based on Ubuntu flavor of Linux. Any desktop or laptop can have Baishakhi Linux as the operating system. This operating system will allow any user to be able to do all possible operation and computations that are supported in other Linux based systems. The Baishakhi Linux operating system supports all the available features of office works, such as Document Preparation, Presentation Preparation, Spreadsheet Computation, E-mail facilities, Web browsing etc. Further, these operations can be carried out in a Bi-lingual mode that is, both in Bangla as well as in English. Baishakhi Linux is distributed free.
Meetei Mayek Keyboard
While editors for writing musical notations are available for western scores, none such exists for the Indian Music System. The present work is developing an editing system that will enable the users write the Indian musical notations in a computer. The project further aims at providing facility to play the Swaralipi to facilitate music composition by the composers.
Conversion of Legacy Documents
There is a huge repository of Bangla digital documents that were not prepared in accordance with the Unicode 5.0 standard. These documents have been written in non-standardized fonts, or written in other conventions that are not based on Unicode compliant systems. Hence, any system that is not compatible with any of these fonts or encoding standards could not display Bangla texts properly. To have the benefit of this large corpus of information and data for further use, SNLTR has developed a number of code-conversion software that can convert the electronic version of legacy data to the Unicode 5.0 format.
This Tool is very useful for the user who doesn’t like to type Bangla using the normal keyboard. Installing this Tool a Soft Keyboard GUI will be available, where user can type Bangla into a specific text area by pressing the key of a virtual keyboard with mouse. The typed text can be cut and pasted to desired application.
Bangla firefox 3.0
Installing this tool Firefox-3.0 web browser will be available in Bangla.
Bangla OpenOffice 2.4(Windows)
The open Office application is the alternative of the Microsoft Office application, which is available freely. Like MS office all Document preparation, PowerPoint presentation preparation and Excel Spreadsheet computations can be done using this application.
This is a unique application which is very useful for Bangla. This Tool has the following features:
Normalization of Bangla Documents.
Word and Akshar Count: The ‘Word and Akshar Count’ feature is used to count the number of words and Akshars (considering yuktakshar as a single character) within a given text document. The count operation is done after normalizing the file. The feature is applicable for both txt file and Microsoft word file.
Sorting of a Bangla Document File: The ‘Sort’ operation is also done after normalizing the document file. This feature is also applicable for txt as well as Microsoft word file.
Bangla Word Browser: The ‘Bangla Word Browser’ is used to browse Bangla words. The feature can further be used to check spelling of a Bangla word. The digitized form ‘Bangla Academy Banan Abhidhan’ which has been also digitized by SNLTR has been used as the datum of the word database in this application. The figure below is an illustration of working of Bangla Sahayika as a Word Browser.
On-going projects :
This is a fundamental tool required for development of technologies, such as language translators. While existence of such facilities is commonplace in western languages, none exist for Bangla, for that matter for most of the Indian languages.