¤¤¤å

Cantonese Text-to-Speech (TTS) System

                                                                                                                        V3 is now ready

New Super Natural Male Voice

Web Demo

Main Features

Samples

Screen Shots

Purchase

Applications

Demo Download

System Requirements

Contact Us

 

Introduction

The Cantonese Text-to-Speech (TTS) system introduced by PiTL provides a CD quality output from Chinese (Big5) Text to Mp3.  The system consists of the technologies in Speech Signal Processing, Artificial Intelligence and Cantonese Linguistics that can transform a Text into native Cantonese Sounds. It is a Hong Kong style Cantonese speaking machine and can pronounce most of the local dialect (including native slang) of everyday language spoken by Hong Kong citizen.  The system is tested to read different style of literatures: Classics, Poems, Fictions, News, Jokes, Advertisements, and Movie Scripts and offer a high literacy of the read out.


This system also caters for the Visual Impair Persons who control the program with Key-strokes instead of Mouse clicks ¡V all functions in the program have the corresponding short-cut key-strokes.

Recently, MP3 player is getting very popular and the price is becoming very affordable. However, majority of the MP3 player users listen to Music. There may be a large population and potential that will use MP3 player to listen to information. We are dreaming a world that people will download a Newspaper, Magazine, or Fiction to MP3 player on demand. TTS technology enables a low cost and fast production of audio media and combining TTS and MP3, people can listen to the information at any time any where. Most important, no lighting, no glasses, no paper are required.

We also post a concept of e-Newspaper. Although PDA claims to be very handy for mobile information, however, due to the limitation in the size of the screen, reading large amount of text from PDA screen is an eye-sight burning exercise. If we can put only Headings/Graphics on the screen and the details are left for the user to click the Hyperlinks and then listen, we can include lots of information in a pocket size PDA screen.


¡§Point to the Titles, Listen to the details.¡¨

PiTL believes that the merging of e-book, MP3 and TTS technologies will bring us a new information era.

Demo Version Download
Try out our demo at: speakd03.zip (9MB)

(Please read Installation.txt before install)

 

Purchase V3 Full-version: HKD200 per license (single user)

 

System requirements:
You: Understand Cantonese, Can read Chinese
Your PC:

Microsoft Windows Vista, XP, 2000, ME, 98

Set to support Traditional Chinese version (Big5 Character set)

Intel PIII 500Hz or above

128M RAM

Sound Card & Speaker

100M Free Hard-Disk Space in C: Drive

CD-ROM


Main Features

True Voice               Sound Database is built on a high quality samples. This creates a more natural reading output. The MP3 files are written out at 64K encoding and is comparable to CD quality. In the future, more Sound Profiles will be created and will include different kind of expression and mood (such as: Happiness, Angriness, and Sadness). This is very difficult to be achieved by digital synthesis speech.

 

Variable                   By employing advanced DSP technologies, the reading speed can be adjusted at real-time without affecting the speech

Speed                      quality and pitch. Users can custom the reading speed to fit their preferences.

 

Rhythm                    Each sentence will be analyzed by the built-in AI mechanism and generated a reading tempo.

This will produce a more colorful sounding (especially fits for reading Chinese Poems).

 

Fast                          Real time reading. Batch processing of text files. (Individual file size can be up to 20K Chinese Characters). In 15 minutes, 20K Chinese Characters could be read to a total play time of 30 minutes MP3 files.( Intel P4 1GHz standard).

 

Accurate                  The system references to various Chinese dialect, linguistic and grammar dictionaries to handle the Polyphonic Characters  in Cantonese.  More than 90% accuracy can be achieved in reading all types of literatures ( News, Finance, Entertainments, Classics, Poems, and Fictions).

 

GB&BIG5                  Build in GB to BIG5 Character set converter allows the system to process all Chinese characters.

 

English                     Most of the Chinese TTS will only pronounce the English alphabets. Our system handles a mixture of both Chinese and English.

This is of importance for nowadays text information.

(English TTS is provided by the Microsoft internal TTS engine which is installed in Window XP).

 

Custom                    Flexibility is built in the system that allows sound editing. Users can select the correct sounding for a character before MP3 generation.

Sound                    This will increase the literacy and make it simpler to handle the complex Chinese Language.

 

Hong Kong               It is a Hong Kong style Cantonese speaking machine and can pronounce most of the local dialect (including native slang)

Style                  of everyday language spoken by Hong Kong people.  The system will support HKSCS (Big5 extension for Hong Kong character set). Comics and Conversation scripts will be read out in a natural way.

 

Screen Shots

ºuºuªø¦¿ªF³u¤ô¡A®öªá²^ºÉ­^¶¯¡C

¬O«D¦¨±ÑÂàÀYªÅ¡C

«C¤s¨Ì¦b¡A¤L«×¤i¶§¬õ¡C

¥Õµoº®¾ö¦¿²Z¤W¡AºD¬Ý¬î¤ë¬K­·¡C¤@³ý¿B°s³ß¬Û³{¡C

¥j¤µ¦h¤Ö¨Æ¡A³£¥I¯º½Í¤¤¡C

 

¡X¡X½Õ±H¡mÁ{¦¿¥P¡n

Listen¡Gsample1.mp3

 

Custom Sound

Usually applies in Surname

´¿(=zang1=) (´¿(=zang1=)¥ý¥Íªº´¿(=zang1=))

´¿ (´¿¸gªº´¿)

¦³¤H°Ñ(=sam1=)¥[¤J (¤H°Ñ(=sam1=)-«O«~)

¦³¤H°Ñ¥[¤J     

Listen¡Gcustom01.mp3

Sample Readings

Finance

´äªÑ¦¬¥«¤É216ÂI

12¤ë 9¤é ¬P´Á¤G 16:20 §ó·s

§À¥«¦³¶R½L¤ä«ù¡A«í¥Í«ü¼Æ¦¬¥«³ø12393.64¡A¤É216.20¡A¦¨¥æ134»õ¤¸¡C

ªø¹ê(0001)³ø60.25¤¸¡A¤É1¤¸¡F©M¶À(0013)³ø54¤¸¡A¤É0.5¤¸¡F·s¦a(0016)³ø65.25¤¸¡C

¶×Â×(0005)³ø119¤¸¡A¤É2¤¸¡F«í¥Í(0011)³ø101.5¤¸¡A¤É1.5¤¸¡C

Listen¡Gstock01.mp3

 

Advertisement

Fashion One ¶}­Ü¡T

©|¦³¨â¬P´Á¡A¸t½Ï´N±þ¨ì®I¨­¡CÀH¦í¸gÀÙ¦nÂà¡A¥«­±ªº½T¿³©ô°_¨Ó¡A¤H¤H¦£µÛ»`ù¸t½Ï§ª«¡A¤]¦£µÛ¸Ë¨­°Ñ¥[¬£¹ï¥h¡C·Q±½³f¤S¤£·Q¤Ó¯}¶O¡S·íµM­n§ìºò¶}­Ü³o¨Ç¨}¾÷¡C¶°¦h­Ó°ê»Ú¯Å«~µP©M¤j®v³]­pªº¾c©±-Fashion One¡A©ó¤µ¤é¶}­Ü¡C©Ò¦³¾c¼i¡B¤â³U¡BNew Style ³ò¤y¡B¥Ö¯ó¤Î°t¹¢µ¥¡A¥þ³¡§C¦Ü¤T§é°_¡C·í¤¤ÁÙ¦³¤µ©u New Times ©M Party Mood µ¥è°¾c¿ï¾Ü¡C·Q¥h¬£¹ï®É¦³¹ïè°¾cµÛ(=zoek3=)¡A°O¦í®É¶¡¥h±½³f¡C

Listen¡Gfashion01.mp3

 

Hong Kong Style dialect and slang

µo¹F°Õ(=laa3=)! §Ú¤¤(=zung3=)¥ª¤µ´Á¼T(=ge3=)¤»¦X±mØ{(=wo3=), ©O¦¸¥´¶_¸}¤U¥b¥@³£­ø¨Ï¼~Åo(=lo3=)!

¯u«Y?

žX(=ce1=),µo¥Õ¤é¹Ú©Q, °á(=lam2=)¤U(=haa5=)³£ÉN«§°ÝÃD¬[ (=gaa3=)¹À

Listen¡Gchat01.mp3

  

Poem

ª÷Á\¦ç

ÄU§g²ö±¤ª÷Á\¦ç¡A ÄU§g±¤¨ú¤Ö¦~®É¡C

ªá¶}³ô§éª½¶·§é¡A ²ö«ÝµLªáªÅ§éªK¡C

Listen¡Gpoem01.mp3

 

Classic

Romance of the Three Kingdoms

¿ï¦Û: ¤T°êºt¸q

²Ä¤@¦^: ®b®ç¶é»¨ªN¤Tµ²¸q,±Ù¶À¤y­^¶¯­º¥ß¥\.

 

¡@¡@¡@ºuºuªø¦¿ªF³u¤ô¡A®öªá²^ºÉ­^¶¯¡C¬O«D¦¨±ÑÂàÀYªÅ¡C

¡@¡@¡@«C¤s¨Ì¦b¡A¤L«×¤i¶§¬õ¡C¡@¡@

            ¥Õµoº®¾ö¦¿²Z¤W¡AºD¬Ý¬î¤ë¬K­·¡C

            ¤@³ý¿B°s³ß¬Û³{¡C¥j¤µ¦h¤Ö¨Æ¡A³£¥I¯º½Í¤¤¡C ¡X¡X½Õ±H¡mÁ{¦¿¥P¡n

 

¡@¡@¸Ü»¡¤Ñ¤U¤j¶Õ¡A¤À¤[¥²¦X¡A¦X¤[¥²¤À¡C©P¥½¤C°ê¤Àª§¡A¦}¤J¤_¯³¡C¤Î¯³·À¤§¦Z¡A·¡¡Bº~¤Àª§¡A¤S¦}¤J¤_º~¡Cº~´Â¦Û°ª¯ª±Ù¥Õ³D¦Ó°_¸q¡A

¤@²Î¤Ñ¤U¡A¦Z¨Ó¥úªZ¤¤¿³¡A¶Ç¦ÜÄm«Ò¡A¹E¤À¬°¤T°ê¡C±À¨ä­P¶Ã¤§¥Ñ¡A¬p©l¤_®Ù¡BÆF¤G«Ò¡C®Ù«Ò¸TÀDµ½Ãþ¡A±R«H«Æ©x¡C¤Î®Ù«Ò±Y¡AÆF«Ò§Y¦ì¡A

¤j±N­xÄuªZ¡B¤Ó³Å³¯¿»¦@¬Û»²¦õ¡C®É¦³«Æ©x±ä¸`µ¥§ËÅv¡AÄuªZ¡B³¯¿»¿Ñ¸Ý¤§¡AÉ󍯤£±K¡A¤Ï¬°©Ò®`¡A¤¤®þ¦Û¦¹·U¾î¡C

 

¡@¡@«ØÉr¤G¦~¥|¤ë±æ¤é¡A«Ò±s·Å¼w·µ¡C¤è¤É®y¡A·µ¨¤¨g­·ÆJ°_¡C¥u¨£¤@±ø¤j«C³D¡A±q±ç¤W­¸±N¤U¨Ó¡AÂϤ_´È¤W¡C«ÒÕa­Ë¡A¥ª¥k«æ±Ï¤J®c¡A

¦Ê©x­Ñ©bÁסC¶·ªØ¡A³D¤£¨£¤F¡C©¿µM¤j¹p¤j«B¡A¥[¥H¦B¹r¡A¸¨¨ì¥b©]¤è¤î¡A§¥«o©Ð«ÎµL¼Æ¡C«ØÉr¥|¦~¤G¤ë¡A¬¥¶§¦a¾_¡F¤S®ü¤ôªx·¸¡Aªu®ü©~¥Á¡A

ºÉ³Q¤j®ö¨÷¤J®ü¤¤¡C¥ú©M¤¸¦~¡A»ÛÂû¤Æ¶¯¡C¤»¤ë®Ò¡A¶ÂÉa¤Q§E¤V¡A­¸¤J·Å¼w·µ¤¤¡C¬î¤C¤ë¡A¦³­i²{¤_¥É°ó¡F¤­­ì¤s©¤¡AºÉ¬Ò±Yµõ¡CÏúÏú¤£²»¡A«D¤î¤@ºÝ¡C

«Ò¤U¶@°Ý¸s¦Ú¥H¨aÉݤ§¥Ñ¡Aij­¦½²°o¤W²¨¡A¥H¬°ãê¼ZÂû¤Æ¡A¤D°ü¦x¤z¬F¤§©Ò­P¡A¨¥»á¤Áª½¡C«ÒÄý«µ¼Û®§¡A¦]°_§ó¦ç¡C±ä¸`¦b¦ZÅѵø¡A±x«Å§i¥ª¥k¡F

¹E¥H¥L¨Æ³´°o¤_¸o¡A©ñÂk¥Ð¨½¡C¦Z±iÅý¡B»¯©¾¡B«Ê륡B¬q¯^¡B±ä¸`¡B«JÄý¡BӡBµ{Ãm¡B®LÙ@¡B³¢Ð`¤Q¤HªB¤ñ¬°¦l¡A¸¹¬°¡§¤Q±`¨Í¡¨¡C«Ò´L«H±iÅý¡A

©I¬°¡§ªü¤÷¡¨¡C´Â¬F¤é«D¡A¥H­P¤Ñ¤U¤H¤ß«ä¶Ã¡Aµs¸é¸Á°_¡C

Listen¡Gsgyy01.mp3(Fast); sgyy02.mp3(Slow)

 

 

A trail on Intonation

In this new version (V2), intonation elements are added in. This will it make more colorful when reading poems.

Listern

 

                                                                                                                                                   (More) §ó¦h... (Chinese Only)

Applications

Besides personal use, there are several areas of application:

 

E-Book Publishing                  

Publishers, Writers and Media can think about offering MP3 e-Books or use it for Proof reading.

 

Teaching

MP3lecture notes, text book. Students can repeat a session for enforcing the memory.

 

Visual Impaired Person

The system has been sold to some VIPs. Their comments are ¡§Those days of unnatural digital voice are gone. The system brings them more human

sounds¡¨

 

Media/Multi-media

News, Weather, Finance updates, Speaking Web pages and G3 Mobiles can make use of the system to create MP3 sources.

 

MP3 Player

Most of the Music lovers have an MP3 player. How about ¡§Information lovers¡¨ , ¡§Gossip lovers¡¨, ¡§Book lovers¡¨ ? 

There are big markets out there for us to explore.

 

Learn Cantonese

Cantonese is one of the oldest dialects in China. Many characters in classics and poems have the corresponding Cantonese

Sounds. It is also the most popular dialect in Hong Kong.

Contact Us

Pacific i-Technology Limited (PiTL)

E-mail:       more@pitl.com

Tel:             (852)-2992 7108


Pacific i-Technology Limited (PiTL)    Email: more@pitl.com  TEL:(852)-2992 7108