Enable festival Linux (text-to-speech-system) to read/speak PDF and DOC files (Speech PDF and DOC in Festival Script)

Sunday, 20th September 2009

Today I wondered if festival supports reading of PDF files on Linux? The answer due to my short research was NO!.

Well though I couldn’t find official program to speak PDFs for me it’s not such a big deal since it’s not so hard to convert PDF files into plain text files in Linux with pdftotext command. 

pdftotext is part of the poppler-utils which is a nice package which alsocontains pdfimages – enabling you to extract images from pdfs,
pdftohtml – pdf to html converter and
pdffonts – pdf font analyzier. The normal way to read PDF files via festival is: First use pdftotext to convert your PDF to text file

$ pdftotext filename.pdf outputfile.txt

and then to make computer speak it over festival default configured synthesizer:

$ cat outfile.txt | festival –tts
For convenience I’ve created a small shell script I calledfestival-read-pdf.sh which does this directly.

Please download the festival-read-pdf.sh shell script here Furthermore I wondered how to make the Microsoft Office .doc files to be played throughfestival. On that account It was required something to convert again the .doc file extension to plain text. I came across antiword which I’ve blogged about in my previous post. Thus to carry it via festival you need to: antiword filename.doc | festival –tts I’ve fastly scripted it for some convenience. Download the festival-doc-read.sh script here I’ve also created a third bash script which enables you to select either to play DOC or PDF file in Festival.
Here is a link to the festival’s festival-read-doc-en-pdf.sh PDF, DOC speaker script .
Talking about festival it might be interesting to mention fala – A simple text reader.If you’re a Debian user you’ll be glad to know there is already a package containg fala. Well I hope you’ll find the PDF, DOC festival speech scripts useful. Enjoy

END—–

Share this on:

Download PDFDownload PDF

Tags: , , , , ,

9 Responses to “Enable festival Linux (text-to-speech-system) to read/speak PDF and DOC files (Speech PDF and DOC in Festival Script)”

  1. Quiliro says:
    Firefox 3.6.6 Firefox 3.6.6 GNU/Linux GNU/Linux
    Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.6) Gecko/20100709 Trisquel/4.0 (taranis) Firefox/3.6.6

    less file.pdf | festival –tts

    View CommentView Comment
  2. luka says:
    Firefox 3.6.15 Firefox 3.6.15 Windows XP Windows XP
    Mozilla/5.0 (Windows; U; Windows NT 5.1; pl; rv:1.9.2.15) Gecko/20110303 Firefox/3.6.15

    hi, can you put again this script?

    View CommentView Comment
  3. admin says:
    Epiphany 2.30.6 Epiphany 2.30.6 Debian GNU/Linux x64 Debian GNU/Linux x64
    Mozilla/5.0 (X11; U; Linux x86_64; en-us) AppleWebKit/531.2+ (KHTML, like Gecko) Version/5.0 Safari/531.2+ Debian/squeeze (2.30.6-1) Epiphany/2.30.6

    Hi Luka.
    Sorry to put a broken link.
    Now the script is with correct link and is downloadable.
    I hope it’s useful to you.
    Looking forward to hear your feedback.

    B

    View CommentView Comment
  4. luka says:
    Firefox 3.6.15 Firefox 3.6.15 Windows XP Windows XP
    Mozilla/5.0 (Windows; U; Windows NT 5.1; pl; rv:1.9.2.15) Gecko/20110303 Firefox/3.6.15

    Big thanks! and thank you for your quick reply 🙂

    View CommentView Comment
    • admin says:
      Epiphany 2.30.6 Epiphany 2.30.6 Debian GNU/Linux x64 Debian GNU/Linux x64
      Mozilla/5.0 (X11; U; Linux x86_64; en-us) AppleWebKit/531.2+ (KHTML, like Gecko) Version/5.0 Safari/531.2+ Debian/squeeze (2.30.6-1) Epiphany/2.30.6

      My pleasure, if you like my blog you could subscribe and we can stay in touch.
      Cheers ! 🙂

      View CommentView Comment
  5. Angeline Kamansky says:
    Firefox 21.0 Firefox 21.0 Mac OS X 10.8 Mac OS X 10.8
    Mozilla/5.0 (Macintosh; Intel Mac OS X 10.8; rv:21.0) Gecko/20100101 Firefox/21.0

    You need to be a part of a contest for one of the best websites on the internet.
    I will recommend this site!

    View CommentView Comment
  6. Vagner Rener says:
    IceWeasel 24.4.0 IceWeasel 24.4.0 GNU/Linux GNU/Linux
    Mozilla/5.0 (X11; Linux i686; rv:24.0) Gecko/20140319 Firefox/24.0 Iceweasel/24.4.0

    How can I make festival speak other languages and how can I make ‘text2wave’ – a wrapper for festival – record text in other languages.

    thx

    View CommentView Comment
    • admin says:
      Firefox 29.0 Firefox 29.0 Windows 7 x64 Edition Windows 7 x64 Edition
      Mozilla/5.0 (Windows NT 6.1; WOW64; rv:29.0) Gecko/20100101 Firefox/29.0

      Hi you will have to add more festvox, in Debian and Ubuntu to check what kind of extra festvox are there:

      apt-cache search festvox
      festvox-hi-nsk – Hindi male speaker for festival
      festvox-italp16k – Italian female speaker for Festival
      festvox-itapc16k – Italian male speaker for Festival
      festvox-mr-nsk – Marathi male speaker for festival
      festvox-te-nsk – Telugu (te) male speaker for festival
      festvox-czech-ph – Czech male speaker for Festival
      festvox-kallpc16k – American English male speaker for festival, 16khz sample rate
      festvox-kallpc8k – American English male speaker for festival, 8khz sample rate
      festvox-kdlpc16k – American English male speaker for festival, 16khz sample rate
      festvox-kdlpc8k – American English male speaker for festival, 8khz sample rate
      festvox-ru – Russian male speaker for Festival
      festvox-suopuhe-common – Common files for Festival Finnish speakers
      festvox-suopuhe-lj – Finnish female speaker for Festival
      festvox-suopuhe-mv – Finnish male speaker for festival
      flite1-dev – A small run-time speech synthesis engine – static libraries
      flite – A small run-time speech synthesis engine
      festvox-don – minimal British English male speaker for festival
      festvox-rablpc16k – British English male speaker for festival, 16khz sample rate
      festvox-rablpc8k – British English male speaker for festival, 8khz sample rate
      festvox-ellpc11k – Castilian Spanish male speaker for Festival
      libflite1 – Small run-time speech synthesis engine – shared libraries

      View CommentView Comment
  7. Things to install on newly installed GNU / Linux (My favourite must have Linux text and GUI programs missing in fresh Linux installs) - ☩ Walking in Light with Christ - Faith, Computing, Diary ☩ Walking in Light with Christ – Faith, Computing, D says:
    WordPress 4.7.11 WordPress 4.7.11
    The Incutio XML-RPC PHP Library -- WordPress/4.7.11

    […] be used to convert basic PDFs or DOC files to listen them if you're lazy to read I've explained on how you can use festival to read speak for you PDFs and DOCs, ODF (Open Document F… […]

    View CommentView Comment

Leave a Reply

CommentLuv badge