
Team project to convert PDF to text or HTML

Printed From: FSI Language Courses
Category: Language Courses
Forum Name: Member Contributions
Forum Description: If you have course materials and are planning to contribute them to the website, this is the place to let everyone know.
URL: http://fsi-language-courses.com/forum/forum_posts.asp?TID=630
Printed Date: 16 January 2009 at 3:01am


Topic: Team project to convert PDF to text or HTML
Posted By: Manning
Subject: Team project to convert PDF to text or HTML
Date Posted: 22 July 2008 at 3:15am
I've been using the Portuguese course and I would love to have an HTML version which links the exercises with the relevant HTML file.
 
Of course, no such thing exists... yet.
 
So I am wondering if anyone is interested in forming a team to tackle the coursework collectively?
 
We could all choose a single course (starting with a popular one like Spanish or French), and each person would then be assigned a single page of the coursebook to convert to text. Doing a single page should take no more than 5-10 minutes. Once you've completed one page, you can start on another.
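
To make the page-by-page idea concrete, here is a rough Python sketch of how claimed and finished pages might be tracked. The class name PageTracker, its methods, and the volunteer names are all made up for illustration; nothing like this exists for the project yet.

```python
class PageTracker:
    """Tracks which coursebook pages are unclaimed, claimed, or finished.

    Hypothetical helper: the names here are illustrative assumptions only.
    """

    def __init__(self, total_pages):
        self.unclaimed = list(range(1, total_pages + 1))
        self.claimed = {}      # page number -> volunteer name
        self.finished = set()  # page numbers already converted to text

    def claim_next(self, volunteer):
        """Hand the lowest unclaimed page to a volunteer, or None if none are left."""
        if not self.unclaimed:
            return None
        page = self.unclaimed.pop(0)
        self.claimed[page] = volunteer
        return page

    def mark_finished(self, page):
        """Record that a claimed page has been converted to text."""
        if page not in self.claimed:
            raise ValueError(f"page {page} was never claimed")
        self.finished.add(page)

    def remaining(self):
        """Number of pages that are not finished yet (claimed or not)."""
        return len(self.unclaimed) + len(self.claimed) - len(self.finished)


if __name__ == "__main__":
    tracker = PageTracker(total_pages=400)
    page = tracker.claim_next("volunteer_a")
    tracker.mark_finished(page)
    print("Pages still to do:", tracker.remaining())
```

A volunteer would call claim_next when they are ready for a new page and mark_finished when the text is done, which matches the "finish one, start another" approach.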
 
A person could then volunteer to integrate the various text files into a single HTML resource (and add in the MP3 links).
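
The integration step could be sketched roughly as below. This assumes each volunteer hands in a plain text file named like page_001.txt, and that someone supplies a mapping from page numbers to MP3 addresses; the file naming scheme, the function name build_course_html, and the example MP3 path are all assumptions for illustration.

```python
import html
from pathlib import Path


def build_course_html(text_dir, mp3_links, title):
    """Combine page_NNN.txt files from text_dir into one HTML string.

    mp3_links maps a page number to an MP3 URL (hypothetical data).
    """
    def page_number(path):
        # "page_012" -> 12
        return int(path.stem.split("_")[1])

    parts = [
        "<html><head><meta charset='utf-8'>",
        f"<title>{html.escape(title)}</title></head><body>",
        f"<h1>{html.escape(title)}</h1>",
    ]
    # Sort numerically so page 10 comes after page 9 even without zero padding.
    for path in sorted(Path(text_dir).glob("page_*.txt"), key=page_number):
        page = page_number(path)
        text = path.read_text(encoding="utf-8")
        parts.append(f'<h2 id="page-{page}">Page {page}</h2>')
        if page in mp3_links:
            url = html.escape(mp3_links[page], quote=True)
            parts.append(f'<p><a href="{url}">Audio for page {page}</a></p>')
        # Escape the text so stray < or & characters do not break the page.
        parts.append(f"<pre>{html.escape(text)}</pre>")
    parts.append("</body></html>")
    return "\n".join(parts)


if __name__ == "__main__":
    # Hypothetical folder and MP3 path, for illustration only.
    page_html = build_course_html("pages", {1: "audio/unit01.mp3"}, "Basic Course")
    Path("course.html").write_text(page_html, encoding="utf-8")
```

Keeping each page as its own text file means a late or corrected page can be dropped into the folder and the HTML simply rebuilt.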
 
Tackling a 400-page coursebook all on your own could be just too intimidating. But doing only a few pages would not be too much effort for anyone.
 
I'm not actually initiating a project here, just polling for interest. I have my own ideas on how it all might work, but I'd be keen to hear anyone else's ideas too. Of course if no-one is keen then my ideas don't really matter anyway :)
 
Regards
Manning
Sydney Australia



Replies:
Posted By: JennieLynn
Date Posted: 14 August 2008 at 4:23am
I would like to do this too, especially for French and German. Does anyone have recommendations for PDF to HTML software I could use? It's a bit tricky with all of the pages technically being images instead of text, so I might need some good OCR software too.

-------------
http://www.ielanguages.com - Indo-European Languages Website
http://www.ielanguages.com/blog/ - Jennie en France Blog


Posted By: JennieLynn
Date Posted: 15 August 2008 at 3:17am
Started with the French Basic Course! Only unit 1 is finished so far.

I divided it into seven parts so the pages wouldn't be so long, and I've added in links for the mp3s. I'll probably turn the Written exercises section into actual exercises or at least provide the answers (my translations though - I don't know where to find the FSI answers.)

http://www.ielanguages.com/fsi/frenchcontents.html

I'm still looking for a good PDF to HTML converter...

- Jennie





