Well, I have taken on a major PHP-to-ASP.NET upgrade this month. The project is a “resume” storage application that takes common documents and “parses” the text so it can be inserted into a SQL table as text. The parsers must strip out all “non-printable characters”, extra white space, headers, footers, etc.
The PHP version uses regular expressions and common string functions to strip the “garbage” out of the documents so that all that is left is the written text. As I am not completely fluent in regular expressions, this is somewhat of a “troublesome” project.
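To give a rough idea of the kind of cleanup involved, here is a minimal sketch in Python (not the project's actual PHP or ASP.NET code). The function name `clean_text` and the choice of which characters count as “non-printable” are my own assumptions for illustration: it removes ASCII control characters other than tab, newline and carriage return, then collapses every run of white space into a single space. Headers and footers are not handled here, since detecting them depends on the document format.

```python
import re

# Assumption: "non-printable" means ASCII control characters, excluding
# tab (\x09), newline (\x0a) and carriage return (\x0d), which are
# treated as white space by the second pattern instead.
_NON_PRINTABLE = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f\x7f]")

# Any run of white space (spaces, tabs, line breaks).
_WHITESPACE_RUN = re.compile(r"\s+")


def clean_text(raw: str) -> str:
    """Hypothetical helper: strip control characters and squeeze white space."""
    # Delete control characters outright (they are not replaced by a space).
    without_controls = _NON_PRINTABLE.sub("", raw)
    # Collapse each white-space run to one space and trim both ends.
    return _WHITESPACE_RUN.sub(" ", without_controls).strip()


if __name__ == "__main__":
    sample = "Jane\x00 Doe\r\n\r\n\tSenior   Developer\x07 "
    print(clean_text(sample))
```

The same two patterns should carry over with little change to `preg_replace` in PHP or `Regex.Replace` in .NET, though I would test them against real documents before trusting them.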
More to follow...