I am dealing with a flat text file without styles (I'll be adding paragraph styles later). There aren't even tabs. I'll have to figure out a way to insert tabs. Hopefully the number of spaces between “columns” is standardized.
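For the tabs, here is a rough sketch of the idea in plain JavaScript. It assumes the columns are separated by runs of two or more spaces and that no value contains two spaces in a row, neither of which I have verified. `spacesToTabs` is just a name I made up.

```javascript
// Sketch only: collapse each run of 2+ spaces into a single tab.
// Assumption: single spaces occur only inside a column's value
// (e.g. a player's name), never between columns.
function spacesToTabs(line) {
  return line.replace(/ {2,}/g, "\t");
}
```

If the spacing turns out not to be consistent, this would need to split on fixed column positions instead.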
My first step is a script to remove all the junk. I have some programming/JS experience, but it's mostly limited to online work, HTML, and DB management.
Can someone point me toward a good, sensible guide for this type of scripting?
Here is some pseudocode of what I want to do.
=================
Delete all characters left of “/2012” (only on that line)
Replace “London Rippers” with “Rippers”
Replace “Florence Freedom” with “Freedom”
etc…
Replace double linebreaks with single linebreaks
Delete “ AVG”
Delete “ .***”
(I used * to denote wildcard characters)
Replace “ C ” with “ c\t”
Replace “ P ” with “ p\t”
Replace “ 1b ” with “ 1b\t”
Replace “ 2b ” with “ 2b\t”
etc… (\t is a tab, right?)
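=================
Translated into plain JavaScript, the pseudocode above might look roughly like this. It is only a sketch: `cleanStats` is a made-up name, the wildcards in “ .***” are assumed to be digits, and the “ 2b ” rule is assumed to follow the same pattern as the other positions. It only works on a string, so getting the text in and out of the document is a separate problem.

```javascript
// Sketch only: applies the cleanup steps to one big string.
// Written in old-style JavaScript (var, for...in) on purpose.
function cleanStats(text) {
  // Assumption: treat \r\n and lone \r as \n so the later steps see one kind of linebreak.
  text = text.replace(/\r\n?/g, "\n");

  // On any line containing "/2012", delete everything to the left of it.
  text = text.replace(/^.*(?=\/2012)/gm, "");

  // Shorten team names (extend this table with the remaining teams).
  var teams = { "London Rippers": "Rippers", "Florence Freedom": "Freedom" };
  for (var name in teams) {
    if (teams.hasOwnProperty(name)) {
      text = text.split(name).join(teams[name]);
    }
  }

  // Collapse runs of blank lines into a single linebreak.
  text = text.replace(/\n{2,}/g, "\n");

  // Delete " AVG" and values like " .312" (assumes the wildcards are digits).
  text = text.split(" AVG").join("");
  text = text.replace(/ \.\d{3}/g, "");

  // Lowercase the position code and follow it with a tab instead of a space.
  text = text.replace(/ (C|P|1b|2b) /g, function (match, pos) {
    return " " + pos.toLowerCase() + "\t";
  });

  return text;
}
```

For example, `cleanStats("Smith C 4 1 AVG .312\n")` should give `"Smith c\t4 1\n"`.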