Google Code-In 2012/OCR document reformatting: Difference between revisions

No edit summary
No edit summary
 
(One intermediate revision by the same user not shown)
Line 17: Line 17:
3) Save file as either Pagennn.xls or Pagennn.ods.  Both formats are acceptable.
3) Save file as either Pagennn.xls or Pagennn.ods.  Both formats are acceptable.


4) Visually review (copyedit) each of the PDF file and spreadsheet file pairs.  Try to correct any errors introduced by the OCR process (for example a a "b" is sometimes OCR'ed as an "h" and vice-versa. Pay special attention to characters with accents of other diacritical marks.
4) Visually review (copyedit) each of the PDF file and spreadsheet file pairs.  Try to correct any errors introduced by the OCR process (for example a a "b" is sometimes OCR'ed as an "h" and vice-versa.fo Pay special attention to characters with accents or other diacritical marks.


5) Repeat process from step 3 with next 9 Page files in the zip.
5) Repeat process from step 3 with next 9 Page files in the zip.


6) Submit the spreadsheet files to the task mentor for review/sing-off.  
6) Submit the spreadsheet files to the task mentor for review/sign-off.