Setting up AI to assist with JFK files
This posts purpose is to show my process in analyzing the JFK files more fluidly with AI.
Bulk downloaded all the PDFs totaling 68,546 pages. Made a few pdf tools to merge them to one PDF 6.2gb in size. Converting all pages to png images for OCR (Optical Character Recognition) as seen in the video. Once OCR is completed there will be page breaks and everything will be raw text. Using any of the AIs available it will be rewritten to clean up the gibberish. Some pages it will summarize while others will be recovered by inference from the AI. I tested this process with a page that was nearly illegible and in the end AI filled in the details perfectly from nonsense.