First I had to design the document importation system. The supported formats are PDF, DOC, DOCX and TXT. PDFs can be read thanks to an OCR (Optical Character Recognition) tool that I have implemented thanks to Tesseract. After importation, we can then view the generated CV or job summaries. In the example below, where I've imported my own CV, we can see that the information has been correctly rendered and grouped: there are the skills I've mentioned, as well as my diplomas, work experience and other information.