Corpus Historia Peru UNMSM

OCR pipeline for extracting history Q&A from UNMSM admission exams.

Exam Q&A extractor using Mistral OCR to extract multiple-choice questions about Peruvian history from Universidad Nacional Mayor de San Marcos (UNMSM) admission exams spanning approximately 1970 to 2020.

[Space for corpus extraction visualization]