How does OCR extraction work?