A Study of Information Extraction for Digitalization