Rule creation: customizing document insights
DocuMine extracts key information from documents and aggregates it into a structured overview, allowing users to gain valuable insights. Users can manually optimize the results and export the extracted data in the formats required for ingestion into other applications.
DocuMine uses the Drools rule engine (version 8.44.0.Final), which lets developers define custom rules for information extraction. A rule set typically applies to documents that share a similar structure and content. Writing effective rules depends on identifying the chapters and/or layout elements that contain the information to be extracted.
This guide covers the Drools topics most relevant to creating custom rules in the DocuMine context. For a deeper look at Drools-specific topics, we recommend the Drools documentation. For details on the Java classes and packages relevant to rule creation, refer to the Javadoc documentation.
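To give a first impression of what a rule looks like, here is a minimal sketch in the Drools rule language (DRL). It only illustrates the general when/then shape of a rule that targets a specific chapter. The package name, the Section class, and its headline property are hypothetical placeholders, not part of the DocuMine API; the actual classes are described in the Javadoc documentation.

```drl
// Hypothetical package and fact class, used for illustration only.
package com.example.rules

import com.example.model.Section

rule "Find the chapter that contains the study title"
    when
        // Match any Section fact whose headline mentions "study title"
        // (case-insensitive regular expression).
        $section : Section( headline matches "(?i).*study title.*" )
    then
        // The consequence is plain Java; a real rule would record the
        // extracted value instead of printing it.
        System.out.println("Matched chapter: " + $section.getHeadline());
end
```

The condition (when) narrows the search to the layout element that holds the information, and the consequence (then) defines what happens with each match. This mirrors the point above that rules are built around the chapters or layout elements where the information is found.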