
What is Regoxa Parser?
Regoxa Parser is more than data capture and extraction. It brings together artificial intelligence (AI), natural language processing (NLP), machine learning (ML), and advanced recognition in a single document automation platform that turns the data inside your business documents into something your systems can use.
By putting document processing automation in your hands, Regoxa Parser helps your organization onboard faster, process more without manual touches, cut costs, and see further into its own processes.
Available in the cloud, on premises, or as an SDK
Regoxa Parser is built for the digital enterprise. Run it wherever it fits your business.
01
On-Premises
Install and run Regoxa Parser inside your own infrastructure.
02
Cloud
Run Regoxa Parser Cloud, securely hosted on Microsoft Azure, with a choice of geographic regions and Azure's built-in security and data protection.
03
SDK
Build your own application with intelligent capture capabilities using Regoxa SDK.
Enterprise document automation starts here
A comprehensive, purpose-built AI platform for acquiring, processing, validating, and delivering the right data into your critical processes.
01
Straight-through processing of business-critical documents
Content entering through any channel, in any format, is automatically extracted, understood, and delivered, removing the friction of manual processing.
02
Smooth transactions, smart decisions, rapid action
Use customer-provided data to accelerate transactions, make better decisions, and give customers quick, accurate responses.
03
Control, predictability, and compliance
Get full chain-of-custody reporting and management to fine-tune results, while keeping the process aligned with your compliance and security models end to end.
04
Intelligent data extraction
NLP automates the identification and extraction of data from unstructured and complex documents alongside structured and semi-structured ones, accelerating transactions while cutting operating costs and errors.
05
Data validation and control
Critical fields, context, and entities are identified, validated, and processed according to your business rules. The system trains easily and uses ongoing machine learning to improve over time.
How Regoxa Parser works
Regoxa Parser is an accurate, scalable document automation platform. It captures, classifies, and moves critical data from unstructured and structured documents to the right process, workflow, or decision engine.
01
Data input
Install and run Regoxa Parser inside your own infrastructure.
02
Classification and recognition
Run Regoxa Parser Cloud, securely hosted on Microsoft Azure, with a choice of geographic regions and Azure's built-in security and data protection.
03
Data extraction and verification
Content and data are extracted and validated automatically, with accurate OCR, ICR, OMR, and barcode recognition. Validation checks data against your databases and built-in rules, and a verification station lets reviewers confirm that extracted fields match the original document.
More on Regoxa Parser
Product availability
01
Deployment options
Regoxa Parser runs where it makes the most sense for you. Configurations can be deployed on premises or in the cloud, and both are compatible. Regoxa Parser Cloud runs on a secure, managed cloud platform with geographic region options.
02
Security
Regoxa Parser Cloud has native security features at both the infrastructure and platform layers. It is designed to meet the criteria for security, availability, processing integrity, confidentiality, and privacy and undergo independent security audits.
03
API and integration
The Regoxa Parser Cloud REST API enables tight integration with your systems. The REST API lets external systems upload documents and receive extraction results, provide quality feedback to improve the model, and build customized verification workflows.
04
Multi-level document classification
AI-based classifiers, trained automatically with current machine learning methods, understand, separate, and route documents. Structured forms, semi-structured invoices, tax forms, claims, and onboarding documents, or fully unstructured correspondence and contracts: all are classified without manual sorting or labelling.
05
Natural language processing
NLP extends capture to unstructured documents such as contracts, leases, agreements, and email, automating processes that once required manual data entry. Extraction quality improves continuously as user feedback trains the NLP models, reducing time spent on verification.
06
Continuous learning
Auto-learning accelerates time to production and reduces ongoing system support and maintenance costs. Users can train the system to handle flexible or irregular document layouts, while the administrator keeps full control to edit, fine-tune, or discard auto-learning results. The system learns continuously from user feedback, drawing on advanced machine learning and NLP.
Technology and capabilities
01
Classification technology
Incoming documents can be classified by form and content to optimize your information-driven processes. Classification detects every incoming document type, including images, using deep-learning convolutional neural networks to sort by appearance or pattern and text classification that relies on statistical and semantic text analysis. You can use either technology on its own or combine both faster response times and quicker decisions.
02
Image processing and enhancement
Image enhancement automatically improves images from mobile devices to optimize processing. It handles documents with complex backgrounds, such as transcripts, identification documents, and transportation forms, optimizing images automatically, or flagging poor quality for immediate feedback. Auto-crop, background whitening, image quality assessment, and custom enhancement profiles for different image sources ensure every document gets processed regardless of its condition or origin.
03
Handwritten ICR
Regoxa Parser recognizes handwritten data using advanced intelligent character recognition (ICR). Extract handwritten data in fields, marks, or full text from bills, receipts, medical forms, prescriptions, applications, claims, invoices, and other financial or transportation documents. Built-in AI speeds up processing and improves recognition.
04
Scalability and performance
Regoxa Parser scales both vertically and horizontally to support high-volume, fast-processing scenarios. Whether you need to process millions of documents per day or thousands of pages per minute, Parser's architecture grows to meet your requirements. Multi-server installations, distributed infrastructure, and operators are all managed through centralized configuration and control.
05
Multi-tenancy
Create a secured, isolated environment for each tenant and apply common policies across different users. Secure, centralized administration tools and separated licenses protect data across multiple workgroups with minimal setup time.
06
Document types
Sophisticated document analysis detects the exact type of paper or digital document (spreadsheets, images, logos, and more) and finds different areas within a document, even when text is difficult to read. Word, Excel, PDF, email bodies, scanned images, and other digital documents can all be processed in the same flow.
Integrations, administration, customization, and more
01
SLA and support
SLA monitoring and analytics let you verify that your systems deliver results within the required timeframes. Set priorities for processing stages, adjust document queue order to meet deadlines, and use standard reports and dashboards to track performance.
02
Data protection
Confidential data within documents can be hidden using different methods during exchange and verification, with access controlled by user role. HTTPS provides bidirectional encryption between user and server, protecting against interception and tampering.
03
Monitoring and analytics
Analyze the document processing flow, verify that KPIs meet business objectives, and optimize or prioritize resources to tune performance. Data collected across processing stages can be shaped into reports that show whether processes meet your SLA targets and whether processing quality, including verification time, is on track.
04
Administration
A command-line interface (CLI) makes it straightforward to administer distributed environments, synchronize product installations, reuse machine learning results across projects, and back up or restore existing projects.
05
Integration
Parser's APIs and scripting enable tight integration with any system of record or engagement, including UiPath, Blue Prism, Pega, Appian, M-Files, Laserfiche, Automation Anywhere, and others. Use a secure SFTP connection to transfer files and export data as JSON.
06
Multi-channel data entry
Process paper and digital documents from multiple sources in a single flow: MFPs, network scanners, email, FTP, web post, hot folders, and mobile devices.
07
Mobile capture
Use mobile devices and other document sources for data entry with high-quality uploads ensured by image enhancement tools. Confirmation reports notify you when images are uploaded and processed correctly. Build the right mobile capture workflow for your use case, drawing on Regoxa's advanced mobile imaging capabilities.
08
Visibility
Monitoring tools and reports on resources, performance, and accuracy give administrators process transparency and predictability, along with clear sight of where improvements can be made.
09
Enterprise readiness
Parser's HTML5 web stations support Chrome, Firefox, Edge, and other browsers. Global teams can distribute and manage business processes across remote locations through a responsive web interface that's accessible anywhere and easy to maintain.
10
Document classification automation
Classify documents by type (driver's license, bank statement, tax form, contract, invoice, and more) and by variation, such as invoices from different vendors, to sort incoming documents automatically and route them to predefined destinations. Regoxa Parser offers image-based, text-based, and rule-based classification methods that can be combined into a hierarchical system for maximum straight-through processing and minimal manual review.
Customization capabilities
01
Flexible workflow customization
Regoxa Parser makes it straightforward to build applications that fit specific internal or outsourced business scenarios. Customization scripts and a web service API let you tailor processing stages and data routing to your exact needs.
02
Document scanning and classification
Scripting capabilities let you customize scanning and classification stages for projects that need specific actions or must follow regulations.
03
Data recognition and extraction
A third-party OCR/ICR engine can be used to recognize any region of a field during a customized recognition stage. The recognition stage, including document assembly, text recognition, and data extraction, can be adjusted to any custom scenario.
04
Autocorrection and data validation
The auto-correction script runs automatically after recognition to replace or modify data in recognized fields. Data validation scripts let you define custom algorithms for validation and normalization, including dictionaries and custom symbol sets.
05
Document image enhancement
Custom scripts provide flexibility in document image enhancement, assembling documents into sets based on user-defined rules rather than standard assembly rules.
06
Verification
Customized verification scripts add control over document-specific functions, change the software behavior for a particular project, or run automatically when a batch, document, or field is processed.
07
Export rules
All processed data can be exported in different formats for further use. Custom export modules with scriptable export deliver data and images directly to external applications, including ECM, CRM, and ERP systems.
08
Web service API
The web service API makes it straightforward to build custom applications or import modules that deliver documents directly to Regoxa Parser for indexing, classification, and data extraction. Data from external applications arrive at the Regoxa Parser processing server over HTTP or HTTPS. Scripts enable embedding of Regoxa Parser web stations into any back-end system and apply custom scenarios, stages, user roles, and interface design.
What is Regoxa Parser?
Regoxa Parser is an enterprise document automation platform that captures, classifies, extracts, and validates data from paper and electronic documents, then delivers it to your systems. It combines AI, NLP, machine learning, and advanced recognition to replace manual data entry with accurate, scalable automation.
What document types and formats can Regoxa Parser process?
Regoxa Parser handles structured forms, semi-structured documents like invoices and tax forms, and unstructured documents like contracts and correspondence. It processes Word, Excel, PDF, email bodies, scanned images, and other digital formats in a single flow.
How does Regoxa Parser extract data from unstructured documents?
It uses natural language processing to identify and extract data from unstructured content such as contracts, leases, agreements, and email, alongside structured and semi-structured documents. Extraction accuracy improves over time as user feedback trains the models.
Can Regoxa Parser run on-premises and in the cloud?
Yes. Regoxa Parser deploys on-premises, as Regoxa Parser Cloud, or as an SDK for building your own capture-enabled application. The same configurations are compatible across on-premises and cloud.
How does Regoxa Parser classify and route documents automatically?
Neural-network and text-based classifiers sort incoming documents by type and subcategory using both appearance and content, then route them to predefined destinations. Image, text, and rule-based methods can be combined into a hierarchy for the highest straight-through processing rate and the least manual review.
How does Regoxa Parser handle handwriting and poor-quality scans?
Intelligent character recognition (ICR) extracts handwritten data from fields, marks, and full text, while automatic image enhancement corrects mobile captures and complex backgrounds through auto-crop, background whitening, and quality assessment.
How does Regoxa Parser scale for high document volumes?
The architecture scales vertically and horizontally to handle more than 3 million documents a day or 2,000 pages a minute, managed centrally across multi-server, distributed deployments, so processing capacity grows with your volume without re-architecting your workflow.
How does Regoxa Parser integrate with existing systems?
REST API and progressive scripting connect Regoxa Parser to your systems of record and engagement, including UiPath, Blue Prism, Pega, Appian, M-Files, Laserfiche, and Automation Anywhere, as well as ECM, CRM, and ERP systems. External systems can submit documents and receive structured results, with secure file transfer over SFTP and JSON export.
How does Regoxa Parser keep data secure?
Confidential data can be masked during exchange and verification according to operator access rights, and HTTPS encrypts traffic between user and server. Regoxa Parser Cloud adds infrastructure and platform-level security controls.
Frequently asked questions
Contact Us
Let’s Connect and Build Intelligent Business Solutions Together.
Ready to Partner with Us?
Contact us today.