Interested in seeing firsthand how our software works? Sovren would welcome the opportunity to schedule an online demo with you. Please contact us anytime or fill out this form.

Scalability

Our software runs in desktop applications, multi-tenant SaaS architectures, global server farms, and everything in between. It is fully multi-threaded and cost-effective to scale up and out.

Parser FAQs

Q.
What does the Sovren Document Converter do?
A.

The Sovren Document Converter converts resumes from their native formats to a straight ASCII text format that is usable by the Sovren Resume/CV Parser. The Sovren Document Converter does not require any of the authoring software (Microsoft Word, Adobe PDF, etc.), as it is completely self-contained. The Converter can convert dozens of resumes per second.

In addition to converting documents to text, the Sovren Document Converter can convert Microsoft Word documents into formatted HTML. This HTML version looks like the original Word document, but is actually safe and lightweight HTML (virus free!).

Q.
What is the Sovren Resume/CV Parser?
A.

The Sovren Resume/CV Parser is a component that performs the following functions:
1. Parses converted resumes to extract hundreds of pieces of important data.
2. Outputs that data in an open industry-standard HR-XML Resume format, or to text, or to CSV.

Q.
In technical terms, how does the product accomplish its tasks?
A.
Q.
How long has the product existed?
A.

The earliest predecessor of the Sovren Resume/CV Parser was built in 1996. Sovren is the oldest surviving parsing technology and we were the first company to be certified by HR-XML.org on the Resume 2.x standard.

Q.
How scalable is the Sovren Resume/CV Parser?
A.

The Sovren Resume/CV Parser powers the two largest career sites on the Internet. You can't get more scalable than that. On modern server hardware, the Sovren Resume/CV Parser generally completes parsing in under 1 second (regardless of language or region). Higher quality resumes will parse faster than low quality resumes. Every single resume section type is configurable on-the-fly to be turned "on" or "off" as desired.

Based on a real-world customer experience in May 2011, batch processing 2.5 million resumes on a dual Xeon X5690 3.46 GHz yielded an average throughput of 35 resumes per second. That's 126,000 per hour, 3 million per day, 1.1 billion per year -- on one multi-core server running the default configuration of our BulkParser sample application.

Just as importantly, the parser easily scales to the low-end. Many customers add the parser to their existing application server (which is important for ISVs), and some even embed the parser directly into their desktop client applications.

Q.
Does the parser have a built-in skills list?
A.

Yes. The parser ships with a built-in skills taxonomy covering approximately 50 top level categories (Legal, Retail, Accounting and Finance, Medical, HR, IT, etc.); approximately 500+ sub-categories; and 25,000+ skills, some of which are further grouped into synonym hierarchies. More importantly the Sovren Resume/CV Parser will tell you where it found each skill (such as in work history, job objective, qualification summary etc.). Each skill or competency as they are tagged in HR-XML will also be tagged with the years of experience and when that skill was last used as identified with work experience. This is a key piece of information in beginning to understand who a candidate is, who she was, and what she may be qualified to do from a career perspective.

Q.
What if we want to use our own skills list?
A.

No problem. The parser allows you to create an unlimited number of custom skills taxonomies, and you can specify which taxonomies that you want the parser to use on a per-resume-parsed basis. You can use our skills list, add to our skills list, or build your own skills lists.

Q.
What sample applications are available?
A.

We can supply over a dozen ready-to-run, fully commented sample applications, which are all included with the license to the parser. Clients are free to modify and use the source code to these applications. Applications include:

  • A VB6 application
  • An ASP.NET online resume submission/conversion/parsing/editing application:
    o A user can upload her resume, and the application will convert it, parse it, and write the results back to sectioned editing screens (Contact Info, Skills, etc.) for the candidate to verify and edit. This saves candidates incredible amounts of time while providing you with the most useful data for searching.
  • Two versions of a zero-code ready-to-run File System Watcher application:
    o This application monitors a client-designated incoming file folder. When a new resume is written to that folder, the application uses the Sovren Document Converter to convert the resume to ASCII text and then stores that converted text into a client-designated folder. This converted-text-output folder is also monitored for new files, and as soon as a new conversion file is written to that folder, the application reads the new file and parses it with the Sovren Resume/CV Parser. The parsed HR-XML record is then saved to a user-designated output folder. This folder can in turn be monitored by an application that loads the HR-XML records and indexes them into a search engine and/or stores them into a database.
  • An interactive WinForms.NET desktop application that performs parsing and displays the results as human-readable text or as HR-XML
  • A desktop or server batch processing application that can process whole folders of resumes, saving the converted text, the parsed text, and (optionally) performance and accuracy metrics
  • A Sovren Document Converter Web Service
  • A Sovren Resume/CV Parser Web Service
  • A command-line (or, "console") application
  • An email processor application that monitors a list of POP accounts and automatically determines what to process (it distinguishes between cover letters, resumes, and reference letters), converts and parses the resume, and stores it into a user-defined folder.

Q.
What is the Sovren Document Converter?
A.

The Sovren Document Converter is a.NET Assembly and/or an ActiveX control that converts resumes from their native formats to a straight ASCII text format that is usable by the Sovren Resume/CV Parser. This is the first step to normalize the data for the parser. The Sovren Document Converter does not require any of the authoring software (Microsoft Word, Adobe PDF, etc.), as it is completely self-contained. The Converter can convert as many as 100 resumes per second (YMMV).

Q.
What file types does the Sovren Document Converter support?
A.

The Sovren Document Converter, which is included with the Sovren Resume/CV Parser, provides lightning-fast conversion of essentially all commercially used recruitment document formats including Microsoft Word (including DOCX), Adobe PDF, Corel WordPerfect, HTML, MS Office HTML, Open Office (ODF), Rich Text (RTF), and various other formats.

Q.
How complicated is the product to install?
A.

For the parser, integration only requires the installation of two files - one for the Sovren Document Converter and one for the Sovren Resume/CV Parser. Click-and-go, zero-code, ready-to-run solutions are available at no extra charge. You can be running your own instance of the SovrenConvertAndParse Web Service in about 20 minutes. Installation of the optional Sovren Semantic Matching Engine is far more involved, and requires substantial integration after installation.

Q.
Is the Sovren Resume/CV Parser compatible with COM environments? Can I use it from VB6?
A.

Yes, and Yes. The Sovren Resume/CV Parser is written completely in C# using the Microsoft .NET Framework. However, the Parser integrates seamlessly into COM-compatible environments such as Delphi 6-7 and Visual Basic 6. For clients who wish to integrate into COM environments, we supply the Parser as a late-bound COM control, and we supply the corresponding Type Library.

Q.
Is the Sovren Resume/CV Parser compatible with non-Windows environments like J2EE, Linux, and ColdFusion?
A.

Integration with non-Windows environments is extremely simple and can be accomplished with any of our zero-code deployment options. However, as a Microsoft Windows .NET application, the Parser itself must be run on a Windows OS machine. About half of our customers primarily run non-Windows environments. About 50% of our customers are non-Windows shops who integrate the Parser via running an instance of the supplied SovrenConvertAndParse Web Service

Q.
Is the parser compatible with Microsoft.NET?
A.

YES. The Sovren Resume/CV Parser is a native C#/.NET component and the Sovren Document Converter is a .NET assembly (with optional additional Converter functionality provided through traditional Windows DLLs).

Q.
Is the Parser multithreaded?
A.

Yes. The Parser can run multi-threaded or single-threaded. In IIS webserver applications, IIS will automatically manage the thread pool for you since the parser is a native .NET component.

Q.
Can we purchase the source code?
A.

Yes, if approved, and at substantial additional cost.

Q.
What version of the .NET Framework is required?
A.

Only the free runtime is required. The Parser runs on the .NET Framework v2.0 or higher runtime, which ships with MS Windows.