Aim & Introduction
Aims to read millions of unstructured bibliographic references as input into the system and recognize all the elements in each reference.
These references are like Journal Articles, Book Chapters, Conference Papers etc.,
The objective of the project is creating a database for easy reference by recognizing all the elements of Bibliographic references with 95% accuracy and 99% precision.
Environment
  • Grammar using PERL Regular Expressions
  • Parser using Java
  • Intelligence using
  • Knowledge Base
  • Pattern Recognition
  • Algorithms
  • Tools for Test, Classification, Regression Test etc.,
  • Separate teams for Development, Testing, Analysis
Testing & Analysis
  • Rigorous Testing
  • Initially using 1000 refs test data
  • Subsequently using up to 30,000 refs test data
  • Regression Testing compared thru Automated script
  • Stamping process time for each ref in the output
  • Dedicated team for Analyzing test outputs for feedback to development team
                                                                                                                                           previous