Artificial Intelligence Solution
for Houston-based Startup

Background

An artificial intelligence startup in Houston approached us to develop a SaaS-based application. The application provides named-entity recognition and information extraction from news media and social media sites to quantitative-research-driven industries, delivered through a subscription-based module.

Objective

To provide a comprehensive solution comprising a subscription-based PHP website, a big data repository that houses millions of online data records in a condensed format, and a search engine that queries the data using Artificial Intelligence (Natural Language Processing) based on factors such as sensitivity score, sentiment analysis, keywords, authors and other search criteria. The solution also had to provide statistical analysis, including k-means clustering, LDA topic extraction and other meaningful scientific data, so that data scientists and financial analysts in verticals like Oil & Gas, Finance/Banking and Healthcare can make their predictions and investment decisions.
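To make the clustering step concrete, here is a minimal plain-Python sketch of k-means (Lloyd's algorithm). It is an illustration under stated assumptions, not the project's implementation: the function name `kmeans`, the made-up 2-D feature vectors, and the naive "first k points" seeding are all hypothetical, and a production system would more likely rely on a library routine with smarter seeding.

```python
def kmeans(points, k, iterations=10):
    """Cluster equal-length numeric tuples with Lloyd's algorithm."""
    # Naive seeding (assumption for this sketch): use the first k points.
    centroids = [points[i] for i in range(k)]

    def nearest(p):
        # Index of the centroid with the smallest squared Euclidean distance.
        distances = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids]
        return distances.index(min(distances))

    for _ in range(iterations):
        # Assignment step: group each point under its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p)].append(p)
        # Update step: move each centroid to the mean of its cluster,
        # keeping the old centroid if a cluster ended up empty.
        centroids = [
            tuple(sum(dim) / len(cluster) for dim in zip(*cluster))
            if cluster else centroids[i]
            for i, cluster in enumerate(clusters)
        ]

    labels = [nearest(p) for p in points]
    return centroids, labels


# Made-up 2-D feature vectors standing in for vectorized documents.
points = [(1.0, 1.0), (8.0, 8.0), (1.2, 0.8), (8.2, 7.9), (0.9, 1.1), (7.8, 8.1)]
centroids, labels = kmeans(points, k=2)
print(labels)
```

In practice, text would first be converted to numeric vectors before clustering; the sketch skips that step and only shows the assign-then-update loop.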


Solution

At the customer's request, we delivered a comprehensive solution built around a subscription-based PHP website. A big data repository houses millions of website data records in a condensed format. In addition, a search engine queries that data based on factors such as sensitivity score, sentiment analysis, keywords, authors and other search criteria. Lastly, the web application provides statistical analysis, including k-means clustering, LDA topic extraction and other meaningful scientific data, so that data scientists and financial analysts can make key predictions about the topic of interest.
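The multi-criteria search described above can be sketched as a simple in-memory filter. This is only an illustration: the `Article` record, its field names, the assumed sentiment range, and the sample articles are all hypothetical, and the real system queried a search backend rather than a Python list.

```python
from dataclasses import dataclass


@dataclass
class Article:
    """Hypothetical record shape; field names are illustrative only."""
    title: str
    author: str
    text: str
    sentiment: float  # assumed range: -1.0 (negative) to 1.0 (positive)


def search(articles, keywords=(), authors=(), min_sentiment=None):
    """Return the articles that satisfy every criterion that was supplied."""
    wanted_authors = {a.lower() for a in authors}
    results = []
    for article in articles:
        body = article.text.lower()
        # Every keyword must appear in the text (case-insensitive).
        if any(k.lower() not in body for k in keywords):
            continue
        # If authors were given, the article's author must be one of them.
        if wanted_authors and article.author.lower() not in wanted_authors:
            continue
        # If a sentiment floor was given, drop articles below it.
        if min_sentiment is not None and article.sentiment < min_sentiment:
            continue
        results.append(article)
    return results


# Made-up sample data for the sketch.
articles = [
    Article("Rig count rises", "A. Rivera", "Oil output climbed as rigs reopened.", 0.6),
    Article("Bank earnings slip", "B. Chen", "Quarterly profit fell at two lenders.", -0.4),
    Article("Pipeline delayed", "A. Rivera", "Oil pipeline approval was delayed again.", -0.2),
]

hits = search(articles, keywords=["oil"], authors=["A. Rivera"], min_sentiment=0.0)
print([a.title for a in hits])
```

Each criterion is optional, so the same function covers keyword-only, author-only, or combined queries.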


Tools & Technology

  • Backend: Amazon EC2, load balancer, Amazon CloudWatch, AWS Elastic Beanstalk, Amazon Elasticsearch (big data), Apache Spark on AWS (distributed computing), Amazon S3
  • Languages: PHP, Python, JavaScript, MySQL, JSON
  • Third-party libraries and APIs: scikit-learn, SciPy, PySpark, Stripe, JSON to CSV, CSV to JSON, pandas

Results

  • A Google-style search engine that displays results within a fraction of a second
  • Named entities and other scientific data derived for data scientists and analysts using AI (NLP) within seconds, thanks to distributed computing
  • A SaaS-based application that’s auto-scalable, secure and highly robust
  • Data storage that can hold terabytes of data