The purpose of the project is to aggregate as many as possible active users using Artificial Intelligence, organizations providing AI technology, institutions teaching AI and scholars / experts having a deep understanding of AI.
- Searching the web for AI related content
Identifying websites from all over the world that contain AI relevant content and has an about that indicates it is a company that either produces AI relevant products or other products for sale.
- Creating data-sets from that content
Creating data sets with URL, about, company name, product or services identification relevant to the AI research.
- Training the system
The data-sets should be reviewed by selected human reviewer and then provide feedback to the system if the page is relevant and the category (produces or consumes AI) is accurate.
- Optimizing the algorithms
So that the system can grind through the Internet, finding hundreds of thousands of pages and make correct assumptions.
- Content extraction
Extracting selected specific content from the pages including relevant people, location, etc.
Rating the content for relevancy.
- Summary page
Creating a summary page by selecting the most relevant information, organization name, people, location.
- Compiling all resulting pages into a single document
This document includes also automatically generated statistics like number of companies from each type by country, number of people and so forth.
After starting the program, the only output of this process shall be a 500 page word document.
The above obviously is just to give an idea. The final concept will be discussed with the most compelling ideas and then creating the solution.