Project: Agora - service for collection and analysis of advertisements
The project's purpose is to find ads on websites and sort them by category. We scrape ads from each website and compute a hash of every ad image so that identical images can be detected later. Then we scrape the title, text, and link of each ad. Next, we take a screenshot and save the HTML of the web page, linking both to the ad. All gathered data is then stored in a database. The ad text is extracted with a library that scrapes only the main text of a web page. A web application displays the scraped ads; there you can filter ads by date and other criteria.
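The hashing, storage, and date-filtering steps described above can be sketched as follows. This is a minimal illustration and not the project's actual code: the SHA-256 hash, the SQLite database, the `ads` table layout, and the function names `image_hash`, `open_store`, `save_ad`, and `ads_between` are all assumptions made for the example. A hash of the raw bytes only matches byte-identical files, so a re-encoded or resized copy of the same image would not be caught by this approach.

```python
import hashlib
import sqlite3
from datetime import date


def image_hash(image_bytes: bytes) -> str:
    """Return a hex digest that is equal for byte-identical images (assumed: SHA-256)."""
    return hashlib.sha256(image_bytes).hexdigest()


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open the database and create the hypothetical `ads` table if needed."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS ads (
               image_hash TEXT PRIMARY KEY,
               title      TEXT,
               text       TEXT,
               link       TEXT,
               category   TEXT,
               scraped_on TEXT   -- ISO date, e.g. 2020-01-31
           )"""
    )
    return conn


def save_ad(conn, image_bytes, title, text, link, category, scraped_on: date) -> bool:
    """Store an ad unless one with the same image hash exists. Return True if stored."""
    digest = image_hash(image_bytes)
    exists = conn.execute(
        "SELECT 1 FROM ads WHERE image_hash = ?", (digest,)
    ).fetchone()
    if exists:
        return False  # identical image already recorded
    conn.execute(
        "INSERT INTO ads VALUES (?, ?, ?, ?, ?, ?)",
        (digest, title, text, link, category, scraped_on.isoformat()),
    )
    conn.commit()
    return True


def ads_between(conn, start: date, end: date):
    """Return (title, link) pairs for ads scraped within [start, end], oldest first."""
    rows = conn.execute(
        "SELECT title, link FROM ads "
        "WHERE scraped_on BETWEEN ? AND ? ORDER BY scraped_on",
        (start.isoformat(), end.isoformat()),
    )
    return rows.fetchall()
```

For example, calling `save_ad` twice with the same image bytes stores the ad only once, and `ads_between(conn, date(2020, 1, 1), date(2020, 1, 31))` returns only the ads scraped in January 2020. ISO-formatted date strings are used so that plain string comparison in SQL orders them chronologically.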