Sinon is a Java tool that extracts textual information from websites. It can scrape any kind of text (HTML included) available on the Internet or in a filesystem. The extraction is driven by an XML file.
License
Apache Software License