Sinon is a Java tool that extracts textual information from Web sites. In other words, it can scrape any kind of text (HTML included) available on the Internet or in a filesystem. The extraction is driven by an XML file.

License

Apache Software License

Additional Project Details

Languages

English

Intended Audience

Information Technology, Developers

Programming Language

Java

Related Categories

Java Topic Software

Registered

2005-05-03