What is Xtroo?

Xtroo is a content and article extraction tool. In the simplest terms, you provide Xtroo with the address of any web page, and it will extract the useful content of that page and return it into a neat JSON package ready for you to do as you wish.

We identify the important parts of a page, and prune the unwanted items such as comments, adverts, and unrelated articles, all the while keeping the core article components such as the article text, videos and images, and important metadata like the article author and the articles posted time.

Curious as to how it all fits together? Why not take us for a quick spin with our online Test Drive function.
Responsive image

Why use Xtroo?

Xtroo takes the hassle out of article extraction by using a combination of artificial intelligence and machine learning, it greatly increases the relevancy of information of the returned results. No more picking through hundreds of lines of code just to find the actual article itself, let Xtroo do all the heavy lifting for you, allowing you to easily pick what you need from the JSON output.

Xtroo is also constantly evolving, we are constantly at work on improving its learning algorithms, or working to implement new functions and features that can further improve the service that Xtroo offers.

Not only that, but we built Xtroo from the ground up to be quick, efficient, accurate, but most importantly, affordable. We’ve put together several pricing tiers depending on your requirements, and if that's not enough to convince you, why not sign up for a 14 day free trial, no credit card or bank details required.

How it Works

Once you have created an account, Xtroo will give you a token that you can use for calling our API.

From then it’s just a simple case of setting your CURL or Guzzle request to our endpoint, along with your key and the site to analyse, and then parsing the response. As long as your account is active and has credit, you’ll receive a JSON packet back that contains all of the relevant information Xtroo was able to obtain from the site, in both HTML formatted and plain text versions.

For more information on the API and making requests, feel free to have a look over our documentation.