Moreover UGC Metabase

Power business information services and Web applications with real-time blogs and social media.

  • Indexing over 250,000 spam-free blogs in real-time, and counting
  • All major platforms covered including WordPress, LiveJournal, Blogger, TypePad
  • Ranked and categorised by topic, language, publishing platform and more
  • Client-side data for complete control over design and integration

The Moreover UGC Metabase is a live, continuously updating database of aggregated blogs, podcasts and other social media. Customers receive the data through an enriched XML feed and build a client-side index of live blog posts, which clients then deploy in their original applications and products.

Power Blog Search and Social Media Monitoring

The Moreover UGC Metabase powers business applications by leading companies across multiple markets, including online search engines, B2B and B2C portals, publishers, professional networking sites, business information and media monitoring services.

Our customers are as varied as there are uses for real-time blogs and other social media, and cover both business and consumer-focused applications, including

  • Media monitoring service providers
  • PR and reputation management tools
  • Blogs for B2B and B2C portals and directories
  • Blogs and social media for search engines
  • Blogs and social media for publishers

The Metabase is provided in pure data format, leaving customers free to plug it into their IT systems and design their own implementations on top.

Real-time posts from thousands of blogs

Moreover's proprietary indexing technology continuously monitors thousands of bloggers across the blogosphere for all the latest posts. Powered by Weblogs.com, one of the Web's largest ping servers, the UGC Metabase collates the data into a single index and covers all the major blogging platforms, including but by no means limited to: WordPress, LiveJournal, TypePad, Blogger, Movable Type, MySpace Blogs, Microsoft Spaces, Canal Blog, Community Server and many more.

Categorised and enriched with metadata

Each blog post in the UGC feed is analysed, categorised and tagged with additional metadata, allowing customers to intelligently index, sort and serve the links in their products and applications. A blog post can have as many as 35 different pieces of data attached to it, including:

  • A 1-to-10 blog ranking, plus a blog's individual ranking within the index
  • Industry and topical categorisation of the blog and its posts
  • Publishing platform of the blog, e.g. Wordpress, LiveJournal, etc.
  • Extracted list of links referenced in the blog post
  • Language of the blog post
  • Media type such as podcast
  • Extracted mentions of companies including stockticker data

The 'standard' blog data from the original RSS feeds is of course also included, such as the title of the post and the link to it, the name and Web address of the blog, the time the post was indexed, tags describing the contents as provided by the author, links back to comments, etc.

Delivered in a single stream via a simple API

Clients access the Metabase XML feed through standard HTTP calls over the Web, scheduled at regular intervals to receive all the latest posts since the previous call. The (RESTful) API for calling the Metabase is simple and provides all the data in a single feed - so no messing about with multiple feed calls and complex interactions.

Because the data is delivered in Web-standard XML, it can be readily indexed by all search and database technologies, from where it can be deployed into practically any application environment.

For: media monitoring companies, portals, publishers, search engines
Related products: News Metabase