RSS feed
<< February 20, 2008 | Home | February 22, 2008 >>

Welcome to CiteSeerX

CiteSeerX has just launched!  This year marks the 10th anniversary of CiteSeer's service to the research community and we are proud to celebrate by revitalizing the project with a brand new system architecture.  Our new system is a ground-up rewrite of the legacy CiteSeer application, designed to address scalability, extensibility, and usability issues that were limiting our ability to keep up with new content and technologies.  We think that CiteSeerX represents a major enhancement over the old system in terms of our stated goals and we hope you agree.

At the time of this writing, we have extended the size of our collection to approximately 810K documents and over 14 million citations.  This represents over 200K more unique documents than can be found in the legacy system, since we filtered out about 150K duplicate records during the conversion process.  Our crawlers have been working hard to scour the web for new content, and we currently have half a million more PDF and PostScript files queued for ingestion.  Not all will pass our filtration system, but stay tuned for large content updates!

CiteSeerX is still in alpha phase until we complete our initial high-volume crawl cycle.  This means that not all features will be enabled for the next couple of weeks.  In particular, user-submitted corrections and content submissions are currently disabled, as well as our new content notification system.  We think many users will enjoy these new and greatly enhanced features and we are working to enable them as quickly as we can.  In the meantime, we hope you explore the several new features that are currently operational, particularly within the MyCiteSeer personal content portal.

Stay tuned for further updates on this page as we roll out new features or enhance old ones.  Please let us know what you think through our feedback channel. We hope you enjoy CiteSeerX!