Latest Updates: Our Blog

Advanced Boolean Searches

Posted
Sep 27th, 2011

Tags
Workspace

Author
Jeremy Ashkenas

DocumentCloud now supports advanced boolean search queries, making it easier to home right in on the documents you’re trying to find. You may be familiar with boolean operators from other search engines, but here’s a quick refresher on the available options:

  • and: both terms must exist in the document. Example: Perry and Romney
  • or: either term may match. Example: indicted or accused
  • !: the term must not exist in the document. Example: obama !barack
  • *: a wildcard that matches any sequence of letters. Example: J*e Smith (matches Joe, Jane or Jake Smith)
  • ( ): group words together into a single term. Example: (Perry or Romney) and governor
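
For readers who build queries programmatically, the operators above can be composed as plain strings. The sketch below is only an illustration: the helper names (all_of, any_of, exclude, group, wildcard_matches) are hypothetical and not part of DocumentCloud, and the wildcard check is a local approximation of "any sequence of letters", not the search engine's own matching.

```python
import re

# Hypothetical helpers (not part of DocumentCloud) that compose query
# strings using the operators from the list above.

def all_of(*terms: str) -> str:
    """Join terms with `and`: every term must appear in the document."""
    return " and ".join(terms)

def any_of(*terms: str) -> str:
    """Join terms with `or`: at least one term must appear."""
    return " or ".join(terms)

def exclude(term: str) -> str:
    """Prefix a term with `!`: the term must not appear."""
    return "!" + term

def group(expression: str) -> str:
    """Wrap an expression in parentheses so it is treated as one term."""
    return "(" + expression + ")"

def wildcard_matches(pattern: str, word: str) -> bool:
    """Local approximation: `*` stands for any run of ASCII letters."""
    regex = "".join(
        "[A-Za-z]*" if ch == "*" else re.escape(ch) for ch in pattern
    )
    return re.fullmatch(regex, word) is not None

# The five example searches from the list, rebuilt with the helpers.
queries = [
    all_of("Perry", "Romney"),
    any_of("indicted", "accused"),
    "obama " + exclude("barack"),
    "J*e Smith",
    all_of(group(any_of("Perry", "Romney")), "governor"),
]

for query in queries:
    print(query)
```

Each printed line is a string you could paste into the search box. Grouping first and then joining with and keeps the or from swallowing the final term.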

 

Here’s an example of what that last search looks like in action:

Behind the scenes, we’re using the latest stable release of the open-source Solr/Lucene search engine (3.4.0). It includes a new query parser called “edismax” that adds boolean operators to the previous implementation of full text search.
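
If you run your own Solr instance, a request that selects the edismax parser might look roughly like the sketch below. The host, path, and field name are assumptions made for illustration, not DocumentCloud's actual configuration, and the post does not describe how the operators shown earlier are translated before they reach Solr. The defType, q, qf and wt parameters are standard Solr request parameters.

```python
from urllib.parse import urlencode

# Assumed values for illustration only: the host, select path, and the
# "text" field name are hypothetical, not DocumentCloud's real setup.
SOLR_SELECT_URL = "http://localhost:8983/solr/select"

def edismax_url(query: str, fields: str = "text") -> str:
    """Build a Solr select URL that asks for the edismax query parser."""
    params = {
        "defType": "edismax",  # use the Extended DisMax query parser
        "q": query,            # the boolean query itself
        "qf": fields,          # field(s) to search (assumed name)
        "wt": "json",          # ask for a JSON response
    }
    return SOLR_SELECT_URL + "?" + urlencode(params)

print(edismax_url("(Perry OR Romney) AND governor"))
```

The sketch only builds the URL and does not send a request, so it can be run without a Solr server.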

Give boolean searches a spin, and let us know if they’re working well for your ongoing projects.

We’re at #eij11 — come by Kni…

Posted
Sep 27th, 2011

Tags
Twitter

Author
documentcloud

We’re at #eij11. Come by Knight’s Innovation panel at 2:30 to get a great intro to @documentcloud. /abh

Neat CPI use of DocumentCloud …

Posted
Sep 27th, 2011

Tags
Twitter

Author
documentcloud

Neat CPI use of DocumentCloud note embeds to compare signatures: http://t.co/yVoE4Bdq /ja

We released a feature this pas…

Posted
Sep 26th, 2011

Tags
Twitter

Author
documentcloud

We released a feature this past week: Printing Document Annotations! Learn more about it on our blog: http://t.co/2wdfGRj3 /th

Printing Document Annotations

Posted
Sep 26th, 2011

Tags
Documents

Author
Ted Han

We’ve been hard at work during our short hackathon in Columbia, Missouri, at DocumentCloud’s new home in the Investigative Reporters & Editors office. As a result, we’ve rolled out a new feature that lets readers and journalists print the annotations made on documents.

Journalists have been publishing documents through DocumentCloud for a while now, and annotating them both for readers and for their own story-writing process. We think it’s just as important for DocumentCloud to make story writing quicker and easier as it is to help readers find primary source material.

So, when Marshall Allen of ProPublica told us that he would like to try using DocumentCloud to take his story notes, we did our best to help out. As a result, you can now select one or more documents in the workspace and choose “Print Notes” under the “Publish” menu.

This way you can annotate your sources in DocumentCloud and have a single copy of all your research at hand for your copy editor, or to read when your flight attendant announces that all power switches should be in the off position.

And readers can find a “Print Notes” link in the sidebar footer of the document viewer too.

We hope this will help readers and journalists alike note and collect information in the format that best suits their workflows. Happy printing (and remember to recycle)!

Busy week! Our upload queue is…

Posted
Sep 23rd, 2011

Tags
Twitter

Author
documentcloud

Busy week! Our upload queue is a bit backed up, so we’re spinning up several more worker machines to cope. Thanks for bearing with us /th

If you’re in Columbia, MO, @ja…

Posted
Sep 21st, 2011

Tags
Twitter

Author
documentcloud

If you’re in Columbia, MO, @jashkenas and @knowtheory will be speaking at Broadway Brewery tonight, at 7 p.m. http://ow.ly/6AFfa

Welcome Aboard, Ted Han

Posted
Sep 21st, 2011

Tags
IdeaLab,People

Author
Amanda Hickman

Back in August, we announced that we’d be welcoming a new lead developer, but he’s been on the job two weeks already and we managed to forget to say anything like “Welcome aboard!”

Well, better late than never.

If you want to re-OCR the text…

Posted
Sep 13th, 2011

Tags
Twitter

Author
documentcloud

If you want to re-OCR the text of an existing document: open it, and click “Reprocess Text” under the “Text Tools” section. /ja

Just pushed out an upgrade to …

Posted
Sep 13th, 2011

Tags
Twitter

Author
documentcloud

Just pushed out an upgrade to our OCR (via Docsplit). DocumentCloud should be able to read fine print much better now. /ja

DocumentCloud: Over 10 million…

Posted
Aug 16th, 2011

Tags
Twitter

Author
documentcloud

DocumentCloud: Over 10 million primary source documents served. /ja

Thanks to @A_L, you can now us…

Posted
Aug 15th, 2011

Tags
Twitter

Author
documentcloud

Thanks to @A_L, you can now use the Document Viewer JavaScript API to navigate pages, and listen for page changes: http://t.co/2k4FM7Q /ja

New DocumentCloud API feature:…

Posted
Aug 11th, 2011

Tags
Twitter

Author
documentcloud

New DocumentCloud API feature: You can upload a doc hosted on a public server by passing the URL. Good for gov docs http://t.co/M9y8Er8 /ja

New version of VisualSearch.js…

Posted
Aug 11th, 2011

Tags
Twitter

Author
documentcloud

New version of VisualSearch.js is out: http://t.co/kfoKpZg. Version 0.2.0 adds support for multiple search boxes. /sc

Looking for some great example…

Posted
Aug 9th, 2011

Tags
Twitter

Author
documentcloud

Looking for some great examples of @documentcloud in local or national environmental reporting. Your favorites, please! /abh

What can you do with our API? …

Posted
Aug 2nd, 2011

Tags
Twitter

Author
documentcloud

What can you do with our API? Tons. For example: http://blog.documentcloud.org/blog/2011/08/getting-the-most-out-of-documentcloud/ /abh

Getting the Most out of DocumentCloud

Posted
Aug 2nd, 2011

Tags
Workspace

Author
Amanda Hickman

Updated! How I left MuckRock out is beyond me. There may be more updates as appropriate.

If you’re new to programming, looking at what others have done is probably the best way to get your bearings. DocumentCloud is no exception. You asked for more, better API examples. We’re long overdue for a roundup of some of the great tools DocumentCloud users have built on our API, or by otherwise poking their heads under the hood.

Sure thing @dancow. We’ll get …

Posted
Jul 26th, 2011

Tags
Twitter

Author
documentcloud

Sure thing @dancow. We’ll get right on that. /abh

Great set of supporting docume…

Posted
Jul 26th, 2011

Tags
Twitter

Author
documentcloud

Great set of supporting documents: http://t.co/nfOi6dQ … for @latimes series on the Sinaloa cartel: http://t.co/BECSXEr /ja

DocumentCloud now has “freelan…

Posted
Jul 20th, 2011

Tags
Twitter

Author
documentcloud

DocumentCloud now has “freelancer” accounts, for times when you only want to share access to a single project: http://t.co/c62iH5n /ja

New Feature: Accounts for Freelance Contributors

Posted
Jul 20th, 2011

Tags
Accounts,Workspace

Author
Jeremy Ashkenas

On large document-driven projects, newsrooms often bring together teams of collaborators that include independent researchers who aren’t formally part of the newsroom. Newsrooms that want a research team to evaluate thousands of documents — more than our collaboration tools are designed to accommodate — can take advantage of our new access level: the freelancer. A “freelancer” can upload, annotate, and edit documents like any other user, but they can only access documents you’ve explicitly shared with them.

To add a user (or ten) who is going to be contributing reporting but shouldn’t have access to the rest of your newsroom’s documents, you can create an account for a freelancer.

Freelancer accounts are good for anyone that you regularly work with, but who doesn’t actually work for your organization, or for folks you’re bringing together on a single reporting project.

For more information, check out our accounts documentation.

Tip: you can link to any page …

Posted
Jul 19th, 2011

Tags
Twitter

Author
documentcloud

Tip: you can link to any page by appending #p7 to a document URL, e.g. http://bit.ly/qvd0W8 /abh
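
As a small sketch of that tip, the helper below appends a page fragment to a document URL. The function name and the example URL are hypothetical, and the only behaviour assumed is what the tip states: a #p followed by the page number.

```python
def page_link(document_url: str, page: int) -> str:
    """Return document_url with a #p<page> fragment.

    Any fragment already on the URL is replaced.
    """
    base = document_url.split("#", 1)[0]
    return f"{base}#p{page}"

# The URL below is a made-up placeholder, not a real document.
print(page_link("https://www.documentcloud.org/documents/12345-example.html", 7))
```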

The Telegraph uses DocumentClo…

Posted
Jul 13th, 2011

Tags
Twitter

Author
documentcloud

The Telegraph uses DocumentCloud to annotate News of the World articles that may have involved phone hacking. http://t.co/m2VEflY /ja

It’s been a big week for prima…

Posted
Jul 9th, 2011

Tags
Twitter

Author
documentcloud

It’s been a big week for primary sources on the web — we’ve passed 1 million documents viewed in the past 7 days. /ja

Great new example of DocumentC…

Posted
Jul 6th, 2011

Tags
Twitter

Author
documentcloud

Great new example of DocumentCloud integration from @NPRStateImpact. New York Hydrofracking Memo: http://t.co/1CS92aZ