<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Posse List &#187; Maura Grossman</title>
	<atom:link href="http://www.theposselist.com/tag/maura-grossman/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.theposselist.com</link>
	<description>Your source for news, commentary and trends in the contract legal market</description>
	<lastBuildDate>Sat, 21 Jan 2012 17:51:18 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>March 3rd in Richmond, VA:  Symposium &#8211; &#8220;Electronic Discovery in a World of Cloud Computing, Data Hoarding, and Social Networking&#8221;</title>
		<link>http://www.theposselist.com/2011/03/01/march-3rd-in-richmond-va-symposium-electronic-discovery-in-a-world-of-cloud-computing-data-hoarding-and-social-networking/</link>
		<comments>http://www.theposselist.com/2011/03/01/march-3rd-in-richmond-va-symposium-electronic-discovery-in-a-world-of-cloud-computing-data-hoarding-and-social-networking/#comments</comments>
		<pubDate>Tue, 01 Mar 2011 15:27:42 +0000</pubDate>
		<dc:creator>mrposse</dc:creator>
				<category><![CDATA[Webinars, Seminars, Surveys]]></category>
		<category><![CDATA[Anthony J. Diana]]></category>
		<category><![CDATA[Bennett Borden]]></category>
		<category><![CDATA[Chief Magistrate Judge Paul Grimm]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[counsel at Wachtell]]></category>
		<category><![CDATA[Data Hoarding]]></category>
		<category><![CDATA[e-discovery]]></category>
		<category><![CDATA[Electronic Discovery]]></category>
		<category><![CDATA[Jason R. Baron]]></category>
		<category><![CDATA[JOL]]></category>
		<category><![CDATA[Journal of Law & Technology]]></category>
		<category><![CDATA[Leslie Haley]]></category>
		<category><![CDATA[Lipton]]></category>
		<category><![CDATA[Maura Grossman]]></category>
		<category><![CDATA[Rosen & Katz]]></category>
		<category><![CDATA[Social Networking]]></category>
		<category><![CDATA[University of Richmond]]></category>

		<guid isPermaLink="false">http://www.theposselist.com/?p=7024</guid>
		<description><![CDATA[On Thursday, March 3, from 1-5pm, the University of Richmond Journal of Law &#38; Technology (JOLT) will host a symposium, “Electronic Discovery in a World of Cloud Computing, Data Hoarding, and Social Networking,” in the law school’s moot court room. Approved for 3.5 CLE credits, the symposium is free and open to the public. Held [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.theposselist.com/wp-content/uploads/2011/03/JOLT.jpg"><img class="alignnone size-medium wp-image-7025" title="JOLT" src="http://www.theposselist.com/wp-content/uploads/2011/03/JOLT-300x61.jpg" alt="" width="300" height="61" /></a></p>
<p>On Thursday, March 3, from 1-5pm, the University of Richmond Journal of Law &amp; Technology (JOLT) will host a symposium, “Electronic Discovery in a World of Cloud Computing, Data Hoarding, and Social Networking,” in the law school’s moot court room. Approved for 3.5 CLE credits, the symposium is free and open to the public. Held in conjunction with the publication of JOLT’s Annual Survey, the symposium will feature the survey’s contributors speaking on the latest legal issues in technology and of the application of e-discovery law.</p>
<p style="text-align: justify;">Presenters include keynote speaker<strong> </strong>Chief Magistrate Judge Paul Grimm for the U.S. District Court of Maryland, who has published some of the most important opinions on e-discovery during the past five years. Grimm will speak about the implications of Federal Rule of Evidence 502 on e-discovery.</p>
<p>Other speakers include:</p>
<ul>
<li>Jason R. Baron, director of litigation for the National Archives, presenting his article exploring trends in e-discovery search.</li>
<li>Bennett Borden, co-chair of the Williams Mullen e-discovery section, presenting his article summarizing 2010 developments in e-discovery law.</li>
<li>Anthony J. Diana, partner at Mayer-Brown, presenting an article he co-authored on e-discovery in social media.</li>
<li>Maura Grossman, counsel at Wachtell, Lipton, Rosen &amp; Katz, presenting an article she co-authored on the proficiency of technology-assisted e-discovery review.  </li>
<li>Leslie Haley, senior assistant ethics counsel, Virginia State Bar, presenting on the ethical hazards posed by the digital age.</li>
</ul>
<p>For full details <a href="http://news.richmond.edu/features/article/law/5398/-richmonds-journal-of-law-and-technology-sponsors-symposium-on-e-discovery.html" target="_blank"><span style="color: #000080;"><strong><em>click here</em></strong></span></a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.theposselist.com/2011/03/01/march-3rd-in-richmond-va-symposium-electronic-discovery-in-a-world-of-cloud-computing-data-hoarding-and-social-networking/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>March 11th &#8212; free webinar: &#8220;Why sound project management is critical to the success of eDiscovery&#8221; (with Maura R. Grossman)</title>
		<link>http://www.theposselist.com/2010/03/10/march-11th-free-webinar-why-sound-project-management-is-critical-to-the-success-of-ediscovery/</link>
		<comments>http://www.theposselist.com/2010/03/10/march-11th-free-webinar-why-sound-project-management-is-critical-to-the-success-of-ediscovery/#comments</comments>
		<pubDate>Thu, 11 Mar 2010 00:35:01 +0000</pubDate>
		<dc:creator>mrposse</dc:creator>
				<category><![CDATA[Webinars, Seminars, Surveys]]></category>
		<category><![CDATA[Georgetown]]></category>
		<category><![CDATA[Georgetown Law Advanced E-Discovery Institute]]></category>
		<category><![CDATA[Georgetown Law E-Discovery Training Academy]]></category>
		<category><![CDATA[LDM Global]]></category>
		<category><![CDATA[LDMglobal]]></category>
		<category><![CDATA[Maura Grossman]]></category>
		<category><![CDATA[Maura R. Grossman]]></category>
		<category><![CDATA[project management]]></category>

		<guid isPermaLink="false">http://www.theposselist.com/?p=5926</guid>
		<description><![CDATA[      Sound project management is vital to any eDiscovery matter.  Without it, data can be overlooked, budgets overrun, deadlines missed, and defensibility compromised.   To ensure that your eDiscovery matter is managed properly, you need more than just a good project manager; you need a well thought-out process that incorporates quality assessment and control. To find [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.theposselist.com/wp-content/uploads/2010/03/LDMGlobal-logo.png"><img class="size-full wp-image-5927 alignleft" title="LDMGlobal logo" src="http://www.theposselist.com/wp-content/uploads/2010/03/LDMGlobal-logo.png" alt="" width="225" height="54" /></a> </p>
<p>    Sound project management is vital to any eDiscovery matter.  Without it, data can be overlooked, budgets overrun, deadlines missed, and defensibility compromised.   To ensure that your eDiscovery matter is managed properly, you need more than just a good project manager; you need a well thought-out process that incorporates quality assessment and control.</p>
<p>To find out how ways you can streamline the eDiscovery process, avoid missed deadlines,  and minimize the potential for costly errors and challenges by opposing counsel, please join LDMglobal for a webinar on &#8220;The Keys to eDiscovery Project Management&#8221;.</p>
<p>The webinar will be held Thursday March 11th at 11:00 AM EST, 4:00 PM GMT, and will feature Maura R. Grossman, Counsel at Wachtell, Lipton, Rosen &amp; Katz. </p>
<p>LDM Global is a company we know well.  It provides services to some of the world’s largest companies.   They have an enormous footprint in the pharmaceutical industry vis-a-vis e-discovery work.   We have been working with them in both the U.S. and in Europe in their search for project managers.   They &#8220;get it&#8221; as far as the fact that different countries and courts require different formats and strategies for crafting responses.   We met their team at LegalTech New York this year, and we are meeting them again in a few weeks in London.   They have great resources in all aspects of discovery, from regulatory compliance to complex litigation support.</p>
<p>Maura Grossman is a member of The Sedona Conference® Working Groups on Best Practices for Electronic Document Retention and Production, and on International Electronic Information Management, Discovery, and Disclosure.  She assisted in drafting and editing The Sedona Conference® Commentary on Achieving Quality in the E-Discovery Process (May 2009).</p>
<p>This past fall The Posse List saw Maura give a brilliant presentation on the challenges of search at the Georgetown Law Advanced E-Discovery Institute (<a href="http://www.theposselist.com/2009/11/15/from-the-georgetown-law-advanced-e-discovery-institute-advanced-search-and-retrieval-technology/" target="_blank"><span style="color: #000080;"><strong><em>click here</em></strong></span></a>).  She is on the faculty of the Georgetown Law E-Discovery Training Academy where this year she gave a presentation on the mapping of the steps of the e-discovery process, an exploration of the full spectrum of procedures and strategies that might occur during the e-discovery process.</p>
<p>Maura is a powerhouse in the e-discovery world and this event will be well-worth attending..</p>
<p>You can register for this webinar <a href="https://www2.gotomeeting.com/register/645933995" target="_blank"><span style="color: #000080;"><em><strong>by clicking here</strong></em></span></a>.   If you require any further details about this webinar, please contact Rebecca Dealtry, on <a href="mailto:rdealtry@ldmglobal.com"><span style="color: #000080;"><strong>rdealtry@ldmglobal.com</strong></span></a>  or +44 (0)20 7613 1160.   </p>
]]></content:encoded>
			<wfw:commentRss>http://www.theposselist.com/2010/03/10/march-11th-free-webinar-why-sound-project-management-is-critical-to-the-success-of-ediscovery/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>From the Georgetown Law Advanced E-Discovery Institute: Advanced Search and Retrieval Technology</title>
		<link>http://www.theposselist.com/2009/11/15/from-the-georgetown-law-advanced-e-discovery-institute-advanced-search-and-retrieval-technology/</link>
		<comments>http://www.theposselist.com/2009/11/15/from-the-georgetown-law-advanced-e-discovery-institute-advanced-search-and-retrieval-technology/#comments</comments>
		<pubDate>Sun, 15 Nov 2009 23:18:01 +0000</pubDate>
		<dc:creator>mrposse</dc:creator>
				<category><![CDATA[Georgetown Law Center: Advanced E-Discovery Institute]]></category>
		<category><![CDATA[Early Case Assessment]]></category>
		<category><![CDATA[ESI]]></category>
		<category><![CDATA[Georgetown Law Advanced E-Discovery Institute]]></category>
		<category><![CDATA[Jason R. Baron]]></category>
		<category><![CDATA[Maura Grossman]]></category>
		<category><![CDATA[Ralph Losey]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[Text REtrieval Conference]]></category>
		<category><![CDATA[TREC]]></category>

		<guid isPermaLink="false">http://www.theposselist.com/?p=5368</guid>
		<description><![CDATA[15 November 2009 The presentation on Advanced Search and Retrieval Technology was made by Jason R. Baron, Maura Grossman and Ralph Losey, all powerhouses in the e-discovery world. Baron and Losey started off with their multimedia PowerPoint presentation (to the tune of Darude’s Sandstorm which we had just seen at the Capital One Future of [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignnone size-full wp-image-5316" title="Georgetown Law CLE 2" src="http://www.theposselist.com/wp-content/uploads/2009/11/Georgetown-Law-CLE-21.gif" alt="Georgetown Law CLE 2" width="180" height="70" /></p>
<p><em>15 November 2009</em></p>
<p>The presentation on <em>Advanced Search and Retrieval Technology</em> was made by <a href="http://www.eddupdate.com/2009/04/jason-baron-the-king-of-search.html" target="_blank"><span style="color: #000080;"><strong>Jason R. Baron</strong></span></a>, <a href="http://www.wlrk.com/Page.cfm/Thread/Attorneys/SubThread/Search/Name/Grossman,%20Maura%20R" target="_blank"><span style="color: #000080;"><strong>Maura Grossman</strong></span></a> and <a href="http://www.akerman.com/public/attorneys/aBiography.asp?id=718" target="_blank"><span style="color: #000080;"><strong>Ralph Losey</strong></span></a>, all powerhouses in the e-discovery world.</p>
<p>Baron and Losey started off with their multimedia PowerPoint presentation (to the tune of Darude’s <em>Sandstorm</em> which we had just seen at the <a href="http://www.theposselist.com/2009/11/10/capital-ones-first-annual-seminar-on-e-discovery-the-future-of-search" target="_blank"><span style="color: #000080;"><strong>Capital One <em>Future of Search</em> conference</strong></span></a>  and it blew away the crowd – and us, too, again.  In a nutshell, ediscovery is expanding exponentially and Ralph Losey talked petabytes, and exabytes &#8212; not terabytes. This was the “beta version” of a presentation that Losey and Baron will give at LegalTech in New York City this coming February.  </p>
<p>As an introduction (not necessary for this audience but a great set-up nonetheless) Jason said there are technologies available to help the litigator reduce the costs of reviewing and producing ESI while at the same time accomplish the objective of responding to a request for production.  Most commonly used by litigators today are review tools that enable reviewers to review the ESI in an online repository.  Vendors that provide these review tools also typically offer filtering and processing services, where they take ESI that has been collected, and, behind the scenes, apply filters to the ESI to narrow the volume to the ESI that is likely to be relevant to the request for production.</p>
<p>A popular filter is the application of keywords, developed by the litigator, to the collected ESI. After applying the keywords, the vendor provides a “frequency report” or “hit list” of the number or percentage of documents that hit on a particular keyword so that the litigator can evaluate the efficacy of the selected keywords.  </p>
<p>There may be various iterations of this process until the litigator approves the results in the frequency report.  The vendor then processes the filtered ESI and uploads it to a web-based review tool for the review to begin.</p>
<p>There is also new automated technology called “early case assessment” technology that has entered the marketplace, and which review tool vendors are rushing to add to their current products. This technology allows for a thorough front-end look at the volume of ESI collected in response to the request for production, instead of just the ESI that is filtered, processed and uploaded to the review tool. Thus, by using this new technology, the litigator can find the “significant documents” very early on in the case instead of waiting until the end of the review process after the reviewers have reviewed and “tagged” the significant documents.</p>
<p>Moreover, this technology enables the administrator and/or the litigator to perform keyword searching and other filtering on their own without incurring any additional charges and without having to rely on the vendor for these services. This technology also provides automated analytics so that the litigator can obtain a high level understanding of the ESI, which can identify key players, lines of communications between custodians and types of significant documents. This knowledge will help shape the review and the litigator’s investigation of the facts of the case.</p>
<p>Maura Grossman then followed with what we thought was a brilliant presentation on the challenges of search.  Our review cannot do it justice (we have links below to background material provided by Maura and Jason) so just some high points from her presentation:</p>
<p>1.  There is no way to review everything manually, in large matters, in the time frames dictated by the typical litigation or investigation.</p>
<p>2.  Manual review does not scale well, and how the cost of responsiveness and privilege review can quickly dwarf the costs of all of the other stages of the e-discovery process.</p>
<p>3.  Lawyers are not nearly as talented at search as they think they are.  The Blair and Maron study (in 1985) was the first study to demonstrate the significant gap or disconnect between lawyers’ perceptions of their ability to ferret out relevant documents, and their actual ability to do so.   In a 40,000 document case &#8212; consisting of 350,000 pages &#8212; the lawyers estimated that their searches had identified 75% of the relevant documents,  when, in fact, they had only identified about 20% of them.</p>
<p>4.  The use of keywords, alone, is unlikely to reliably produce all relevant documents from a large, heterogeneous document collection, for a whole host of reasons, including:</p>
<p>     a.  That information retrieval is already a very difficult problem when it involves plain vanilla, English-language, text documents. That problem is magnified when you address a multi-lingual set of documents, with nontextual forms of ESI, such as photographs or audio and video files, which are typically not searchable.</p>
<p>      b.  The inherent ambiguity of language, in particular:</p>
<p>            <em>Synonymy</em> = there can be considerable variation in describing the same person or thing, i.e., diplomat, ambassador, consul, official, etc.</p>
<p>           <em>Polysemy</em> = the same term can have multiple meanings, i.e., Bush (referring to two presidents; a shrub; a place in Africa; a thick furry tail; “bush league,” among other slang usages). Strike (referring to a labor activity; the act of hitting; the baseball kind; finding oil or gold and “striking it rich;” and so on).</p>
<p>       c.  The ubiquity of human error, i.e., misspellings and typos (there were 250 different spellings for the word “tobacco” in the MSA database; “management” will miss managment” and “mangement”).</p>
<p>       d.  Abbreviations (i.e., “P&amp;C/ACC”); colloquialisms (i.e., Haynes &amp; Boone / H&amp;B / HayBoo); slang; code words; and new short-forms used in text messaging and IM (i.e., “FWIW”, “LMAO”).</p>
<p>      e.   The problem is compounded by optical character recognition (“OCR”).</p>
<p>      f.  Poor records management, including lack of organization and/or proper labeling, the reflexive use of “Reply” even when the subject matter of an email has changed, and so on.</p>
<p>      g.  Deadlines and resource constraints that place practical limits on what can be achieved.</p>
<p>       h.  And finally, there is a widespread failure to employ “best practices” in the area of search and retrieval. Lawyers believe that because they know how to use Westlaw, Lexis, and Google, they know how to do search, but finding a few good examples of something is a very different task than finding as close to all of that thing as possible, without also including a lot of junk.</p>
<p>So, what are the “best practices” for keyword searching?</p>
<p>1.  You start with the complaint, the subpoena, or the request for production. First  you determine: who are the relevant custodians?  what is the applicable time frame?  what terms-of-art are employed?  </p>
<p>2.  Then, you translate what the request is seeking into plain, everyday English to get as close as possible to the terms that people are most likely to use in their daily communications.</p>
<p>3. Try to have a couple of different people do this to ensure that you are getting the benefit of multiple interpretations of the requests and potential keywords from different vantage points.</p>
<p>4.  This is the basic starting point for your search-term list.</p>
<p>5.  Next—and this is the step that is most often overlooked by lawyers—you must seek input from the people who actually created, sent, or received the documents.  These are your best subject-matter experts.</p>
<p>6.  Ask them questions like:  “Who would be most likely to have created, sent, or received emails or documents on these subjects?”  “What distribution lists would have been used?”  “What time frame would these emails or documents cover?”   “What events would these emails or documents discuss?”   “What names, words, or terms would be likely to appear in these emails or documents?”  “What abbreviations, acronyms, slang, or code words might have been used?”   “If you were looking for emails or documents responsive to these requests,  how would you go about finding them?”  “What kinds of attachments would these emails have?”</p>
<p>7. If warranted by the stakes of your matter, consider whether an hour or two of a linguist’s or substantive expert’s time would help you to significantly improve the quality of your search term list.</p>
<p>8. Next, look at a bunch of documents that you already know to be responsive (for example, some that you obtain from a key custodian).  Ask yourself, what unique words or phrases distinguish these documents? In what context do the documents appear? (If you are using a search tool that employs machine learning, these documents can be the start of your “seed” or training set.)</p>
<p>9. If possible, have your vendor index the documents in the set and provide you with a list of the words that appear in the documents, ranked from most to least frequently appearing. Use that list to identify documents that are likely to be unresponsive (“birthday,” “baby shower”) or privileged, and to identify search terms you may have missed.</p>
<p> </p>
<p>Ok, there was a lot more.  To help, here is a link to Jason and Maura’s slides (<a href="http://theposselist.com/pipermail/test_theposselist.com/attachments/20091119/2dffb41e/attachment-0001.pdf" target="_blank"><span style="color: #000080;"><strong><em>click here</em></strong></span></a>).</p>
<p>Some  suggested references:</p>
<p>* Craig Ball has a paper on his website summarizing search steps.   It is entitled “Surefire Steps to Splendid Search” (June/July 2009) (<a href="http://www.craigball.com/Surefire_Steps_to_Splendid_Search_June%202009.pdf" target="_blank"><span style="color: #000080;"><strong><em>Click here</em></strong></span></a>).</p>
<p>* The Sedona Conference® Best Practices Commentary on the Use of Search and Information Retrieval Methods in E-Discovery (Aug. 2007 Public Comment Version) (<a href="http://www.thesedonaconference.org/dltForm?did=Best_Practices_Retrieval_Methods___revised_cover_and_preface.pdf" target="_blank"><span style="color: #000080;"><strong><em>click here</em></strong></span></a>)</p>
<p>*  The Sedona Conference® Commentary on Achieving Quality in the E-Discovery Process (May 2009 Public Comment Version) (<a href="http://www.thesedonaconference.org/dltForm?did=Achieving_Quality.pdf" target="_blank"><span style="color: #000080;"><em><strong>click here</strong></em></span></a>)</p>
<p>* The National Institute of Standards and Technology (NIST) Text REtrieval Conference (TREC) 2009 Legal Track (<a href="http://trec-legal.umiacs.emd.edu/" target="_blank"><span style="color: #000080;"><em><strong>click here</strong></em></span></a>)  </p>
<p> </p>
<p><em><strong>Take-Away Messages from the panel</strong></em></p>
<p>1.  Success in search requires a well thought-out process with substantial input at the front-end and some degree of testing, sampling, feedback and/or iteration.</p>
<p>2.  The amount of testing, sampling, feedback and/or iteration should reflect the same proportionality considerations inherent in all discovery, i.e., the amount in controversy, the time and resources available, the importance of the evidence to the determination of the dispute, etc.</p>
<p>3.  Different search approaches are best for different tasks. For example, some things are simply easier to search for than others, i.e., patent or pharmaceutical litigation versus evidence regarding off-shore accounts or document destruction/shredding.  Do you need a few good examples, or are you trying to find “all”?</p>
<p>4.  There is no guarantee that any search method will identify all responsive documents in a large, homogeneous data set, and different search methods can produce different result sets. Hybrid or fusion approaches tend to be more successful, but are also more costly and time-consuming.</p>
<p>5.  Automated technology can help, but its not the “end-all-be-all.” Due diligence is absolutely necessary in this current “Wild West” marketplace.</p>
<p>6.  At least some degree of transparency and collaboration is necessary. Obviously, an agreed-upon search methodology (or search-term list) is preferable to a unilateral approach that is subject to second-guessing and “do-overs.”  Parties must be able to explain what they have done and why it is reasonable under the circumstances. </p>
<p>7.  It is important for practitioners to keep up with the case law, research, and literature in this area because it is quickly evolving. There are consultants (including linguists and statisticians) who have expertise in this area and can help devise or mediate a reasonable search protocol if the parties cannot agree on one.</p>
<p><strong><em>A  (very) brief note on Text REtrieval Conference (TREC)</em></strong></p>
<p>TREC was mentioned several times at the panel (and all during the conference) especially the opportunity of  participating in the 2010 TREC Legal Track.  We will have a detailed post on TREC before the year out but just a short “bio” on TREC from Ellen M. Voorhees of the National Institute of Standards and Technology (NIST) who was scheduled to appear but could not:</p>
<p>Evaluation is a fundamental component of the scientific method: researchers form a hypothesis, construct an experiment that tests the hypothesis, and then assess the extent to which the experimental results support the hypothesis.  A very common type of experiment is a comparative experiment in which the hypothesis asserts that Method 1 is a more effective solution than Method 2, and the experiment compares the performance of the two methods on a common set of problems.</p>
<p>The set of sample problems together with the evaluation measures used to assess the quality of the methods’ output form a benchmark task.  Information retrieval researchers have used test collections, a form of benchmark task, ever since Cyril Cleverdon and his colleagues created the first test collection for the Cranfield tests in the 1960’s. Many experiments followed in the subsequent two decades and several other test collections were built.</p>
<p>Yet by 1990 there was growing dissatisfaction with the methodology. While some research groups did use the same test collections, there was no concerted effort to work with the same data, to use the same evaluation measures, or to compare results across systems to consolidate findings. The available test collections were so small—the largest of the generally available collections contained about 12,000 documents and fewer than 100 queries—that operators of commercial retrieval systems were unconvinced that the techniques developed using test collections would scale to their much larger document sets. Even some experimenters were questioning whether test collections had out-lived their usefulness.</p>
<p>At this time, NIST was asked to build a large test collection for use in evaluating test retrieval technology developed as part of the Defense Advanced Research Projects Agency’s TIPSTER project. NIST proposed that instead of simply building a single large test collection, it organize a workshop that would both build a collection and investigate the larger issues surrounding test collection use. This was the genesis of the Text REtrieval Conference (TREC). The first TREC workshop was held in November 1992, and there has been a workshop held annually since then.</p>
<p>We will have a detailed post on TREC before the year out.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.theposselist.com/2009/11/15/from-the-georgetown-law-advanced-e-discovery-institute-advanced-search-and-retrieval-technology/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>

