The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"  
  "http://www.w3.org/TR/html4/loose.dtd">  
<html > 
<head><title>References</title> 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> 
<meta name="generator" content="TeX4ht (http://www.cse.ohio-state.edu/~gurari/TeX4ht/)"> 
<meta name="originator" content="TeX4ht (http://www.cse.ohio-state.edu/~gurari/TeX4ht/)"> 
<!-- html,2 --> 
<meta name="src" content="DocMain.tex"> 
<meta name="date" content="2009-06-16 09:20:00"> 
<link rel="stylesheet" type="text/css" href="DocMain.css"> 
</head><body 
>
   <!--l. 1--><div class="crosslinks"><p class="noindent">[<a 
href="DocMainse7.html" >prev</a>] [<a 
href="DocMainse7.html#tailDocMainse7.html" >prev-tail</a>] [<a 
href="#tailDocMainli2.html">tail</a>] [<a 
href="DocMainpa1.html# " >up</a>] </p></div>
   <h3 class="likesectionHead"><a 
 id="x39-670007.5.1"></a>References</h3>
<!--l. 1--><p class="noindent" >
    <div class="thebibliography">
    <p class="bibitem" ><span class="biblabel">
  [1]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XInternational_85"></a>Documentation - Methods for examining documents, determining their subjects,
    and selecting index terms. International Organization for Standardization, Standard
    5963-1985.
    </p>
    <p class="bibitem" ><span class="biblabel">
  [2]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XHjorland06"></a>Lifeboat        for        knowledge        organization:        indexing        theory.
    http://www.db.dk/bh/Lifeboat_KO/CONCEPTS/indexing_theory.htm.
    </p>
    <p class="bibitem" ><span class="biblabel">
  [3]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="Xardo99:_online99"></a>A.&#x00A0;Ardö and T.&#x00A0;Koch.  Automatic classification applied to the full-text Internet
    documents in a robot-generated subject index. In <span 
class="ecti-1095">Online Information 99, Proceedings</span>,
    pages 239&#8211;246, Dec. 1999. http://www.it.lth.se/anders/online99/.
    </p>
    <p class="bibitem" ><span class="biblabel">
  [4]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XBang_etal_06"></a>S.&#x00A0;L. Bang, J.&#x00A0;D. Yang, and H.&#x00A0;J. Yang.  Hierarchical document categorization
    with  k-nn  and  concept-based  thesauri.   <span 
class="ecti-1095">Information Processing and Management</span>,
    (42):387&#8211;406, 2006.
    </p>
    <p class="bibitem" ><span class="biblabel">
  [5]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XChen_Dumais_2000"></a>H.&#x00A0;Chen and S.&#x00A0;T. Dumais. Bringing order to the web: automatically categorizing
    search results. In <span 
class="ecti-1095">Proc. of CHI-00, ACM International Conference on Human Factors</span>
    <span 
class="ecti-1095">in Computing Systems</span>, pages 145&#8211;152, 2000.
    </p>
    <p class="bibitem" ><span class="biblabel">
  [6]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XGarces_etal_06"></a>P.&#x00A0;J.  Garcés,  J.&#x00A0;A.  Olivas,  and  F.&#x00A0;P.  Romero.   Concept-matching  ir  systems
    versus word-matching information retrieval systems: Considering fuzzy interrelations
    for indexing web pages. <span 
class="ecti-1095">JASIS&amp;T</span>, 57(4):564&#8211;576, 2006.
    </p>
    <p class="bibitem" ><span class="biblabel">
  [7]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XGolub_Review"></a>K.&#x00A0;Golub.  Automated subject classification of textual Web documents.  <span 
class="ecti-1095">Journal</span>
    <span 
class="ecti-1095">of Documentation</span>, 62(3):350&#8211;371, 2006.

    </p>
    <p class="bibitem" ><span class="biblabel">
  [8]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XGolubChall"></a>K.&#x00A0;Golub.   Automated subject classification of textual web pages, based on a
    controlled vocabulary: challenges and recommendations.  <span 
class="ecti-1095">New review of hypermedia</span>
    <span 
class="ecti-1095">and multimedia</span>, 12(1):11&#8211;27, June 2006.   Special issue on knowledge organization
    systems and services.
    </p>
    <p class="bibitem" ><span class="biblabel">
  [9]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XGolubRole"></a>K.&#x00A0;Golub. The role of different thesauri terms in automated subject classification
    of text. In <span 
class="ecti-1095">IEEE/WIC/ACM International Conference on Web Intelligence</span>, Dec. 2006.
    </p>
    <p class="bibitem" ><span class="biblabel">
 [10]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="Xardo05:_ECDL"></a>K.&#x00A0;Golub  and  A.&#x00A0;Ardö.      Importance  of  HTML  Structural  Elements  in
    Automated   Subject   Classification.       In   A.&#x00A0;Rauber,   S.&#x00A0;Christodoulakis,   and
    A.&#x00A0;M.   Tjoa,   editors,   <span 
class="ecti-1095">9th   European   Conference   on   Research   and   Advanced</span>
    <span 
class="ecti-1095">Technology  for  Digital  Libraries  -  ECDL  2005</span>,  volume  3652  of  <span 
class="ecti-1095">Lecture  Notes</span>
    <span 
class="ecti-1095">in  Computer  Science</span>,  pages  368  &#8211;  378.  Springer,  Sept.  2005.    Manuscript  at:
    http://www.it.lth.se/knowlib/publ/ECDL2005.pdf.
    </p>
    <p class="bibitem" ><span class="biblabel">
 [11]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="Xardo06:_ECDL"></a>K.&#x00A0;Golub, A.&#x00A0;Ardö, D.&#x00A0;Mladenic, and M.&#x00A0;Grobelnik. Comparing and Combining
    Two  Approaches  to  Automated  Subject  Classification  of  Text.    In  J.&#x00A0;Gonzalo,
    C.&#x00A0;Thanos, M.&#x00A0;F. Verdejo, and R.&#x00A0;C. Carrasco, editors, <span 
class="ecti-1095">10th European Conference on</span>
    <span 
class="ecti-1095">Research and Advanced Technology for Digital Libraries - ECDL 2006</span>, volume 4172
    of <span 
class="ecti-1095">Lecture Notes in Computer Science</span>, pages 467&#8211;470. Springer, Sept. 2006.
    </p>
    <p class="bibitem" ><span class="biblabel">
 [12]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="X1Ingwersen_05"></a>P.&#x00A0;Ingwersen and K.&#x00A0;Järvelin.  <span 
class="ecti-1095">The turn: integration of information seeking and</span>
    <span 
class="ecti-1095">retrieval in context</span>. Springer, Dordrecht, The Netherlands, 2005.
    </p>
    <p class="bibitem" ><span class="biblabel">
 [13]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XLancaster_03"></a>F.&#x00A0;W. Lancaster. <span 
class="ecti-1095">Indexing and abstracting in theory and practice</span>. Facet, London,
    2003. 3rd ed.
    </p>
    <p class="bibitem" ><span class="biblabel">
 [14]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XLewis_etal_04"></a>D.&#x00A0;D. Lewis, Y.&#x00A0;Yang, T.&#x00A0;Rose, and F.&#x00A0;Li. Rcv1: A new benchmark collection for
    text categorization research. <span 
class="ecti-1095">The Journal of Machine Learning Research</span>, (5):361&#8211;397,
    2004.
    </p>

    <p class="bibitem" ><span class="biblabel">
 [15]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XMedelyan_06"></a>O.&#x00A0;Medelyan and I.&#x00A0;Witten. Thesaurus based automatic keyphrase indexing. In
    <span 
class="ecti-1095">Proceedings of the Sixth ACM/IEEE Joint Conference on Digital Libraries, JCDL 06</span>,
    pages 296&#8211;297, 2006.
    </p>
    <p class="bibitem" ><span class="biblabel">
 [16]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XOlson_01"></a>H.&#x00A0;A. Olson and J.&#x00A0;J. Boll. <span 
class="ecti-1095">Subject analysis in online catalogs</span>. Englewood, CO:
    Libraries Unlimited, 2001. 2nd ed.
    </p>
    <p class="bibitem" ><span class="biblabel">
 [17]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XSebastiani_02"></a>F.&#x00A0;Sebastiani.    Machine  learning  in  automated  text  categorization.    <span 
class="ecti-1095">ACM</span>
    <span 
class="ecti-1095">Computing Surveys</span>, 34(1):1&#8211;47, 2002.
    </p>
    <p class="bibitem" ><span class="biblabel">
 [18]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XSvenonius_2000"></a>E.&#x00A0;Svenonius.   <span 
class="ecti-1095">The intellectual foundations of information organization</span>.   MIT
    Press, Cambridge, MA, USA, 2000.
    </p>
    <p class="bibitem" ><span class="biblabel">
 [19]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XRafael06"></a>R.&#x00A0;R. Trujilo. Simulation tool to study focused web crawling strategies. Master&#8217;s
    thesis, Dept. of Information Technology, Lund University, P.O. Box 118, S-221&#x00A0;00
    Lund, Sweden, Mar. 2006. http://combine.it.lth.se/CrawlSim/CrawlSim.pdf.
    </p>
    <p class="bibitem" ><span class="biblabel">
 [20]<span class="bibsp">&#x00A0;&#x00A0;&#x00A0;</span></span><a 
 id="XYang_99"></a>Y.&#x00A0;Yang.  An evaluation of statistical approaches to text categorization.  <span 
class="ecti-1095">Journal</span>
    <span 
class="ecti-1095">of Information Retrieval</span>, (1):67&#8211;88, 1999.
</p>
    </div>
<!--l. 28--><p class="indent" >


   <!--l. 30--><div class="crosslinks"><p class="noindent">[<a 
href="DocMainse7.html" >prev</a>] [<a 
href="DocMainse7.html#tailDocMainse7.html" >prev-tail</a>] [<a 
href="DocMainli2.html" >front</a>] [<a 
href="DocMainpa1.html# " >up</a>] </p></div>
<!--l. 30--><p class="indent" >   <a 
 id="tailDocMainli2.html"></a>  
</body></html>