<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
<html >
<head><title>References</title>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<meta name="generator" content="TeX4ht (http://www.cse.ohio-state.edu/~gurari/TeX4ht/)">
<meta name="originator" content="TeX4ht (http://www.cse.ohio-state.edu/~gurari/TeX4ht/)">
<!-- html,2 -->
<meta name="src" content="DocMain.tex">
<meta name="date" content="2009-06-16 09:20:00">
<link rel="stylesheet" type="text/css" href="DocMain.css">
</head><body
>
<!--l. 1--><div class="crosslinks"><p class="noindent">[<a
href="DocMainse7.html" >prev</a>] [<a
href="DocMainse7.html#tailDocMainse7.html" >prev-tail</a>] [<a
href="#tailDocMainli2.html">tail</a>] [<a
href="DocMainpa1.html# " >up</a>] </p></div>
<h3 class="likesectionHead"><a
id="x39-670007.5.1"></a>References</h3>
<!--l. 1--><p class="noindent" >
<div class="thebibliography">
<p class="bibitem" ><span class="biblabel">
[1]<span class="bibsp">   </span></span><a
id="XInternational_85"></a>Documentation - Methods for examining documents, determining their subjects,
and selecting index terms. International Organization for Standardization, Standard
5963-1985.
</p>
<p class="bibitem" ><span class="biblabel">
[2]<span class="bibsp">   </span></span><a
id="XHjorland06"></a>Lifeboat for knowledge organization: indexing theory.
http://www.db.dk/bh/Lifeboat_KO/CONCEPTS/indexing_theory.htm.
</p>
<p class="bibitem" ><span class="biblabel">
[3]<span class="bibsp">   </span></span><a
id="Xardo99:_online99"></a>A. Ardö and T. Koch. Automatic classification applied to the full-text Internet
documents in a robot-generated subject index. In <span
class="ecti-1095">Online Information 99, Proceedings</span>,
pages 239–246, Dec. 1999. http://www.it.lth.se/anders/online99/.
</p>
<p class="bibitem" ><span class="biblabel">
[4]<span class="bibsp">   </span></span><a
id="XBang_etal_06"></a>S. L. Bang, J. D. Yang, and H. J. Yang. Hierarchical document categorization
with k-nn and concept-based thesauri. <span
class="ecti-1095">Information Processing and Management</span>,
(42):387–406, 2006.
</p>
<p class="bibitem" ><span class="biblabel">
[5]<span class="bibsp">   </span></span><a
id="XChen_Dumais_2000"></a>H. Chen and S. T. Dumais. Bringing order to the web: automatically categorizing
search results. In <span
class="ecti-1095">Proc. of CHI-00, ACM International Conference on Human Factors</span>
<span
class="ecti-1095">in Computing Systems</span>, pages 145–152, 2000.
</p>
<p class="bibitem" ><span class="biblabel">
[6]<span class="bibsp">   </span></span><a
id="XGarces_etal_06"></a>P. J. Garcés, J. A. Olivas, and F. P. Romero. Concept-matching ir systems
versus word-matching information retrieval systems: Considering fuzzy interrelations
for indexing web pages. <span
class="ecti-1095">JASIS&T</span>, 57(4):564–576, 2006.
</p>
<p class="bibitem" ><span class="biblabel">
[7]<span class="bibsp">   </span></span><a
id="XGolub_Review"></a>K. Golub. Automated subject classification of textual Web documents. <span
class="ecti-1095">Journal</span>
<span
class="ecti-1095">of Documentation</span>, 62(3):350–371, 2006.
</p>
<p class="bibitem" ><span class="biblabel">
[8]<span class="bibsp">   </span></span><a
id="XGolubChall"></a>K. Golub. Automated subject classification of textual web pages, based on a
controlled vocabulary: challenges and recommendations. <span
class="ecti-1095">New review of hypermedia</span>
<span
class="ecti-1095">and multimedia</span>, 12(1):11–27, June 2006. Special issue on knowledge organization
systems and services.
</p>
<p class="bibitem" ><span class="biblabel">
[9]<span class="bibsp">   </span></span><a
id="XGolubRole"></a>K. Golub. The role of different thesauri terms in automated subject classification
of text. In <span
class="ecti-1095">IEEE/WIC/ACM International Conference on Web Intelligence</span>, Dec. 2006.
</p>
<p class="bibitem" ><span class="biblabel">
[10]<span class="bibsp">   </span></span><a
id="Xardo05:_ECDL"></a>K. Golub and A. Ardö. Importance of HTML Structural Elements in
Automated Subject Classification. In A. Rauber, S. Christodoulakis, and
A. M. Tjoa, editors, <span
class="ecti-1095">9th European Conference on Research and Advanced</span>
<span
class="ecti-1095">Technology for Digital Libraries - ECDL 2005</span>, volume 3652 of <span
class="ecti-1095">Lecture Notes</span>
<span
class="ecti-1095">in Computer Science</span>, pages 368 – 378. Springer, Sept. 2005. Manuscript at:
http://www.it.lth.se/knowlib/publ/ECDL2005.pdf.
</p>
<p class="bibitem" ><span class="biblabel">
[11]<span class="bibsp">   </span></span><a
id="Xardo06:_ECDL"></a>K. Golub, A. Ardö, D. Mladenic, and M. Grobelnik. Comparing and Combining
Two Approaches to Automated Subject Classification of Text. In J. Gonzalo,
C. Thanos, M. F. Verdejo, and R. C. Carrasco, editors, <span
class="ecti-1095">10th European Conference on</span>
<span
class="ecti-1095">Research and Advanced Technology for Digital Libraries - ECDL 2006</span>, volume 4172
of <span
class="ecti-1095">Lecture Notes in Computer Science</span>, pages 467–470. Springer, Sept. 2006.
</p>
<p class="bibitem" ><span class="biblabel">
[12]<span class="bibsp">   </span></span><a
id="X1Ingwersen_05"></a>P. Ingwersen and K. Järvelin. <span
class="ecti-1095">The turn: integration of information seeking and</span>
<span
class="ecti-1095">retrieval in context</span>. Springer, Dordrecht, The Netherlands, 2005.
</p>
<p class="bibitem" ><span class="biblabel">
[13]<span class="bibsp">   </span></span><a
id="XLancaster_03"></a>F. W. Lancaster. <span
class="ecti-1095">Indexing and abstracting in theory and practice</span>. Facet, London,
2003. 3rd ed.
</p>
<p class="bibitem" ><span class="biblabel">
[14]<span class="bibsp">   </span></span><a
id="XLewis_etal_04"></a>D. D. Lewis, Y. Yang, T. Rose, and F. Li. Rcv1: A new benchmark collection for
text categorization research. <span
class="ecti-1095">The Journal of Machine Learning Research</span>, (5):361–397,
2004.
</p>
<p class="bibitem" ><span class="biblabel">
[15]<span class="bibsp">   </span></span><a
id="XMedelyan_06"></a>O. Medelyan and I. Witten. Thesaurus based automatic keyphrase indexing. In
<span
class="ecti-1095">Proceedings of the Sixth ACM/IEEE Joint Conference on Digital Libraries, JCDL 06</span>,
pages 296–297, 2006.
</p>
<p class="bibitem" ><span class="biblabel">
[16]<span class="bibsp">   </span></span><a
id="XOlson_01"></a>H. A. Olson and J. J. Boll. <span
class="ecti-1095">Subject analysis in online catalogs</span>. Englewood, CO:
Libraries Unlimited, 2001. 2nd ed.
</p>
<p class="bibitem" ><span class="biblabel">
[17]<span class="bibsp">   </span></span><a
id="XSebastiani_02"></a>F. Sebastiani. Machine learning in automated text categorization. <span
class="ecti-1095">ACM</span>
<span
class="ecti-1095">Computing Surveys</span>, 34(1):1–47, 2002.
</p>
<p class="bibitem" ><span class="biblabel">
[18]<span class="bibsp">   </span></span><a
id="XSvenonius_2000"></a>E. Svenonius. <span
class="ecti-1095">The intellectual foundations of information organization</span>. MIT
Press, Cambridge, MA, USA, 2000.
</p>
<p class="bibitem" ><span class="biblabel">
[19]<span class="bibsp">   </span></span><a
id="XRafael06"></a>R. R. Trujilo. Simulation tool to study focused web crawling strategies. Master’s
thesis, Dept. of Information Technology, Lund University, P.O. Box 118, S-221 00
Lund, Sweden, Mar. 2006. http://combine.it.lth.se/CrawlSim/CrawlSim.pdf.
</p>
<p class="bibitem" ><span class="biblabel">
[20]<span class="bibsp">   </span></span><a
id="XYang_99"></a>Y. Yang. An evaluation of statistical approaches to text categorization. <span
class="ecti-1095">Journal</span>
<span
class="ecti-1095">of Information Retrieval</span>, (1):67–88, 1999.
</p>
</div>
<!--l. 28--><p class="indent" >
<!--l. 30--><div class="crosslinks"><p class="noindent">[<a
href="DocMainse7.html" >prev</a>] [<a
href="DocMainse7.html#tailDocMainse7.html" >prev-tail</a>] [<a
href="DocMainli2.html" >front</a>] [<a
href="DocMainpa1.html# " >up</a>] </p></div>
<!--l. 30--><p class="indent" > <a
id="tailDocMainli2.html"></a>
</body></html>