Academic literature on the topic 'HTML documents'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'HTML documents.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Journal articles on the topic "HTML documents"
Bonhomme, Stéphane, and Cécile Roisin. "Interactively restructuring HTML documents." Computer Networks and ISDN Systems 28, no. 7-11 (May 1996): 1075–84. http://dx.doi.org/10.1016/0169-7552(96)00042-6.
Full textSato, S. y. "Dynamic rewriting of HTML documents." Computer Networks and ISDN Systems 27, no. 2 (November 1994): 307–8. http://dx.doi.org/10.1016/s0169-7552(94)90147-3.
Full textvon Tetzchner, J. Stephenson. "Converting formatted documents to HTML." Computer Networks and ISDN Systems 27, no. 2 (November 1994): 309–10. http://dx.doi.org/10.1016/s0169-7552(94)90154-6.
Full textO, Geum-Yong, and In-Jun Hwang. "Automatically Converting HTML Documents with Similar Pattern into XML Documents." KIPS Transactions:PartD 9D, no. 3 (June 1, 2002): 355–64. http://dx.doi.org/10.3745/kipstd.2002.9d.3.355.
Full textKAJI, NOBUHIRO, and MASARU KITSUREGAWA. "Acquiring Polar Sentences from HTML Documents." Journal of Natural Language Processing 15, no. 3 (2008): 77–90. http://dx.doi.org/10.5715/jnlp.15.3_77.
Full textGupta, Suhit, Gail E. Kaiser, Peter Grimm, Michael F. Chiang, and Justin Starren. "Automating Content Extraction of HTML Documents." World Wide Web 8, no. 2 (June 2005): 179–224. http://dx.doi.org/10.1007/s11280-004-4873-3.
Full textVállez, Mari, Rafael Pedraza-Jiménez, Lluís Codina, Saúl Blanco, and Cristòfol Rovira. "A semi-automatic indexing system based on embedded information in HTML documents." Library Hi Tech 33, no. 2 (June 15, 2015): 195–210. http://dx.doi.org/10.1108/lht-12-2014-0114.
Full textTHIEMANN, PETER. "A typed representation for HTML and XML documents in Haskell." Journal of Functional Programming 12, no. 4-5 (July 2002): 435–68. http://dx.doi.org/10.1017/s0956796802004392.
Full textGupta, Shivangi, and Mukesh Rawat. "Keyword based Automatic Summarization of HTML Documents." International Journal of Computer Applications 127, no. 8 (October 15, 2015): 24–29. http://dx.doi.org/10.5120/ijca2015906421.
Full textWu, Qi, Xing-shu Chen, Kai Zhu, and Chun-hui Wang. "Relevance-based content extraction of HTML documents." Journal of Central South University 19, no. 7 (July 2012): 1921–26. http://dx.doi.org/10.1007/s11771-012-1226-8.
Full textDissertations / Theses on the topic "HTML documents"
Xie, Wei University of Ballarat. "Classification of HTML Documents." University of Ballarat, 2006. http://archimedes.ballarat.edu.au:8080/vital/access/HandleResolver/1959.17/12774.
Full textMaster of Computing
Xie, Wei. "Classification of HTML Documents." University of Ballarat, 2006. http://archimedes.ballarat.edu.au:8080/vital/access/HandleResolver/1959.17/15628.
Full textMaster of Computing
Levering, Ryan Reed. "Multi-stage modeling of HTML documents." Diss., Online access via UMI:, 2004.
Find full textStachowiak, Maciej 1976. "Automated extraction of structured data from HTML documents." Thesis, Massachusetts Institute of Technology, 1998. http://hdl.handle.net/1721.1/9896.
Full textIncludes bibliographical references (leaf 45).
by Maciej Stachowiak.
M.Eng.
Nálevka, Petr. "Compound XML documents." Master's thesis, Vysoká škola ekonomická v Praze, 2007. http://www.nusl.cz/ntk/nusl-1746.
Full textTemelkuran, Baris 1980. "Hap-Shu : a language for locating information in HTML documents." Thesis, Massachusetts Institute of Technology, 2003. http://hdl.handle.net/1721.1/87882.
Full textMeziane, Souad. "Analyse et conversion de documents : du pixel au langage HTML." Lyon, INSA, 1998. http://www.theses.fr/1998ISAL0128.
Full textThis work is part of the thematic "Document Analysis" in the Laboratory Reconnaissance de Forme et Vision(RFV). To achieve an analysis system ables to, interpret documents and to restore its structure, the Methodologies we have chosen lean on several approaches and particularly on the syntactic and structural approach of the Pattern Recognition. The aim in this work is to convert some paper documents into HTML documents because these documents are more used on the Internet. The application domain of such systems could be general; however, we concentrate us on a particular type of documents with a rich typography: the summaries. In this context, we have realized a system that exploits on one hand the information about content of the document such as its physical and logical structures, and on the other hand on two level grammars. It is composed with two grammars: a meta-grammar and a hyper-grammar. In our system, the role of the meta-grammar is to describe the physical and logical structures of the document. The hyper-grammar is constituted with a set of calculus rules and describes the treatments to do in order to convert the document in HTML. The summary analysis is done in two steps: analysis and identification of the document, and then translation into HTML. During of the first step, the system constructs a learning base by using the grammatical inference. This base contains several patterns of synopses to identify. An unknown document, submitted to the system is identified by matching with the patterns of the base by using all the attributes obtained in the analysis step. The layout of HTML document construction is based on the grammatical analysis of the hyper-grammar. The last is obtained by translation of the logical labels and some typographic parameters into HTML commands. The result of the grammatical analysis of the hyper-grammar produces the structured HTML document corresponding to the studied document. This last will be visualized by software of navigation
Mohammadzadeh, Hadi. "Improving Retrieval Accuracy in Main Content Extraction from HTML Web Documents." Doctoral thesis, Universitätsbibliothek Leipzig, 2013. http://nbn-resolving.de/urn:nbn:de:bsz:15-qucosa-130500.
Full textDas rasante Wachstum von textbasierten Informationen im World Wide Web und die Vielfalt der Anwendungen, die diese Daten nutzen, macht es notwendig, effiziente und effektive Methoden zu entwickeln, die den Hauptinhalt identifizieren und von den zusätzlichen Inhaltsobjekten wie z.B. Navigations-Menüs, Anzeigen, Design-Elementen oder Haftungsausschlüssen trennen. Zunächst untersuchen, entwickeln und evaluieren wir in dieser Arbeit R2L, DANA, DANAg und AdDANAg, eine Familie von neuartigen Algorithmen zum Extrahieren des Inhalts von Web-Dokumenten. Das grundlegende Konzept hinter R2L, das auch zur Entwicklung der drei weiteren Algorithmen führte, nutzt die Besonderheiten der Rechts-nach-links-Sprachen aus, um den Hauptinhalt von Webseiten zu extrahieren. Da der lateinische Zeichensatz und die Rechts-nach-links-Zeichensätze durch verschiedene Abschnitte des Unicode-Zeichensatzes kodiert werden, lassen sich die Rechts-nach-links-Zeichen leicht von den lateinischen Zeichen in einer HTML-Datei unterscheiden. Das erlaubt dem R2L-Ansatz, Bereiche mit einer hohen Dichte von Rechts-nach-links-Zeichen und wenigen lateinischen Zeichen aus einer HTML-Datei zu erkennen. Aus diesen Bereichen kann dann R2L die Rechts-nach-links-Zeichen extrahieren. Die erste Erweiterung, DANA, verbessert die Wirksamkeit des Baseline-Algorithmus durch die Verwendung eines HTML-Parsers in der Nachbearbeitungsphase des R2L-Algorithmus, um den Inhalt aus Bereichen mit einer hohen Dichte von Rechts-nach-links-Zeichen zu extrahieren. DANAg erweitert den Ansatz des R2L-Algorithmus, so dass eine Sprachunabhängigkeit erreicht wird. Die dritte Erweiterung, AdDANAg, integriert eine neue Vorverarbeitungsschritte, um u.a. die Weblinks zu normalisieren. Die vorgestellten Ansätze werden in Bezug auf Effizienz und Effektivität analysiert. Im Vergleich mit mehreren etablierten Hauptinhalt-Extraktions-Algorithmen zeigen wir, dass sie in diesen Punkten überlegen sind. Darüber hinaus findet die Extraktion der Überschriften aus Web-Artikeln vielfältige Anwendungen. Hierzu entwickeln wir mit TitleFinder einen sich nur auf den Textinhalt beziehenden und sprachabhängigen Ansatz. Das vorgestellte Verfahren ist in Bezug auf Effektivität und Effizienz besser als bekannte Ansätze, die auf strukturellen und visuellen Eigenschaften der HTML-Datei beruhen
Yerra, Rajiv. "Detecting Similar HTML Documents Using A Sentence-Based Copy Detection Approach." Diss., CLICK HERE for online access, 2005. http://contentdm.lib.byu.edu/ETD/image/etd977.pdf.
Full textSinger, Ron. "Comparing machine learning and hand-crafted approaches for information extraction from HTML documents." Thesis, McGill University, 2003. http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=79127.
Full textBooks on the topic "HTML documents"
DeRose, Steven J. The SGML FAQ book: Understanding the foundation of HTML and XML. Boston: Kluwer Academic Publishers, 1997.
Find full textThe Document object model: Processing structured documents. New York: McGraw-Hill/Osborne, 2002.
Find full textHeslop, Brent. HTML Publishing on the Internet: Create great-looking documents online: home pages, newsletters, catalogs, ads and forms. Research Triangle Park, NC: Ventana Communications, 1996.
Find full textGuthrie, Malcolm. Forms: Interactivity for the World Wide Web : creating HTML and PDF form documents. San Jose, Calif: Adobe Press, 1998.
Find full textHeslop, Brent. HTML publishing on the Internet for Windows: Create great-looking documents online:home pages, newsletters, catalogs, ads & forums. Chapel Hill, NC: Ventana Press, 1995.
Find full textHeslop, Brent D. HTML publishing on the Internet for Macintosh: Create great-looking documents online : home pages, newsletters, catalogs, ads & forms. Research Triangle Park, NC: Ventana, 1995.
Find full textHeslop, Brent. HTML publishing on the Internet for Macintosh: Create great-looking documents online : home pages, newsletters, catalogs, ads & forms. Research Triangle Park, NC: Ventana, 1995.
Find full textHeslop, Brent. HTML publishing on the internet: For windows : create great-looking documents online: home pages, newsletters, catalogs, ads & forums. Chapel Hill, N.C: Ventana Press, Inc., 1995.
Find full textHeslop, Brent D. HTML publishing on the Internet for Windows: Create great-looking documents online : home pages, newsletters, catalogs, ads & forums. Chapel Hill, NC: Ventana Press, 1995.
Find full textBook chapters on the topic "HTML documents"
Freeman, Adam. "Creating HTML Documents." In The Definitive Guide to HTML5, 117–50. Berkeley, CA: Apress, 2011. http://dx.doi.org/10.1007/978-1-4302-3961-1_7.
Full textWhite, Bebo. "Authoring HTML Documents." In HTML and the Art of Authoring for the World Wide Web, 151–84. Boston, MA: Springer US, 1996. http://dx.doi.org/10.1007/978-1-4613-1351-9_7.
Full textWhite, Bebo. "Dynamically Created HTML Documents." In HTML and the Art of Authoring for the World Wide Web, 223–30. Boston, MA: Springer US, 1996. http://dx.doi.org/10.1007/978-1-4613-1351-9_12.
Full textWhite, Bebo. "Converting Formatted Documents to HTML." In HTML and the Art of Authoring for the World Wide Web, 213–14. Boston, MA: Springer US, 1996. http://dx.doi.org/10.1007/978-1-4613-1351-9_10.
Full textWang, Yalin, and Jianying Hu. "Detecting Tables in HTML Documents." In Lecture Notes in Computer Science, 249–60. Berlin, Heidelberg: Springer Berlin Heidelberg, 2002. http://dx.doi.org/10.1007/3-540-45869-7_29.
Full textLiu, Mengchi. "Capturing Semantics in HTML Documents." In Lecture Notes in Computer Science, 103–12. Berlin, Heidelberg: Springer Berlin Heidelberg, 2002. http://dx.doi.org/10.1007/3-540-46146-9_11.
Full textCiancarini, Paolo, Cecilia Mascolo, and Fabio Vitali. "Visualizing Z Notation in HTML Documents." In ZUM ’98: The Z Formal Specification Notation, 81–95. Berlin, Heidelberg: Springer Berlin Heidelberg, 1998. http://dx.doi.org/10.1007/978-3-540-49676-2_7.
Full textFaghani, Shabanali, Ali Hadian, and Behrouz Minaei-Bidgoli. "Charset Encoding Detection of HTML Documents." In Information Retrieval Technology, 215–26. Cham: Springer International Publishing, 2015. http://dx.doi.org/10.1007/978-3-319-28940-3_17.
Full textLim, Seung-Jin, and Yiu-Kai Ng. "A Heuristic Approach for Converting HTML Documents to XML Documents." In Computational Logic — CL 2000, 1182–96. Berlin, Heidelberg: Springer Berlin Heidelberg, 2000. http://dx.doi.org/10.1007/3-540-44957-4_79.
Full textSchultz, David, and Craig Cook. "Adding Style to Your Documents: CSS." In Beginning HTML with CSS and XHTML, 227–50. Berkeley, CA: Apress, 2007. http://dx.doi.org/10.1007/978-1-4302-0350-6_9.
Full textConference papers on the topic "HTML documents"
Kim, Yeon-seok, and Kyong-ho Lee. "Generating Structured Documents from HTML Tables." In 2006 International Conference on Hybrid Information Technology. IEEE, 2006. http://dx.doi.org/10.1109/ichit.2006.253669.
Full textMolinari, Andrea, Gabriella Pasi, and R. A. Marques Pereira. "An indexing model of HTML documents." In the 2003 ACM symposium. New York, New York, USA: ACM Press, 2003. http://dx.doi.org/10.1145/952532.952697.
Full textRapela, Joaquin. "Automatically combining ranking heuristics for HTML documents." In Proceeding of the third international workshop. New York, New York, USA: ACM Press, 2001. http://dx.doi.org/10.1145/502932.502945.
Full textGupta, Suhit, Gail Kaiser, David Neistadt, and Peter Grimm. "DOM-based content extraction of HTML documents." In the twelfth international conference. New York, New York, USA: ACM Press, 2003. http://dx.doi.org/10.1145/775152.775182.
Full textKirschning, Ingrid, and Joaquín O. Rueda. "Animated agents and TTS for HTML documents." In the 2005 Latin American conference. New York, New York, USA: ACM Press, 2005. http://dx.doi.org/10.1145/1111360.1111375.
Full textBurget, R. "Layout Based Information Extraction from HTML Documents." In Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Vol 2. IEEE, 2007. http://dx.doi.org/10.1109/icdar.2007.4376990.
Full text"CONCEPTS EXTRACTION BASED ON HTML DOCUMENTS STRUCTURE." In International Conference on Agents and Artificial Intelligence. SciTePress - Science and and Technology Publications, 2012. http://dx.doi.org/10.5220/0003748305030506.
Full textHoon Hwangbo and Hongchul Lee. "Reusing of information constructed in HTML documents: A conversion of HTML into OWL." In 2008 International Conference on Control, Automation and Systems (ICCAS). IEEE, 2008. http://dx.doi.org/10.1109/iccas.2008.4694654.
Full textCanan Pembe, F., and Tunga Gungor. "Heading-based sectional hierarchy identification for HTML documents." In 2007 22nd international symposium on computer and information sciences. IEEE, 2007. http://dx.doi.org/10.1109/iscis.2007.4456839.
Full textJern, Mikael, Jakob Rogstadius, Tobias Åström, and Anders Ynnerman. "Visual Analytics Presentation Tools Applied in HTML Documents." In 2008 12th International Conference Information Visualisation (IV). IEEE, 2008. http://dx.doi.org/10.1109/iv.2008.22.
Full textReports on the topic "HTML documents"
Gupta, Suhit, Gail Kaiser, David Neistadt, and Peter Grimm. DOM-based Content Extraction of HTML Documents. Fort Belvoir, VA: Defense Technical Information Center, January 2005. http://dx.doi.org/10.21236/ada437440.
Full textPalme, J., A. Hopmann, and N. Shelness. MIME Encapsulation of Aggregate Documents, such as HTML (MHTML). RFC Editor, March 1999. http://dx.doi.org/10.17487/rfc2557.
Full textPalme, J., and A. Hopmann. MIME E-mail Encapsulation of Aggregate Documents, such as HTML (MHTML). RFC Editor, March 1997. http://dx.doi.org/10.17487/rfc2110.
Full text