@jssmcadharwad.com
Assistant Professor
JSS SHRI MANJUNATHESHWARA MCA INSTITUTE
Manjunath Pujar is a dedicated academician and researcher with a strong background in Computer Science Engineering. With an MTech in CSE and ongoing PhD research, he has demonstrated excellence in both teaching and research. He currently serves as an Assistant Professor at JSS Shri Manjunatheshwara MCA Institute, Dharwad, where he has been working since August 2010. Manjunath is passionate about educating and guiding students, having overseen over 80 academic projects at both undergraduate and postgraduate levels. His research interests lie in machine learning, web content mining, and deep learning techniques, areas in which he has published extensively in reputed Scopus-indexed journals.
Manjunath is also qualified in UGC NET and KSET, further solidifying his expertise in the field. He has contributed to curriculum development, academic project guidance, and exam coordination at various institutions. His technical proficiency includes programming languages like Python, Java, C++, and
Pursuing Ph.D, BE(CSE), MTech(CSE)
Artificial Intelligence, Computer Science
Scopus Publications
Scholar Citations
Scholar h-index
Manjunath Pujar, Monica R. Mundada, B. J. Sowmya, S. Supreeth, and G. Shruthi
Springer Science and Business Media LLC
Manjunath Pujar and Monica R Mundada
The Science and Information Organization
In recent years, the emergence of WWW (World Wide Web) led to the accumulation of huge amount of information and data. Hence the web is found to consist of unstructured and structured information that impacts the day to day life of the society. Because of such availability of huge information, utilization of the required information becomes more challenging. This paper provided a comprehensive survey on the current situation and recent trends on web content mining (WCM) and its applications thereby contributing to the enhancement of the upcoming research in WCM. The paper focused mainly on the mining and retrieval techniques, various WCM approaches, challenges and process of information retrieval and information extraction. The paper describes the four major tasks of web content mining that is information retrieval, information extraction, generalization and validation in detail. WCM concentrates on orchestrating, sorting, classifying, collecting, congregating of web data and provide the improved data which can be easily accessed by the users. Web content mining tools were needed to scan text, images and HTML documents and provide results to the search engine. It guides the search engine to provide better productive results for every search based on their importance. The paper also analysed different web content mining tools for the extraction of relevant information from the corresponding web page. Keywords—Web content mining; web structure mining; web usage mining; data mining; information retrieval; information extraction