Showing 1600 open source projects for "java html parser"

View related business solutions
  • Earn up to 15% annual interest with Nexo. Icon
    Earn up to 15% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Earn up to 15% annual interest with Nexo. Icon
    Earn up to 15% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    Java Tablesaw

    Java Tablesaw

    Java dataframe and visualization library

    Tablesaw is a dataframe and visualization library that supports loading, cleaning, transforming, filtering, and summarizing data. If you work with data in Java, it may save you time and effort. Tablesaw also supports descriptive statistics and can be used to prepare data for working with machine learning libraries like Smile, Tribuo, H20.ai, DL4J. Import data from RDBMS, Excel, CSV, TSV, JSON, HTML, or Fixed Width text files, whether they are local or remote (http, S3, etc.) Tablesaw supports data visualization by providing a wrapper for the Plot.ly JavaScript plotting library. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    commonmark-java

    commonmark-java

    Java library for parsing and rendering CommonMark (Markdown)

    Java library for parsing and rendering Markdown text according to the CommonMark specification (and some extensions). Provides classes for parsing input to an abstract syntax tree of nodes (AST), visiting and manipulating nodes, and rendering to HTML. It started out as a port of commonmark.js, but has since evolved into a full library with a nice API.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    jsoup

    jsoup

    Java library for working with real-world HTML

    ...The parser will make every attempt to create a clean parse from the HTML you provide, regardless of whether the HTML is well-formed or not. You have HTML in a Java String, and you want to parse that HTML to get at its contents, or to make sure it's well formed, or to modify it. The String may have come from user input, a file, or from the web.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    ANTLR

    ANTLR

    Parser generator to read, process, or translate structured text

    ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files. It's widely used to build languages, tools, and frameworks. From a grammar, ANTLR generates a parser that can build and walk parse trees. It’s widely used in academia and industry to build all sorts of languages, tools, and frameworks. Twitter search uses ANTLR for query parsing, with over 2 billion queries a day. The languages for...
    Downloads: 11 This Week
    Last Update:
    See Project
  • Wiz: #1 Cloud Security Software for Modern Cloud Protection Icon
    Wiz: #1 Cloud Security Software for Modern Cloud Protection

    Protect Everything You Build and Run in the Cloud

    Use the Wiz Cloud Security Platform to build faster in the cloud, enabling security, dev and devops to work together in a self-service model built for the scale and speed of your cloud development.
    Learn More
  • 5
    OpenDataLoader PDF

    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    OpenDataLoader PDF is an open-source document processing system designed to convert complex PDF files into structured, AI-ready formats such as Markdown, JSON, and HTML while preserving layout, hierarchy, and semantic meaning. It focuses on enabling downstream use cases like retrieval-augmented generation (RAG), knowledge extraction, and document intelligence pipelines by maintaining accurate reading order and spatial metadata through bounding boxes. The tool combines deterministic parsing...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 6
    JavaParser

    JavaParser

    Java 1-17 Parser and Abstract Syntax Tree for Java

    This project contains a set of libraries implementing a Java 1.0 - Java 17 Parser with advanced analysis functionalities. The project binaries are available in Maven Central. We strongly advise users to adopt Maven, Gradle or another build system for their projects. If you are not familiar with them we suggest taking a look at the maven quickstart projects. Since Version 3.5.10, the JavaParser project includes the JavaSymbolSolver.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    JSON Schema Validator

    JSON Schema Validator

    A fast Java JSON schema validator that supports draft V4, V6, V7, V201

    This is a Java implementation of the JSON Schema Core Draft v4, v6, v7, v2019-09 and v2020-12(partial) specification for JSON schema validation. In addition, it also works for OpenAPI 3.0 request/response validation with some configuration flags. For users who want to collect information from a JSON node based on the schema, the walkers can help.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    JSqlParser

    JSqlParser

    JSqlParser parses an SQL statement and translate it into a hierarchy

    JSqlParser is a SQL statement parser. It translates SQLs in a traversable hierarchy of Java classes. JSqlParser is not limited to one database but provides support for a lot of specials of Oracle, SqlServer, MySQL, PostgreSQL. To name some, it has support for Oracles join syntax using (+), PostgreSQLs cast syntax using ::, relational operators like != and so on. Then the result can be accessed in a structured way.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    jackson-core

    jackson-core

    Core part of Jackson that defines Streaming API

    This project contains core low-level incremental ("streaming") parser and generator abstractions used by Jackson Data Processor. It also includes the default implementation of handler types (parser, generator) that handle JSON format. The core abstractions are not JSON specific, although naming does contain 'JSON' in many places, due to historical reasons. Only packages that specifically contain the word 'json' are JSON-specific. This package is the base on which Jackson's data-binding...
    Downloads: 14 This Week
    Last Update:
    See Project
  • One Unified Time Tracking Software For Projects, Billing, Pay and Compliance Icon
    One Unified Time Tracking Software For Projects, Billing, Pay and Compliance

    For companies of all sizes looking for a Time Tracking software

    Replicon's time-tracking platform is scalable and configurable to support the diverse needs of small, mid & large businesses with a remote and globally distributed workforce. Replicon’s Time Tracking is a cloud-based, enterprise-grade solution that tracks employee time across projects, tasks, presence, and absence to facilitate client billing, project costing, and compliant payroll processing. The scalable and configurable platform offers seamless integration with common business technology stacks, such as ERP, CRM, Accounting, and payroll solutions. With AI-powered time capture, mobile apps, and labor compliance as a service, Replicon makes time tracking hassle-free.
    Learn More
  • 10
    thymeleaf

    thymeleaf

    Thymeleaf is a modern server-side Java template engine for web

    HTML templates written in Thymeleaf still look and work like HTML, letting the actual templates that are run in your application keep working as useful design artifacts. Eclipse, IntelliJ IDEA, Spring, Play, even the up-and-coming Model-View-Controller API for Java EE 8. Write Thymeleaf in your favourite tools, using your favourite web-development framework.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    Acode

    Acode

    A powerful text/code editor for Android

    ...Acode lets you build and run websites right in your browser, debug with ease using the built-in console, and edit a wide range of source files from Python and CSS to Java, JavaScript, Dart, and more.
    Downloads: 65 This Week
    Last Update:
    See Project
  • 12
    JBake

    JBake

    Java based open source static site/blog generator for developers

    JBake is a Java-based, open source, static site/blog generator for developers & designers. The project uses Gradle 4.9+ as the build system. We configured the gradle check style Plugin to run with the check task. It does not break the build if convention violations are found. But prints a warning and generates a report. Source available on GitHub, licensed under MIT License.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Apache Baremaps

    Apache Baremaps

    Create custom vector tiles from OpenStreetMap

    Baremaps is an Apache Incubator project that provides tools and a Java-based pipeline for building and rendering vector tiles from OpenStreetMap (OSM) data. It’s designed for fast map generation, serving tiles, and supporting real-time updates, making it a powerful backend for map-based applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Asciidoc Editor based on JavaFX 20

    Asciidoc Editor based on JavaFX 20

    Asciidoc Editor and Toolchain written with JavaFX 19

    Asciidoc FX is a WYSIWYG editor for the Asciidoc markup language. You can build PDF, Epub, and HTML books, documents, and slides. Supported Operating Systems and Builds shows the list of available builds with links for reference. If you are looking for the very latest version, visit the link in the note above to be guaranteed of downloading the latest and greatest version of AsciidocFX. AsciidocFX converts documents via the AsciidoctorJ library.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    EasyExcel

    EasyExcel

    Lightweight Java library developed by Alibaba for reading and writing

    EasyExcel is a Java library focused on reading and writing Excel files with very low memory usage, making it suitable for large datasets that overwhelm traditional APIs. It uses streaming/event-driven parsing to avoid loading entire workbooks into memory, and it maps rows to Java objects via simple annotations. Writers support multiple sheets, custom styles, merged cells, and template-based filling so production reports remain maintainable. The API emphasizes developer ergonomics: callbacks...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 16
    TeXtidote

    TeXtidote

    Spelling, grammar and style checking on LaTeX documents

    If so, you probably know that the process is far from simple. Since LaTeX documents contain special commands and keywords (the so-called "markup") that are not part of the "real" text, you cannot run a grammar checker directly on these files: it cannot tell the difference between markup and text. The other option is to remove all this markup, leaving only the "clear" text; however, when a grammar tool points to a problem at a specific line in this clear text, it becomes hard to retrace that...
    Downloads: 145 This Week
    Last Update:
    See Project
  • 17
    Joern

    Joern

    Open-source code analysis platform for C/C++/Java/Binary/Javascript

    Joern is a platform for analyzing source code, bytecode, and binary executables. It generates code property graphs (CPGs), a graph representation of code for cross-language code analysis. Code property graphs are stored in a custom graph database. This allows code to be mined using search queries formulated in a Scala-based domain-specific query language. Joern is developed with the goal of providing a useful tool for vulnerability discovery and research in static program analysis.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 18
    Apache Log4j

    Apache Log4j

    Apache Log4j 2 is a versatile, feature-rich, efficient logging API

    Apache Log4j is a versatile, industrial-grade Java logging framework composed of an API, its implementation, and components to assist the deployment for various use cases. Log4j is used by 8% of the Maven ecosystem and listed as one of the top 100 critical open source software projects. The project is actively maintained by a team of several volunteers and supported by a big community.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 19
    Byte Buddy

    Byte Buddy

    Runtime code generation for the Java virtual machine

    Byte Buddy is a code generation and manipulation library for creating and modifying Java classes during the runtime of a Java application and without the help of a compiler. Other than the code generation utilities that ship with the Java Class Library, Byte Buddy allows the creation of arbitrary classes and is not limited to implementing interfaces for the creation of runtime proxies. Furthermore, Byte Buddy offers a convenient API for changing classes either manually, using a Java agent or...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Hawtio

    Hawtio

    Hawtio web console helps you manage your JVM stuff and stay cool

    Hawtio is a lightweight and modular Web console for managing Java applications. Hawtio has plugins such as: Apache Camel and JMX (Logs, Spring Boot, Quartz, and more will be provided soon). You can dynamically extend Hawtio with your own plugins or automatically discover plugins inside the JVM. The only server-side dependency (other than the static HTML/CSS/JS/images) is the excellent Jolokia library which has a small footprint (around 300KB) and is available as a JVM agent or comes embedded as a servlet inside the hawtio-default.war.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    cordova-plugin-camera-preview

    cordova-plugin-camera-preview

    Cordova plugin that allows camera interaction from HTML code

    Cordova Camera Preview is a plugin that allows developers to add real-time camera preview functionality to Cordova-based mobile apps. It provides direct access to the device's camera feed with customizable overlays and controls, enabling the creation of augmented reality, scanning, and camera-based interaction apps.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    WebMagic

    WebMagic

    A scalable web crawler framework for Java

    WebMagic is a scalable crawler framework. It covers the whole lifecycle of crawler, downloading, url management, content extraction and persistent. It can simplify the development of a specific crawler. WebMagic is a simple but scalable crawler framework. You can develop a crawler easily based on it. WebMagic has a simple core with high flexibility, a simple API for html extracting. It also provides annotation with POJO to customize a crawler, and no configuration is needed. Some other...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    capacitor

    capacitor

    Build cross-platform native progressive web apps for iOS and Android

    Capacitor is an open source native runtime for building Web Native apps. Create cross-platform iOS, Android, and Progressive Web Apps with JavaScript, HTML, and CSS. Capacitor’s native plugin APIs make it extremely easy to access and invoke common device functionality across multiple platforms. Build web-based applications that run equally well across iOS, Android, and as Progressive Web Apps. Access the full Native SDKs on each platform, and easily deploy to the App Stores (and the web)....
    Downloads: 41 This Week
    Last Update:
    See Project
  • 24
    Kryo

    Kryo

    Java binary serialization and cloning, fast, efficient, automatic

    Kryo is a fast and efficient binary object graph serialization framework for Java. The goals of the project are high speed, low size, and an easy-to-use API. The project is useful any time objects need to be persisted, whether to a file, database or over the network. Kryo can also perform automatic deep and shallow copying/cloning. This is direct copying from object to object, not object to bytes to object. Kryo has three sets of methods for reading and writing objects. If the concrete class...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Agents-Flex

    Agents-Flex

    Agents-Flex is an elegant LLM Application Framework like LangChain

    Agents-Flex includes a variety of network protocols for connecting LLMs, such as HTTP, SSE and WS. Its simple and flexible design allows developers to easily connect to various LLMs, including OpenAI, LLama, and other AI. Agents-Flex provides a rich set of development templates and Prompt Frameworks, including FEW-SHOT, CRISPE, BROKE, and ICIO. Developers can also customize their own unique prompt templates. Agents-Flex has a very flexible Function Calling component. It supports local method...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB