Skip to content
@ContentMine

The ContentMine

The ContentMine is extracting 100 million facts from the academic literature

Popular repositories Loading

  1. quickscrape quickscrape Public

    A scraping command line tool for the modern web

    JavaScript 260 42

  2. getpapers getpapers Public

    Get metadata, fulltexts or fulltext URLs of papers matching a search query

    JavaScript 197 37

  3. journal-scrapers journal-scrapers Public

    Journal scraper definitions for the ContentMine framework

    Ruby 66 33

  4. norma norma Public

    Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML

    HTML 37 21

  5. workshop-resources workshop-resources Public

    This repository contains material helping you to set up a ContentMine workshop. It also includes tutorials for learning the ContentMine tools on your own.

    37 13

  6. scraperJSON scraperJSON Public

    The scraperJSON standard for defining web scrapers as JSON objects

    33 2

Repositories

Showing 10 of 101 repositories
  • norma Public

    Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML

    ContentMine/norma’s past year of commit activity
    HTML 37 Apache-2.0 21 34 12 Updated Jan 22, 2024
  • getpapers Public

    Get metadata, fulltexts or fulltext URLs of papers matching a search query

    ContentMine/getpapers’s past year of commit activity
    JavaScript 197 MIT 37 70 6 Updated Jul 15, 2020
  • contentmine-gui Public

    GUI for executing ContentMine commands - browser SPA for running locally on user's machine.

    ContentMine/contentmine-gui’s past year of commit activity
    JavaScript 1 0 3 0 Updated Jun 21, 2020
  • CMForestPlots Public

    Things for managing the ContentMine forest plot functionality in normal

    ContentMine/CMForestPlots’s past year of commit activity
    Python 0 Apache-2.0 1 0 0 Updated Nov 17, 2019
  • sciencesource-wikibase-docker Public Forked from wmde/wikibase-docker

    🐳 Docker images and compose file for Wikibase and the query service

    ContentMine/sciencesource-wikibase-docker’s past year of commit activity
    Shell 2 95 0 0 Updated Oct 24, 2019
  • vms Public

    ContentMine virtual machines

    ContentMine/vms’s past year of commit activity
    3 CC0-1.0 6 10 1 Updated Oct 23, 2019
  • cephis Public

    Document processing including support libraries and PDFBox2

    ContentMine/cephis’s past year of commit activity
    1 0 0 0 Updated Aug 31, 2019
  • stataforestplots Public

    documents and tests relating to ForestPlots in Stata format

    ContentMine/stataforestplots’s past year of commit activity
    0 Apache-2.0 0 2 0 Updated Jul 9, 2019
  • junk Public

    analysis of documents containing forest plots in Stata format

    ContentMine/junk’s past year of commit activity
    0 Apache-2.0 0 0 0 Updated Jun 20, 2019
  • ContentMine/ScienceSourceReview’s past year of commit activity
    Go 1 Apache-2.0 1 0 0 Updated May 18, 2019

Top languages

Loading…

Most used topics

Loading…