Software prerequisites:
    - apt-cacher-ng
    - debiman (install from repository, follow instructions in README)
    - mandoc
    - python3
    - python3-venv
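
A quick way to see which of the command-line tools above are still missing is
to look them up on PATH. The sketch below is only an illustration:
`missing_tools` is a hypothetical helper, and it assumes the tools are installed
under the names debiman, mandoc and python3. A debiman binary built from the
repository may live outside PATH, and apt-cacher-ng runs as a service, so a
"missing" result is a hint rather than proof.

```python
import shutil

# Executables assumed to correspond to the prerequisites listed above.
DEFAULT_TOOLS = ("debiman", "mandoc", "python3")


def missing_tools(names=DEFAULT_TOOLS):
    """Return the names that cannot be found on the current PATH."""
    return [name for name in names if shutil.which(name) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("not found on PATH:", ", ".join(missing))
    else:
        print("all checked tools were found")
```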

After apt-cacher-ng is installed, the proxy starts automatically and should be
listening on port 3142.
By default, debiman downloads from http://localhost:3142/deb.debian.org/
This can be changed with the -local_mirror or -remote_mirror option. Debian
releases that are no longer under LTS support are removed from the regular
mirrors; for those, use the archive instead: https://archive.debian.org
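
Before starting a download it can help to confirm that something is actually
answering on the proxy port. The following is a minimal sketch, assuming the
default host and port mentioned above. The names `proxy_is_listening` and
`proxied_mirror_url` are illustrative and are not part of debiman or
apt-cacher-ng, and the check only proves that a TCP connection is accepted, not
that the listener is apt-cacher-ng.

```python
import socket


def proxy_is_listening(host="localhost", port=3142, timeout=2.0):
    """Return True if a TCP connection to host:port is accepted."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Connection refused, timeout, unresolvable host, and so on.
        return False


def proxied_mirror_url(mirror_host="deb.debian.org",
                       proxy_host="localhost", proxy_port=3142):
    """Build the URL under which the proxy exposes a given mirror host."""
    return f"http://{proxy_host}:{proxy_port}/{mirror_host}/"


if __name__ == "__main__":
    if proxy_is_listening():
        print("proxy reachable, mirror URL:", proxied_mirror_url())
    else:
        print("nothing is listening on port 3142; is apt-cacher-ng running?")
```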

How to use the scripts:

1. Download manpages from Debian:

    $ MIRROR="https://archive.debian.org" CODENAME="buster" DEBIMAN_SERVING_DIR=/path make download

2. Extract all the pages from their .gz files:

    $ DEBIMAN_SERVING_DIR=/path make extract

3. Convert the manpages to plain text and HTML:

    $ DEBIMAN_SERVING_DIR=/path make convert

4. Create the RDF graph:

    $ python3 -m venv venv
    $ source venv/bin/activate
    $ pip install rdflib
    $ CODENAME="buster" DEBIMAN_SERVING_DIR=/path ./rdf.py
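
The four steps above can also be chained from one script. This is a minimal
sketch, assuming it is run from the repository root (where the Makefile and
rdf.py live) and that the virtual environment with rdflib is already active.
`build_steps` and `run_pipeline` are hypothetical helper names, not part of the
repository; the environment variables and commands are the ones shown in the
steps above.

```python
import os
import subprocess


def build_steps(serving_dir, codename="buster",
                mirror="https://archive.debian.org"):
    """Return (extra_env, argv) pairs mirroring the four documented steps."""
    base = {"DEBIMAN_SERVING_DIR": serving_dir}
    return [
        ({**base, "MIRROR": mirror, "CODENAME": codename}, ["make", "download"]),
        (dict(base), ["make", "extract"]),
        (dict(base), ["make", "convert"]),
        ({**base, "CODENAME": codename}, ["./rdf.py"]),
    ]


def run_pipeline(serving_dir, codename="buster",
                 mirror="https://archive.debian.org"):
    """Run each step in order; check=True stops at the first failing step."""
    for extra_env, argv in build_steps(serving_dir, codename, mirror):
        env = {**os.environ, **extra_env}
        subprocess.run(argv, env=env, check=True)
```

Calling `run_pipeline("/path")` would then run download, extract, convert and
rdf.py in sequence with the same variables as the manual commands.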