Convert to text
19. 7. 2025
pdf ↓
# package: poppler-utils
pdftotext file.pdf  # writes file.txt next to the PDF
epub ↓
# Doesn't need X
# https://github.com/kevinboone/epub2txt2
epub2txt -a -n file.epub > file.txt
or much slower:
# Needs X
# package: calibre
ebook-convert file.epub file2.txt
html ↓
# package: html2text
html2text < file.htm > file.txt
or, with worse output:
# package: lynx (or w3m or elinks)
lynx --dump file.htm > file.txt
or
# package: pandoc
pandoc -f html -t plain file.htm -o file.txt
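The commands above can be wrapped in one small helper that picks the converter from the file extension. This is only a sketch: the function names `converter_for` and `to_text` are made up for illustration, it uses only the first-choice tool for each format, and it assumes lowercase extensions.

```shell
#!/bin/sh
# Hypothetical helper: print the preferred converter for a file name.
# Only the first-choice tool from each section above is listed.
converter_for() {
    case "$1" in
        *.pdf)        echo pdftotext ;;
        *.epub)       echo epub2txt ;;
        *.htm|*.html) echo html2text ;;
        *)            return 1 ;;
    esac
}

# Hypothetical helper: convert "$1" to a .txt file with the same base name.
to_text() {
    src=$1
    out=${src%.*}.txt
    tool=$(converter_for "$src") || {
        echo "unsupported file type: $src" >&2
        return 1
    }
    # Stop early if the chosen tool is not installed.
    command -v "$tool" >/dev/null 2>&1 || {
        echo "$tool is not installed" >&2
        return 1
    }
    case "$tool" in
        pdftotext) pdftotext "$src" "$out" ;;
        epub2txt)  epub2txt -a -n "$src" > "$out" ;;
        html2text) html2text < "$src" > "$out" ;;
    esac
}
```

After sourcing the file, `to_text book.epub` should leave `book.txt` in the same directory. The slower or lower-quality fallbacks (calibre, lynx, pandoc) are left out on purpose to keep the dispatch short.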