this post was submitted on 07 Feb 2025
539 points (98.0% liked)
Programmer Humor
20417 readers
241 users here now
Welcome to Programmer Humor!
This is a place where you can post jokes, memes, humor, etc. related to programming!
For sharing awful code theres also Programming Horror.
Rules
- Keep content in english
- No advertisements
- Posts must be related to programming or programmer topics
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I'm still mad there's no straightforward way to convert a PDF into semantic HTML. There's plenty of tools to convert it into HTML that looks the same with pages and such, but I just want the content.
Would it work to convert it to a simpler intermediate format like rtf or txt, and then convert into html? Why html anyway, Isn't epub more appropriate?
I just hate two column paginated lay outs. Give me pageless single column text.
Yeah I get that. I've just gotten used to leaving pdfs the way they are, and choosing to read them on more appropriate devices like laptops or tablets.
Sounds like a good opportunity for a crowdfunded start up.