this post was submitted on 09 Sep 2023
23 points (79.5% liked)

Asklemmy

43328 readers
1455 users here now

A loosely moderated place to ask open-ended questions

Search asklemmy πŸ”

If your post meets the following criteria, it's welcome here!

  1. Open-ended question
  2. Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
  3. Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
  4. Not ad nauseam inducing: please make sure it is a question that would be new to most members
  5. An actual topic of discussion

Looking for support?

Looking for a community?

~Icon~ ~by~ ~@Double_A@discuss.tchncs.de~

founded 5 years ago
MODERATORS
 
  1. Don't have ChatGPT
  2. OCR needed
  3. Preferably Android

Thanks.

you are viewing a single comment's thread
view the rest of the comments
[–] nottheengineer@feddit.de 7 points 1 year ago (9 children)

Android won't be easy, but you can slap together a python script that runs tesseract or easyOCR and runs it through a pretrained LLM like T5. Those are well-known and well-documented, so chatGPT can probably write the script for you without too many hiccups.

[–] ciagovv@lemm.ee 0 points 1 year ago (2 children)

And you can run that in termux, so you csn use it in android

[–] nottheengineer@feddit.de 2 points 1 year ago (1 children)

Good luck trying to install tesseract and a deep learning framework in termux.

[–] HamBrick@programming.dev 2 points 1 year ago

You can’t tell me what to do! Just watch me

load more comments (6 replies)