Image based PDF upload
in progress
C
Conceptual Wren
Google Gemini supports direct upload of image-based PDFs. Currently, I'm not able to upload an image-based PDF as your platform parses the text locally and then sends it to the LLM. Please add functionality to support native platform file upload.
S
Solar Crocodile
Esto esta repetido y tiene otros 23 votos.
https://typingmind.canny.io/feature-requests/p/send-pdfs-directly-to-model-for-processing
S
Solar Crocodile
Tony Dinh Do you have any updates this feature? This has become a critical feature for us.
We’ve received multiple complaints from our users. This creates a very negative perception: many users conclude that our platform doesn't work and abandon it entirely, even when we provide technical workarounds like our own external OCR tool. For older users or those with limited tech experience — which is the majority of our user base — these extra steps are an imposible barrier.
Adding this feature would significantly improve the user experience. Is there any progress in this?
Thank you in advance,
S
Solar Crocodile
Ngoc Nguyen Just wondering if you have any rough idea of when this feature might be released? Super excited about it!"
B
Blizzard blue Lobster
looks like all providers support this now:
- Claude: https://docs.anthropic.com/en/docs/build-with-claude/pdf-support?q=pdf#process-pdfs-with-claude
- Gemini: https://ai.google.dev/gemini-api/docs/document-processing
- ChatGPT: https://platform.openai.com/docs/guides/pdf-files?api-mode=chat#base64-encoded-files
- Openrouter: https://openrouter.ai/docs/features/images-and-pdfs
R
Ruby Scallop
That would be a game changer
N
Ngoc Nguyen
in progress
M
Magenta Sailfish
Claude can do this too. Please add support for both.