Ir à oferta completa

ONLY FOR @PRAYOGO803 AUTOMATED EXTRACTION AND STRUCTURING OF RECIPES FROM PDF COOKBOOKS

Descrição da oferta de emprego

I have a collection of approximately 50 PDF cookbooks, containing an estimated total of recipes.
Some of these cookbooks have already been split into individual recipe files, while others remain in their original format.
I am seeking a freelancer to automate the process of separating the remaining recipes, using the most appropriate tools, potentially including DALL-E or other advanced OCR and text recognition software.
Scope of Work.
The selected freelancer will be responsible for the following tasks.
1.
Text Recognition.
• Extract text from the recipes, even if they are embedded in images within the PDF files.
• Ensure the recognition process accurately captures all text, including any non-standard fonts or formats used in the cookbooks.
2.
Recipe Identification.
• Detect and separate individual recipes, even if they span across multiple pages.
• Ensure that each recipe is fully captured, without splitting across pages unless the recipe itself does.
3.
Data Conversion.
• Convert the extracted text from each recipe into a structured JSON format.
• The JSON should include fields such as Title, Title of the Book, Short Description, Ingredients, Cooking Process, Categories, and Tags.
• Categories and Tags will be provided by me.
4.
Language Consistency.
• Ensure that all extracted recipes are in English.
For any non-English recipes, a translation process may be necessary.
5.
Database Creation.
• Input the JSON-formatted recipes into a database platform, such as Airtable, which I will provide access to.
• Each recipe entry in the database must include a unique identification number and all the relevant fields.
Deliverables.
• A fully populated Airtable database containing all the recipes, accurately categorized and tagged.
• JSON files for each recipe, stored in a systematic folder structure.
• A report detailing the process, including any challenges encountered and how they were addressed.
To ensure that we can be autonomous in future projects involving new books, it is essential that the deliverables include all the supporting files used throughout the process.
This should encompass everything from initial materials to final versions, as well as any relevant documentation explaining the workflow and configurations used.
These files will enable us to replicate and adapt the processes independently in the future, ensuring continuity and efficiency in our publishing projects.
Skills Required.
• Expertise in OCR technology and text extraction from PDFs, especially where text is embedded in images.
• Experience with tools like DALL-E or similar for image and text recognition.
• Strong knowledge of JSON formatting and database management.
• Familiarity with Airtable or similar database platforms.
• Fluency in English, with experience in translation if necessary.
Timeline.
Please provide an estimated timeline for completing this project, considering the volume of work involved.
Budget.
I am open to bids, but please provide a detailed breakdown of costs, including any software licenses or tools that may be required.
Application Requirements.
• Please provide examples of similar projects you have completed.
• A brief outline of the tools and methods you would use to accomplish this project.
• Your proposed timeline and budget.
Dall-E Data Collection JSON OCR Python ID do Projeto.
# Sobre o projeto 15 propostas Aberto para ofertas Projeto remoto Ativo em 19 minutos atrás
Ir à oferta completa

Detalhes da oferta

Empresa
  • Indeterminado
Localidade
  • Em todo Portugal
Endereço
  • Indeterminado - Indeterminado
Data de publicação
  • 03/09/2024
Data de expiração
  • 02/12/2024
French and english backoffice support for hotel hybrid work
Paco recrutiment

Prowadź rejestracja lub akcja z klientem, rejestrując szczegółowe zapytania, skargi lub komentarze, a także wynikające z nich działania... 2023! twoje: zadanie odpowiedzialność za wsparcie rozwiązania, zapewnienie prawidłowej i skutecznej realizacji polityk, procedur i działań......

French and English Backoffice Support for Hotel Hybrid work
Paco Recrutiment

Przygotowywanie, utrzymywanie i przeglądanie plików zakupów, raportów i cenników... sprawdzaj przesyłki po ich otrzymaniu, aby upewnić się, że zamówienia zostały prawidłowo zrealizowane i że towary odpowiadają określonym specyfikacjom... nasze oczekiwania: obywatelstwo ue lub zezwolenie na pobyt w portugalii......

Customer Support German and English for Insurance Company
Paco Recrutiment

Nie musisz mieć wcześniejszego doświadczenia - oferuj szkolenia! lokalizacja: wschód – lizbona projekt rozpoczyna się 8... 2023 firma: rozwiązanie na świecie notowana na giełdzie firma zajmująca się ubezpieczeniami majątkowymi i osobowymi... polityka zmieniania, aktualizująca szczegóły płatności zgłaszaj......

Automation and Robotics Engineer
TECNICOAT, LDA

Stay abreast of industry trends and emerging technologies in automation and robotics... conduct feasibility studies and cost analyses for automation projects... strong knowledge of robotic systems, sensors, and control systems... young graduate with a strong desire for designing and implementing automation......

Position: Translator and Content Specialist (Portuguese)
DAC SERVICES AND SOLUTIONS LTD

Adapt surveys and questionnaires for portuguese-speaking audiences, ensuring cultural relevance and clarity... marketing materials:- translate brochures and other marketing materials from de>pt and en>pt... verify correct settings and specifications for the portuguese market... proven experience in translation......

SAP BO – Reporting and Data Analyst
Equação it

We are looking for a sap bo – reporting and data analyst with the following requirements: requisitos do trabalho • extraction and analysis of data from various sources;• participation in the data delivery process with the entire delivery environment;• import (incl... maintenance) of raw data from various......

NURSE FOR SENIOR CARE IN GERMANY
Eugenia talent recruitment

• monitor and record residents' health status and needs... they are in search of empathetic and proficient nursing staff with expertise in nursing or medicine, capable of offering thorough care services such as dressing, bathing, and maintaining cleanliness for the well-being of their residents......

Customer Support with Dutch & English for Search Engine
Paco Recrutiment

” we are looking for dutch speaking employees for our team in lisbon to support our customers... its flagship product is their search engine, and its declared mission - 'to organize the world's information resources so that they become widely available and useful for everyone... project starts 15......

Customer Support with German & English for Search Engine
Paco recrutiment

” we are looking for german speaking employees for our team in lisbon to support our customers... its flagship product is their search engine, and its declared mission - 'to organize the world's information resources so that they become widely available and useful for everyone... project starts 15......

Project Manager (Knowledge of agile methodology)
Equação it

We are looking for a project manager with the following requirements: requisitos do trabalho key skills and responsibilities:• english - c1;• candidates should be confident technical project managers with experience of the full software development lifecycle;• knowledge of agile methodology;• experience......