WEBSITE CONTENT EXTRACTOR IN PYTHON

Job offer description

Request for Proposal (RFP)

Project Title
Python Program for Extracting Articles from a Website Site Map into .docx Files

Project Overview
We are seeking a proficient Python developer to create a program that extracts articles from a specific website’s site map (e.g., [login to view URL]) and downloads each article published within a specified time range (e.g., the past 24 hours). Each article should be saved into a separate .docx file, named according to the publication date and time. The final program should be user-friendly and well-documented to allow non-technical users to configure and run the script.

Project Scope and Deliverables

1. Python Script (.py file)
• Develop a Python script that takes a site map URL (e.g., [login to view URL]) as input and extracts all article URLs from the specified page.
• Implement logic to filter articles based on a given time range (e.g., the last 24 hours or between specific start and end times).
• Download each article found in the specified time range and save it in a separate .docx file. The .docx file should include:
  • Title of the article (as the document header)
  • URL of the source page
  • Publication Date and Time
  • Author Name (if available)
  • Main Body Text
• Filename format: [login to view URL] (e.g., [login to view URL]).
• Implement options to include/exclude metadata (e.g., tags, categories) as needed.
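
The extraction and filtering steps in item 1 could be sketched roughly as follows. This is only an illustration under stated assumptions: it assumes the site map is an XML file following the sitemaps.org protocol and that each entry carries a <lastmod> timestamp, which the posting does not confirm (an HTML site map page would need a different parser). The names parse_sitemap, filter_by_range, and SAMPLE_SITEMAP are hypothetical.

```python
from datetime import datetime, timedelta, timezone
import xml.etree.ElementTree as ET

# Namespace used by XML sitemaps that follow the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

# Hypothetical sample data standing in for a downloaded site map.
SAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/a</loc><lastmod>2024-10-10T08:00:00Z</lastmod></url>
  <url><loc>https://example.com/b</loc><lastmod>2024-10-08T09:30:00+00:00</lastmod></url>
  <url><loc>https://example.com/c</loc></url>
</urlset>"""


def parse_sitemap(xml_text):
    """Return a list of (url, timestamp) pairs found in sitemap XML text.

    Entries without a <loc> or <lastmod> value are skipped, because they
    cannot be placed in a time range.
    """
    root = ET.fromstring(xml_text)
    entries = []
    for node in root.findall("sm:url", SITEMAP_NS):
        loc = node.findtext("sm:loc", default="", namespaces=SITEMAP_NS).strip()
        lastmod = node.findtext("sm:lastmod", default="", namespaces=SITEMAP_NS).strip()
        if not loc or not lastmod:
            continue
        # Older Python versions do not accept a trailing "Z" in fromisoformat.
        stamp = datetime.fromisoformat(lastmod.replace("Z", "+00:00"))
        if stamp.tzinfo is None:
            # Assumption: timestamps without an offset are treated as UTC.
            stamp = stamp.replace(tzinfo=timezone.utc)
        entries.append((loc, stamp))
    return entries


def filter_by_range(entries, start, end):
    """Keep entries whose timestamp lies between start and end, inclusive."""
    return [(url, stamp) for url, stamp in entries if start <= stamp <= end]


if __name__ == "__main__":
    now = datetime(2024, 10, 10, 12, 0, tzinfo=timezone.utc)
    recent = filter_by_range(parse_sitemap(SAMPLE_SITEMAP), now - timedelta(hours=24), now)
    for url, stamp in recent:
        print(stamp.isoformat(), url)
```

In a real script the XML text would come from an HTTP request (for example with the requests library suggested in the posting) rather than from an embedded sample.
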

2. Output Files
• Each article should be saved in a separate .docx file in the specified output directory.
• Store additional metadata or a summary file (e.g., a .csv file listing all downloaded articles with their URLs and publication times) if needed.
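
One possible shape for the output step is sketched below, using python-docx (one of the suggested libraries) and the standard csv module. Several details are assumptions made for illustration: the filename pattern (the actual format is hidden in the posting), the "Source", "Published", and "Author" labels, the summary.csv name, and the article dictionary keys. The function names are hypothetical as well.

```python
import csv
from pathlib import Path


def docx_filename(published):
    """Build a filename from the publication datetime.

    Assumption: the YYYY-MM-DD_HH-MM pattern is illustrative only; the
    posting does not show the required format.
    """
    return published.strftime("%Y-%m-%d_%H-%M") + ".docx"


def save_article_docx(article, output_dir):
    """Write one article to its own .docx file and return the file path."""
    # python-docx is imported here so the CSV helper below can be used
    # even when python-docx is not installed.
    from docx import Document

    output_dir = Path(output_dir)
    output_dir.mkdir(parents=True, exist_ok=True)

    doc = Document()
    doc.add_heading(article["title"], level=1)
    doc.add_paragraph(f"Source: {article['url']}")
    doc.add_paragraph(f"Published: {article['published'].isoformat()}")
    if article.get("author"):  # the author name is optional
        doc.add_paragraph(f"Author: {article['author']}")
    for block in article["body"].split("\n\n"):
        doc.add_paragraph(block)

    path = output_dir / docx_filename(article["published"])
    doc.save(str(path))
    return path


def write_summary_csv(articles, output_dir):
    """Write a summary CSV with one row per article and return its path."""
    output_dir = Path(output_dir)
    output_dir.mkdir(parents=True, exist_ok=True)
    path = output_dir / "summary.csv"
    with path.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["filename", "url", "published"])
        for article in articles:
            writer.writerow([
                docx_filename(article["published"]),
                article["url"],
                article["published"].isoformat(),
            ])
    return path
```

With a minute-level filename, two articles published in the same minute would overwrite each other, so a real implementation would need a tie-breaker such as a counter or a slug.
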

3. User Interface & Usability
• Provide a user-friendly interface or command-line options for configuring parameters such as:
  • Site Map URL: Input the URL of the site map page (e.g., [login to view URL]).
  • Time Range: Specify a time range for filtering articles (e.g., “last 24 hours” or between YYYY-MM-DD and YYYY-MM-DD).
  • Output Directory: Set the destination folder for saving the downloaded .docx files.
• Error handling should be robust, with clear messages for common issues (e.g., “Invalid site map URL” or “No articles found in the specified time range”).
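
A command-line interface covering the parameters in item 3 might look like the sketch below, built on the standard argparse module. The option names (--last-hours, --start, --end, --output-dir), the defaults, and the helper names are assumptions chosen for illustration; the posting does not prescribe them.

```python
import argparse
from datetime import datetime
from urllib.parse import urlparse


def valid_sitemap_url(value):
    """Accept only http(s) URLs that include a host name."""
    parsed = urlparse(value)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise argparse.ArgumentTypeError(f"Invalid site map URL: {value!r}")
    return value


def iso_date(value):
    """Parse a YYYY-MM-DD string; argparse reports a ValueError as a usage error."""
    return datetime.strptime(value, "%Y-%m-%d").date()


def build_parser():
    """Create the argument parser for the hypothetical extractor script."""
    parser = argparse.ArgumentParser(
        description="Save recently published articles from a site map as .docx files."
    )
    parser.add_argument("sitemap_url", type=valid_sitemap_url,
                        help="URL of the site map page")
    parser.add_argument("--last-hours", type=int, default=24,
                        help="look back this many hours (default: 24)")
    parser.add_argument("--start", type=iso_date,
                        help="start date, YYYY-MM-DD")
    parser.add_argument("--end", type=iso_date,
                        help="end date, YYYY-MM-DD")
    parser.add_argument("--output-dir", default="articles",
                        help="folder for the generated .docx files")
    return parser


def result_message(article_count):
    """Return the status line shown to the user after filtering."""
    if article_count == 0:
        return "No articles found in the specified time range"
    return f"Saved {article_count} article(s)"


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args)
```

When the URL check fails, argparse prints the "Invalid site map URL" message together with the usage text and exits with a non-zero status, which keeps the error visible to non-technical users.
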

4. Detailed Documentation
• Provide a README file with:
  • Installation instructions (including dependencies).
  • Detailed usage instructions, covering:
    • How to set up and run the script.
    • How to specify the time range and site map URL.
    • Optional configuration settings.
  • Troubleshooting guide for common errors.

5. Code Quality
• The code should be clean, modular, and well-commented, adhering to Python best practices and the PEP8 coding standard.
• Use meaningful variable names and clear function structures.

Technical Requirements

1. Programming Language: Python (latest stable version).
2. Libraries:
  • Suggested libraries include BeautifulSoup, requests, lxml, and python-docx.
  • The developer can recommend additional libraries as needed, but must document their usage in a [login to view URL] file.
3. Environment Compatibility: The script should be compatible with Windows and Unix-based systems.
4. Time Range Specification: Implement logic to handle time ranges in hours or days (e.g., articles published within the last 24 hours, or between specific start and end dates).
5. Data Compliance: Ensure the solution adheres to the target website’s Terms of Service and does not violate any legal restrictions.
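
The time range requirement (hours or days, or explicit start and end dates) could be handled along these lines. The "24h" / "7d" shorthand, the use of UTC, and the function names are assumptions for illustration; the posting only states that both relative and absolute ranges must be supported.

```python
import re
from datetime import datetime, timedelta, timezone

# Matches a whole number followed by "h" (hours) or "d" (days), e.g. "24h".
_RANGE_PATTERN = re.compile(r"^\s*(\d+)\s*([hd])\s*$", re.IGNORECASE)


def parse_relative_range(spec, now=None):
    """Turn a spec such as "24h" or "7d" into a (start, end) datetime pair."""
    match = _RANGE_PATTERN.match(spec)
    if match is None:
        raise ValueError(f"Unrecognized time range: {spec!r} (expected e.g. '24h' or '7d')")
    amount = int(match.group(1))
    unit = match.group(2).lower()
    delta = timedelta(hours=amount) if unit == "h" else timedelta(days=amount)
    end = now if now is not None else datetime.now(timezone.utc)
    return end - delta, end


def parse_absolute_range(start_date, end_date):
    """Turn two YYYY-MM-DD strings into a (start, end) pair in UTC.

    The returned end is midnight after end_date, so the whole end day is
    covered when callers test start <= timestamp < end.
    """
    start = datetime.strptime(start_date, "%Y-%m-%d").replace(tzinfo=timezone.utc)
    end = datetime.strptime(end_date, "%Y-%m-%d").replace(tzinfo=timezone.utc)
    end += timedelta(days=1)
    if start >= end:
        raise ValueError("Start date must not be after end date")
    return start, end


if __name__ == "__main__":
    print(parse_relative_range("24h"))
    print(parse_absolute_range("2024-10-01", "2024-10-03"))
```

Passing a fixed now value makes the relative range reproducible in tests, while leaving it out uses the current UTC time.
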

Project Timeline
The project is expected to be completed within 4 weeks from the award date, with the following milestones:
1. Day 1: Initial project setup and development of site map extraction module.
2. Day 2: Implementation of time range filtering and .docx export functionality.
3. Day 3: Internal testing and optimization of the script.
4. Day 4: Delivery of a beta version for client review, followed by final adjustments and delivery of the completed project.

Project Budget
Proposals should include a detailed cost breakdown, including estimated hours for each development phase and any additional costs for third-party libraries or tools.

Submission Requirements

1. Proposal Submission Deadline: [Insert Deadline Date]
2. Proposal Format:
  • Company or freelancer profile.
  • Portfolio of relevant Python and web scraping projects.
  • Proposed approach and implementation strategy.
  • Project timeline and cost estimate.
  • Contact details.
3. Evaluation Criteria:
  • Expertise in Python programming, web scraping, and data extraction.
  • Experience in working with .docx file formats.
  • Ability to create a user-friendly solution.
  • Adherence to the timeline and budget constraints.

Submission Contact
All proposals should be submitted to:
• Contact Name
• Email Address

Additional Notes
1. The developer must provide post-delivery support for a period of 2 weeks to address any bugs or issues discovered in the program.
2. All intellectual property rights to the source code and documentation will be transferred to the client upon project completion and final payment.
3. Any changes to the project scope should be mutually agreed upon and documented.

Python MySQL
Project ID: #
About the project: 42 proposals, open for bids, remote project, active 15 minutes ago

Offer details

Company
  • Not specified
Location
  • All of Portugal
Address
  • Not specified - Not specified
Publication date
  • 10/10/2024
Expiration date
  • 08/01/2025
Position: Translator and Content Specialist (Portuguese)
DAC SERVICES AND SOLUTIONS LTD

Content updates: update translated product descriptions, ingredients, feeding recommendations, and other related content as needed... Content localization: translate and adapt landing pages to engage Portuguese-speaking users... Proven experience in translation and content localization, preferably in......

DUTCH VIDEO CONTENT ANALYST
SpotOn Connections

Our client is looking for a Dutch video content analyst to join their growing team in Lisbon – Portugal... Do you love social media? Are you a fan of vlogging or constantly looking to videos to help solve your challenges? If the answer is yes, then you must start your career with a global company working......

German Video Content Analyst
SpotOn Connections

Our client is looking for a German video content analyst to join their growing team in Lisbon – Portugal... Do you love social media? Are you a fan of vlogging or constantly looking to videos to help solve your challenges? If the answer is yes, then you must start your career with a global company working......

Video Content Analyst with Norwegian speakers
Wow business consulting srl

Your day to day: review user flagging reports regarding website content; understand and remain up-to-date with client’s policies and guidelines; analyze and identify content that is not in compliance with requirements and flag it for action in a timely manner; review the reported content within......

Video content analyst (m,f) german or dutch
Personalbüro u. herrmann

Start: asap duties and responsibilities: review user reports regarding website content daily content compliance monitoring and corrective measures application make well-balanced decisions and help resolve inquiries to defined policies and procedures propose solutions to improve the support of user community......

Content moderator (m,f) social media german
Personalbüro u. herrmann

Tasks: · review user reports regarding video content on a streaming platform review sensitive content, sometimes involving graphic or disturbing subject matter audit and update content maintain broad knowledge of client products and/or services make well-balanced decisions and help resolve inquiries......

Content Moderator (m,f) Greek or Slovak or Latvian
Personalbüro U. Herrmann

Review graphic content that has been received from users to ensure that it complies with the community guidelines review, classify and / or eliminate highly sensitive content comply with instructions, procedures related and complementary to the role comply with corporate confidentiality policies and......

Content Moderation - German Speaker (Porto)
Gi Group

We are recruiting a content moderator (m/f/d) for the German market, for one of the largest social media platforms... Your day to day: • reviewing online videos/content/complaints/legal notices received from the end customer on any incorrect decisions taken related to their copyright work/material •......

Ukrainian Speaker - Content Moderation - Lisbon
SmartRecruitments

Join a leading outsourcing company specializing in providing top-notch customer/technical support and content moderation services and become an integral part of our team of experts! responsibilities: screens user-generated content for online platforms including text, images, video, and more; reviewing......

Polish speaker Content moderator for Social Media
SmartRecruitments

Your profile: native level of written and verbal communication skills in Polish (mandatory); natural fast learner so you can develop your skills within a short period of time; fluency in English (minimum level B2); empathic; motivated and with a positive attitude; attention to detail; experience in dealing......