RNN-TRANSDUCER MODEL FOR NESTED ENTITY RECOGNITION IN BIOMEDICAL LITERATURE
Descrição da oferta de emprego
This project involves the development and training of a Recurrent Neural Network-Transducer (RNN-T) model for the challenging task of Nested Entity Recognition (NER) in biomedical literature.
Nested NER aims to identify and classify multiple overlapping entities within a given text, such as genes, proteins, diseases, and drugs.
Key Responsibilities.
Data Preparation.
Data Collection.
Gather and curate a high-quality dataset of biomedical literature annotated with nested entity labels.
Data Preprocessing.
Clean and preprocess the text data, including tokenization, sentence splitting, and handling special characters.
Implement appropriate data augmentation techniques to enhance model robustness and prevent overfitting.
Create suitable input and output representations for the RNN-T model, considering factors like character-level or word-level embeddings.
Model Development.
Architecture.
Design and implement an RNN-T architecture suitable for nested entity recognition, potentially incorporating.
Encoder.
Bidirectional LSTMs or GRUs for capturing contextual information.
Prediction Network.
A connectionist temporal classification (CTC) or similar approach for joint acoustic and language modeling.
Attention Mechanisms.
To improve focus on relevant parts of the input sequence.
Training.
Train the model using an appropriate optimization algorithm (e.
., Adam) and loss function (e.
., Connectionist Temporal Classification loss).
Hyperparameter Tuning.
Conduct thorough hyperparameter tuning (e.
., learning rate, dropout rate, hidden layer sizes) to optimize model performance.
Evaluation.
Metrics.
Evaluate model performance using relevant metrics for nested NER, such as.
F1-score.
For overall entity recognition accuracy.
Precision and Recall.
For individual entity types.
Exact Match.
To assess the accuracy of complete entity spans.
Analysis.
Analyze model performance, identify areas for improvement, and generate insights into the challenges of nested NER in biomedical literature.
Documentation.
Code Documentation.
Provide clear and concise documentation for all code, including comments, docstrings, and README files.
Project Report.
Prepare a comprehensive report summarizing the project methodology, results, and findings.
Required Skills.
Strong proficiency in Python and deep learning frameworks (e.
., TensorFlow, PyTorch).
Solid understanding of Natural Language Processing (NLP) concepts, including tokenization, word embeddings, and sequence-to-sequence models.
Experience with RNN-T models or similar sequence-to-sequence architectures.
Familiarity with nested entity recognition and its challenges.
Experience with data preprocessing, feature engineering, and model evaluation.
Excellent communication and documentation skills.
Deliverables.
Trained RNN-T model.
A well-trained and optimized model for nested entity recognition in biomedical literature.
Code.
Clean, well-documented, and reproducible code for all stages of the project.
Data.
Preprocessed and annotated datasets used for model training and evaluation.
Project Report.
A comprehensive report detailing the project methodology, results, and findings.
Deep Learning NLP Tokenization Python Pytorch Tensorflow ID do Projeto.
# Sobre o projeto 4 propostas Aberto para ofertas Projeto remoto Ativo em 20 minutos atrás
Detalhes da oferta
- Indeterminado
- Em todo Portugal
- Indeterminado - Indeterminado
- 03/01/2025
- 03/04/2025
Benefits: • apartment accommodation can be provided for the initial quarter... • minimum 3 years of professional nursing experience for candidates without a diploma... they are in search of empathetic and proficient nursing staff with expertise in nursing or medicine, capable of offering thorough care......
The candidates book their own transportation and we will reimburse them fully (up to 700€ for external relocators and up to 150€ for internals)... job description:you will be a single point of contact for the bank's clients for different types of inquiriesyou are able to manage all different types of......
For employment beyond two years, a permanent contract may be offered... employment term: initial contracts of 12 months, extendable for another 12 months... accommodation benefit: for those staying in designated apartments, this benefit is tax-free... meal allowance: preloaded debit card for tax-free......
Develop backend applications for aws using java, kotlin or typescript... we are looking for highly skilled aws developer to join pixida with a hybrid working model in the porto area... be responsible for the creation and maintenance of services and ci/cd pipelines... requisitos do trabalho have experience......
As you can see, there's a lot for you to do here... you'll also be part of working for the country's leading centre for rare + complex conditions, along with 1 of the largest transplant centres... want to help write healthcare history? the cambridge biomedical campus, the largest medical research and......
Job description:you will be a single point of contact for the bank's clients for different types of inquiriesyou are able to manage all different types of inquiries generated via inbound activities such as chat and email... start: asap banking project: german on-siterole: as customer service agent (m/f/d)......
Job description:you will be a single point of contact for the bank's clients for different types of inquiriesyou are able to manage all different types of inquiries generated via inbound activities such as chat and email... banking project: german on-site start: 13 th december or laterrole: as customer......
They are recognized for their innovation in kitchen and home solutions, providing a wide range of products to enhance daily living... this offer is a great opportunity for anyone who is looking for a new challenge & working in the center of lisbon! ✅ as a customer support you will: after sales support......
Pm (hybrid)- location: lisboa for apply, send your cv for *****@***** with the reference 'records'... fórum selecção is looking for a records management office (m/f) for a corporate bank main activities: - maintain and update rmo (records management office) policy, procedures and retention schedule......
We’re looking for new team members with the react... for more information you can consult our privacy policy we will contact you only if your profile is selected for the next stage of recruitment... net: mvc, web api, entity framework core; webservices (soap, rest, xml); mvc or web api; react......