Data Scientist Bioinformatics

Houston, United States

 Full Time

Job Details

The primary purpose of the Data Scientist is to leverage advances in next-generation sequencing data to pioneer the discovery and development of groundbreaking therapeutics for cancer patients. This role revolves around innovating and refining sophisticated pipelines and methodologies for analyzing intricate genetic information at the single-cell level. By developing cutting-edge computational tools and algorithms, the Data Scientist will lead the charge in accelerating scientific breakthroughs. This integral contribution will drive the creation of novel therapies and diagnostics that can transform patient care and health outcomes.

Led by Prof. Bissan Al-Lazikani, Director of Therapeutics Data Science, the intelligent and ever-learning A3D3a platform is part of the new initiative in Therapeutics Data Science and part of our ambitious Institute for Data Science in Oncology at MD Anderson. Through the development of novel machine learning and AI technologies, A3D3a will accelerate the discovery and impact of novel cancer therapies by enabling optimized therapies for patients, with a focus on rare and hard-to-treat cancers.

JOB SPECIFIC COMPETENCIES

- Carry out preparation, clean-up, and quality control of biological data, including scRNA-Seq, scATAC-Seq, spatial transcriptomics and/or other multi-dimensional omic data modalities.
- Develop and maintain pipelines for bioinformatics and statistical analyses of the aforementioned data types; activities include handling raw data, evaluating outputs, optimizing parameters, and summarizing findings.
- Keep abreast of advancements in single-cell sequencing and other new technologies and data analysis techniques.
- Stay engaged with the scientific community to identify emerging technologies and methodologies.
- Actively collaborate with interdisciplinary teams to design experiments, understand data generation protocols, and optimize analytical workflows accordingly.
- Rigorously validate newly developed methods using benchmark datasets and simulated data to assess their accuracy, sensitivity, specificity, and scalability.
- Present results at multidisciplinary project meetings.
- Contribute to open-source projects and publish findings in peer-reviewed journals to share insights, methodologies, and tools with the wider scientific community.
- Prepare written reports, manuscripts, and grant applications with investigators.
- Work closely with the team and collaborators to discover novel therapeutic opportunities for cancer patients.

Expected Skills

- Deep knowledge of bioinformatics tools and their implementation as part of pipelines, particularly for scRNA-Seq, scATAC-Seq, spatial transcriptomics and/or other multi-dimensional omic data modalities.
- Advanced knowledge of statistical methods and data analysis techniques relevant to single-cell genomics, including differential expression analysis, clustering, dimensionality reduction, trajectory inference, and data integration.
- Proficiency in machine learning techniques for analyzing high-dimensional single-cell data, such as supervised and unsupervised learning algorithms.
- Experience addressing challenges in bioinformatics, such as bias and batch effects, and applying mitigation strategies such as batch correction.
- Experience using High Performance Computing to run large-scale analyses.
- Strong programming skills in languages commonly used in bioinformatics and data science, such as Python and R.
- Ability to write efficient, modular, and maintainable code for data manipulation, analysis, and visualization.
- Experience with code version control systems (e.g., GitLab and GitHub).

Other duties as assigned.

COMPETENCIES

With Inclusion, you understand that your ideas and contributions are valued. You promote the same for others. You address your own biases while promoting diversity and equity. (Competencies: Cultural Humility, Cultural Awareness, Cultural Intelligence)

With Drive, you see that you can serve as a leader whether you have a formal leadership role or not. You tackle problems, move past setbacks and hardships, and don't lose sight of your goals. (Competencies: Self-Confidence, Analytical Thinking, Innovative Thinking, Technical Expertise)

You demonstrate Professionalism by setting the example for others and consistently modeling MD Anderson's values and service standards. You communicate effectively in a variety of ways. (Competencies: Inspire Trust, Oral Communication, Written Communication)

Through Emotional Intelligence, you maintain awareness of your own emotions and the emotions of those around you. You use nonverbal cues and feelings to engage others in an inclusive and responsive way. (Competencies: Active Listening, Teaming, Self-Reflection)

Having Coachability means you are engaged in relentless learning. You constantly ask questions and stay curious. You understand that the organization constantly evolves, and you should as well. (Competencies: Develop Oneself, Adaptability)

Working Conditions

Laboratory environment

This position requires:

- Working in Office Environment: Yes
- Working in Patient Care Unit (e.g., nursing unit; outpatient clinic): No
- Exposure to human/animal blood, body fluids, or tissues: No
- Exposure to harmful chemicals: No
- Exposure to radiation: No

Physical Demands

Indicate the time required to do each of the following physical demands:

Time Spent: Never (0%), Occasionally (1-33%), Frequently (34-66%), Continuously (67-100%)

- Standing: X
- Walking: X
- Sitting: X
- Reaching: X
- Lifting/Carrying, up to 10 lbs.: X
- Lifting/Carrying, 10 lbs. to 50 lbs.: X
- Lifting/Carrying, more than 50 lbs.: X
- Pushing/Pulling, up to 10 lbs.: X
- Pushing/Pulling, 10 lbs. to 50 lbs.: X
- Pushing/Pulling, more than 50 lbs.: X
- Use computer/keyboard: X

EDUCATION:

Required: Bachelor's degree in Biomedical Engineering, Electrical Engineering, Computer Engineering, Physics, Applied Mathematics, Science, Engineering, Computer Science, Statistics, Computational Biology, or related field.

Preferred: PhD in Biomedical Engineering, Electrical Engineering, Computer Engineering, Physics, Applied Mathematics, Science, Engineering, Computer Science, Statistics, Computational Biology, or related field.

EXPERIENCE:

Required: Three years of experience in scientific software or industry development/analysis. With a Master's degree, one year of experience required. With a PhD, no experience required.

Preferred: Single cell sequencing, next generation sequencing, publications.

It is the policy of The University of Texas MD Anderson Cancer Center to provide equal employment opportunity without regard to race, color, religion, age, national origin, sex, gender, sexual orientation, gender identity/expression, disability, protected veteran status, genetic information, or any other basis protected by institutional policy or by federal, state or local laws unless such distinction is required by law.

Additional Information

- Requisition ID: 167724
- Employment Status: Full-Time
- Employee Status: Regular
- Work Week: Days
- Minimum Salary: US Dollar (USD) 103,000
- Midpoint Salary: US Dollar (USD) 129,000
- Maximum Salary: US Dollar (USD) 155,000
- FLSA: Exempt and not eligible for overtime pay
- Fund Type: Soft
- Work Location: Hybrid Onsite/Remote
- Pivotal Position: Yes
- Referral Bonus Available?: Yes
- Relocation Assistance Available?: Yes
- Science Jobs: Yes