Share this Job

Data Scientist

Date:  Mar 13, 2023

Remote, MA, US, Remote

Onsite or Remote:  Remote
Company Name:  EBSCO Information Services

EBSCO Information Services (EIS) provides a complete and optimized research solution comprised of e-journals, e-books, and research databases - all combined with the most powerful discovery service to support the information needs and maximize the research experience of our end-users. Headquartered in Ipswich, MA, EIS employs more than 2,700 people worldwide, most now working hybrid or remotely. We are the leader in our field due to our cutting-edge technology, forward-thinking philosophy, and outstanding team. EIS is a company that will motivate you, inspire you, and allow you to grow. Our mission is to transform lives by providing relevant and reliable information when, where, and how people need it. We are looking for bright and creative individuals whose unique differences will allow us to achieve this inclusive mission around the world.


This role can be performed 100% remotely. After initial onboarding, there may be optional opportunities to travel to Ipswich MA for Agile planning sessions, in-person training, or development workshops. 


You are a data scientist who wants to work in the areas of machine learning, natural language processing, information extraction, graphical models, summarization, information retrieval, recommender systems, and/or knowledge graphs. You will bring new ideas, iterate quickly, share findings, and shepherd the adoption of solutions within the team. You extract and identify relevant, meaningful, and actionable information from structured and unstructured data in real-time and provide advanced ways of accessing this data (such as search, summarization, and recommendations). The data assets of EIS are vast, and your talent and experience can be brought to bear on state-of-the-art challenges.


Primary Responsibilities:

  • Proactively explore, gather, clean, describe, and analyze data.
  • Develop tools and strategies related to text analytics, machine learning, and semantic enrichment.
  • Develop and maintain strong working relationships across departments and with key partners and stakeholders.
  • Continuously improving the text data infrastructure and processes through testing, experimentation, and the adoption of new technologies and techniques.
  • Learn cutting-edge research in advanced ML & NLP topics and devise an efficient application for projects.
  • Collaborate closely with domain experts on project ideas, anticipate team needs, and translate their needs and requirements into technical designs and implementations
  • Drive, design, and develop projects as the principal point-of-contact, with the ability to determine suitable ML models, direct feature engineering processes and negotiate KPIs per business needs.
  • Participate in special projects and perform other duties as assigned.


Required Qualifications:

  • Bachelor's degree in computer science, mathematics, or related technical field or equivalent experience.
  • Demonstrated experience with natural language processing, machine learning model design, implementation, and data infrastructure.
  • Fluency in Python for data science, including Pandas, Jupyter notebooks, and open-source machine learning modules (Scikit, NLTK, Spacy).
  • 2+ years' experience with data processing, including extraction, transformation, and loading (ETL) of large data sets from unstructured and semi-structured data (plaintext, PDF, JSON, XML)
  • 2+ years' experience with database querying, extraction of data, and design using SQL.
  • 1+ years' experience with distributed computing (Docker, Spark, Kubernetes).
  • Knowledge of statistics and information retrieval quality measures, including regression, hypothesis testing, precision, recall, f-measure, and AUC-ROC.
  • Demonstrated ability to share insights from data with others, listen to subject matter experts, translate expert knowledge into ML features, and connect with project stakeholders.


Preferred Qualifications:

  • Advanced degree in a science, technology, or engineering discipline.
  • Advanced Python skills, including object-oriented programming, unit testing, and scripts in production.
  • Advanced machine learning skills, including deep learning, large language models, model architectures, parameters, and ensemble modeling.
  • Experience of automating workflows based on manual, intellectual tasks
  • Familiarity with PyTorch/Keras/TensorFlow
  • Knowledge of MLOps processes including versioning, experimentation, deployment, and quality review
  • Knowledge of Lean-Agile/SAFe project management methodologies
  • Familiarity with search engine approaches (Solr, Elasticsearch, Lucene)
  • Understanding publishing industry perspectives
  • Understanding needs and processes of libraries


Target Annual Salary Range: $78,150 - $111,640.  The actual salary offer will carefully consider a wide range of factors including your skills, qualifications, education, training, and experience, as well as the position’s work location. EBSCO provides a generous benefits program including medical, dental, vision, life and disability insurance, flexible spending accounts, a retirement savings plan, paid parental leave, holidays and paid time off (PTO), as well as tuition reimbursement. View more about EBSCO’s benefits here:




We are an equal opportunity employer and comply with all applicable federal, state, and local fair employment practices laws. We strictly prohibit and do not tolerate discrimination against employees, applicants, or any other covered persons because of race, color, sex, pregnancy status, age, national origin or ancestry, ethnicity, religion, creed, sexual orientation, gender identity, status as a veteran, and basis of disability or any other federal, state or local protected class. This policy applies to all terms and conditions of employment, including, but not limited to, hiring, training, promotion, discipline, compensation, benefits, and termination of employment. We comply with the Americans with Disabilities Act (ADA), as amended by the ADA Amendments Act, and all applicable state or local law.

Job Segment: Open Source, Computer Science, Database, Developer, SQL, Technology