Remote Data Science Engineer

Tatiana Barysheva
Work arrangement:
Remote only
Developer tools
Company size:
2-10 employees

Archipelo is building an intelligent code discovery platform that gives developers the best tools to discover code in any form and to benefit through insights, recognition, and greater productivity. They are transforming code search to improve the practice of modern programming, using a graph-based approach that draws on data from the entire open source ecosystem. They're on a mission to build the world's best code discovery engine. Archipelo is well-funded by top investors in Silicon Valley, including the first investors of Google, Twitter, Zoom, LinkedIn, and Uber. Their team has backgrounds from NASA, LinkedIn, Facebook, Amazon, AWS, and Cisco, as well as MIT, Harvard, Stanford, and Berkeley.
Right now, they are seeking a Senior Data Science Engineer to lead technology development on the frontier of code discovery and developer productivity.

Responsibilities:

  • Develop, maintain and test distributed systems that collect and aggregate large amounts of data
  • Help build the infrastructure required to reliably and securely transport data from a wide variety of sources
  • Lead ETL effort with data team, employing weak and distant supervision for labeling
  • Perform exploratory data analysis, and create compelling data visualizations
  • Maintain and evolve the pipeline APIs that the entire organization relies upon
  • Establish, promote, and coach engineering teams on technical best practices
  • Establish technical standards around design, software reliability and quality
  • Stay on the cutting edge of emerging technology, forging key relationships with peers at public and private institutions
  • Generate new ideas for data-driven product features and for optimizing search relevance
  • Own our technical roadmap and architecture
  • Code every day

Must have skills:

  • Advanced working knowledge of information retrieval and search technologies
  • Extensive knowledge of OSS tools and active participation in OSS community
  • Experience building complex software outside of frameworks or existing infrastructure
  • Knowledge of security operations best practices
  • Elasticsearch, Solr, or equivalent experience
  • Significant experience with Python
  • Ego maturity with first-rate interpersonal skills
  • Minimum 10 years of professional data science or software engineering experience

Nice to have skills:

  • Graduate degree or equivalent experience in computer science/engineering, operations research, industrial engineering, physics, mathematics, statistics, or other related technical fields
  • Extensive experience optimizing search relevance
  • Experience as an internal technical leader within a startup engineering organization
  • Experience with enterprise architecture and deployment on premises
  • Experience in platform development to solve complex problems at scale
  • Experience working with large, semi-structured data sets and databases
  • Experience with distributed software development and systems
  • Polyglot programming experience across multiple languages, or the ability to learn new languages quickly

Position closed.