Logo of Archipelo

Remote Data Scientist

Photo of Nikita Tsibulsky
Nikita Tsibulsky
Must have skills:
One of skills:
Nice to have skills:
Considering candidates from:
Work arrangement:
Remote only
Developer tools
Senior or lead
Company size:
2-10 employees
Logo of Archipelo

Remote Data Scientist

Archipelo is building an intelligent code discovery platform that provides the best tools for developers to discover code in any form—and benefit through insights, recognition, and greater productivity. They are transforming code search to improve the practice of modern programming—using a graph-based approach drawing on data from the entire open source ecosystem. They're on a mission to build the world's best code discovery engine. Archipelo is well-funded by top investors in Silicon Valley, including the first investors of Google, Twitter, Zoom, LinkedIn, and Uber. Their team has backgrounds from NASA, LinkedIn, Facebook, Amazon, AWS, Cisco and MIT, Harvard, Stanford, and Berkeley.
Right now, they are seeking a Senior Data Scientist to lead technology development on the frontier of code discovery and developer productivity. A successful applicant is an expert in data science, machine learning, and complex data analysis spanning natural language, code syntax and networks.

Depending on your skills you will either focus on GNN (graph neural networks) OR Code Representation.

Must-have skills:

  • Knowledge of microservices and cloud computing—expert in at least one cloud platform
  • Familiar with distributed systems and the orchestration of large numbers of independent commodity machines into complete, functional systems to handle diverse workloads
  • Experience building and maintaining graph neural networks (GNN) OR Formal experience with knowledge representation
  • Experience with NLP & NLU
  • Expertise performing data science research
  • Expertise writing world-class Python code
  • Experience coding in Go

Nice-to-have skills:

  • PhD in computer science, artificial intelligence, machine learning or related technical field
  • 10+ years of professional data science or software engineering experience
  • Advanced working knowledge of information retrieval and search technologies and have set up and used open-source search systems to query and understand data
  • Experience with many of the following technologies:
    • ElasticSearch, Solr and equivalent 
    • Kubernetes
    • Machine learning infrastructure
    • Deep learning
    • Relevance engineering
    • CircleCI, GitHub Actions, Jenkins or equivalent
    • Any graph database

Position closed, but we can still help

Check out our current open positions