Software Engineer (Ray Data)

Anyscale commercializes Ray, an open-source project creating an ecosystem of libraries for scalable machine learning.
$170,112 - $237,000
Data
Mid-Level Software Engineer
In-Person
2+ years of experience
AI · Enterprise SaaS

Description For Software Engineer (Ray Data)

Anyscale, backed by prominent investors with $250+ million in funding, is revolutionizing distributed computing through Ray, its open-source project. The company is seeking a Software Engineer for its Ray Data team to work on the Datasets library, which is crucial for machine learning pipelines and production use cases at major companies like Amazon and Alibaba.

The role involves developing and maintaining the Ray Datasets library, built on Apache Arrow and Ray Core. You'll work on performance optimization, ML training integration, stability testing, and streaming workload integration. The position requires expertise in distributed systems, data processing, and database internals.

This is an exciting opportunity to join a team that's making distributed computing accessible to developers of all skill levels. You'll contribute to open-source software used by industry leaders like OpenAI, Uber, and Spotify. The role offers competitive compensation ($170,112-$237,000) and comprehensive benefits including equity, healthcare, and education stipends.

The ideal candidate should have at least 2 years of experience, a strong algorithmic background, and expertise in scalable systems. You'll be working in either San Francisco or Palo Alto, contributing to projects that directly impact the efficiency and accessibility of machine learning applications.

Working at Anyscale means being at the forefront of AI infrastructure development, with the opportunity to shape how distributed computing evolves. The company culture values technical excellence, innovation, and effective communication, as evidenced by the expectation to share knowledge through talks and blog posts.

Last updated a month ago

Responsibilities For Software Engineer (Ray Data)

  • Develop high quality open source software to simplify distributed programming (Ray)
  • Identify, implement, and evaluate architectural improvements to Ray core and Datasets
  • Improve the testing process for Ray to make releases smooth
  • Communicate work through talks, tutorials, and blog posts
  • Performance optimization of Ray Datasets at large scale
  • Integration with ML training and data sources
  • Lead future work integrating streaming workloads into Ray
  • Differentiate Data operations in the Anyscale-hosted Ray service

Requirements For Software Engineer (Ray Data)

  • Python
  • At least 2 years of relevant work experience
  • Solid background in algorithms, data structures, system design
  • Experience in building scalable and fault-tolerant distributed systems
  • Experience with data processing and database internals, including systems such as Spark or Dask (streaming experience is a plus)

Benefits For Software Engineer (Ray Data)

  • Stock Options
  • Healthcare plans covered 99% by Anyscale
  • 401k Retirement Plan
  • Education & Wellbeing Stipend
  • Paid Parental Leave
  • Fertility Benefits
  • Flexible Time Off
  • Commute reimbursement
  • 100% of in-office meals covered
