
Data Engineer II (Python and MongoDB)

Recorded Future is the world's most advanced and largest intelligence company, with 1,000 intelligence professionals, over $300M in sales, and over 1,800 clients worldwide.
Data
Mid-Level Software Engineer
In-Person
2+ years of experience

Description For Data Engineer II (Python and MongoDB)

Recorded Future, the world's most advanced and largest intelligence company, is seeking a Data Engineer II to join its team. With 1,000 intelligence professionals, over $300M in sales, and over 1,800 clients worldwide, Recorded Future is at the forefront of cyber intelligence.

As a Data Engineer II, you'll be part of a motivated team building reliable data pipelines that ingest raw data and convert it into actionable analytics to identify and combat cyber security threats. You'll work with data from various public web sources and structured data feeds, including public forums, social media sites, and digital marketplaces.

Your responsibilities will include leading the development and maintenance of high-volume data extraction systems, creating well-monitored production data pipelines, and collaborating with both junior and senior engineers through design reviews, code reviews, and pair programming.

The ideal candidate should have at least 2 years of Python programming experience, strong data skills, and a deep understanding of data stores like MongoDB, ElasticSearch, S3, and SQL. You should be comfortable working with large, complex data structures and have experience in pipeline software development and ETL processes.

This role offers the opportunity to work with cutting-edge technology in the cyber intelligence field, contributing to the security efforts of numerous global clients. Join Recorded Future and be part of a diverse team of "Futurists" representing over 40 nationalities, embodying core values of high standards, inclusion, and ethical practices.

Recorded Future offers a collaborative work environment, the chance to work on impactful projects, and the opportunity to grow your skills in data engineering and cyber intelligence. If you're passionate about data and cyber security and want to make a difference in combating global cyber threats, this role could be the perfect fit for you.

Last updated 10 months ago

Responsibilities For Data Engineer II (Python and MongoDB)

  • Lead the development, productionization, and maintenance of high-volume data extraction via custom harvesters, APIs, and data stores
  • Create well-monitored production data pipelines that convert raw data into well-organized, actionable information
  • Maintain and upgrade existing production data flow systems
  • Collaborate with junior and senior engineers through design reviews, code reviews, pair programming, and other software engineering best practices

Requirements For Data Engineer II (Python and MongoDB)

Python
MongoDB
  • 2+ years of Python programming: comfortable writing production-grade data processing infrastructure in Python, with strong debugging skills
  • Data skills: comfortable working with large, complex data structures; experience transforming large datasets in Python, querying modern data stores such as MongoDB and ElasticSearch, and developing corresponding APIs
  • Well versed in both computer programming and data engineering
  • Pipeline software development: writing durable ETL programs and systematically sending data to and retrieving data from APIs
  • Strong understanding of data stores (MongoDB, ElasticSearch, S3, and SQL) and their different use cases
  • Programmatic loading of high-volume structured data using Python data science libraries while working within limited compute resources
  • Developing pipeline monitoring in Python for both new and existing applications
