10025 - Sr. Big Data Engineer
10025 – Sr. Big Data Engineer
Job Summary
Design, build, and maintain the Information/Proposed Changes platform that enables large-scale data processing and analysis. Responsible for developing and maintaining data pipelines, data lakes, and other data-related platform. Work closely with data scientists, analysts, and other stakeholders to ensure that data is properly collected, stored, and processed for analysis and reporting purposes. Implement and maintain data security and access controls to ensure that data is protected. Responsible for troubleshooting and resolving technical issues related to data infrastructure and ensuring that data systems are scalable and efficient. Play a critical role in enabling organization to derive insights and value from their data assets.
ESSENTIAL FUNCTIONS
- This job requires experience in building and maintaining scalable data pipelines and robust data models from structured and unstructured sources for AI/ML.
- The ideal candidate should have advanced SQL skills and be able to query and transform large structured/unstructured datasets using Spark/PySpark, Spark SQL/Hive and Hive/NoSQL.
- They should also have experience in developing Big Data pipelines in orchestration tools such as Airflow and Oozie, designing tooling for access management, monitoring, data controls, and self-service ETL/Analytics pipelines.
- Other requirements include hands-on experience with On-Prem Big Data Platform, sound knowledge of Distributed Data Processing frameworks, resource management frameworks like YARN, and proficiency in writing data pipelines using Spark, Python and Scala.
- The ideal candidate should also have experience in developing frameworks/utilities in Python, working in a Dev/Ops environment, and following development best practices such as code reviews and unit testing.
- Additionally, the candidate should be able to diagnose software issues and engineering workarounds, have a good understanding of BI tools such as Tableau/Power BI and MicroStrategy for Big Data, and be able to lead, guide and assist team members with project development and problem solving.
- The candidate should also be flexible and able to learn and use new technologies, work well in a team environment as well as independently to achieve goals.
Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities, and activities may change at any time with or without notice.
Required Skills, Attributes & Education
- Bachelor’s degree OR equivalent (with major course work in computer science) preferred
- 10+ years of IT experience working with hands on working experience, in software development, building data pipelines and data processing frameworks.
- 7+ years of experience as a Data Engineer
- Big Data Technologies: Knowledge of Big Data technologies such as Hadoop, Spark, Hive, Pig, Kafka, and NoSQL databases such as MongoDB, Cassandra, and HBase.
- Distributed Systems: Understanding of distributed systems and distributed computing principles.
- Programming Languages: Proficiency in programming languages such as Java, Python, Scala, and SQL.
- Data Modeling: Knowledge of data modeling techniques and tools to design efficient data structures for Big Data systems.
- Data Processing: Experience with data processing and ETL (Extract, Transform, Load) tools and techniques.
- Cloud Computing: Familiarity with cloud computing platforms such as AWS, Azure, and Google Cloud.
- Data Security: Knowledge of data security principles and experience implementing security measures for Big Data systems.
- Data Warehousing: Understanding of data warehousing concepts and experience designing and maintaining data warehouses.
- Analytics and Machine Learning: Familiarity with analytics and machine learning tools and techniques and their implementation in Big Data systems.
- Performance Tuning: Experience with performance tuning and optimization techniques for Big Data systems to ensure scalability, reliability, and high availability.
Certifications
Cloudera/Hortonworks/Databricks - Spark/Hadoop certification
Salary Range: $103,170 to $158,873
Recommended Jobs
SQL Database Administrator - PST Only
Top Skills' Details 1. SQL 2. Snowflake 3. MongoDB Description Job Summary We are seeking a skilled and proactive Database Administrator (DBA) to manage, maintain, and secure our orga…
National Organizer
California Nurses Association National Nurses United National Organizer Based out of Glendale, CA Join one of the most effective organizing teams in today’s labor movement. Bring your skil…
Purchasing Specialist
Job Description Job Description Crystal Stairs, Inc. Improving the Lives of Families through Child Care Services, Research, and Advocacy Crystal Stairs is committed to building and susta…
Equity Underwriter
Description Position at loanDepot Our mission is simple: to make our customers’ home finance, lending and home services transactions simple, easy and innovative. We lean into what’s about to be …
Direct Support Professional
NCI Affiliates and Achievement House, Inc. are local, non-profit organizations with a history of serving individuals with developmental disabilities. For over 70 years, we have been serving the commu…
Urgent Care APP | Great Schedule | $150K+ | Nevada - Reno-Tahoe | No State Income Tax
Join Our Growing Urgent Care Team! Are you an experienced Nurse Practitioner (NP ) or Physician Assistant/Physician Associate (PA) looking for a flexible schedule and competitive compensation …
Manager, Audience Insights
The Manager of Audience Insights, a digitally and technologically adept professional, will play a pivotal role in the success of the media network by providing actionable insights that shape programm…
Human Resources Business Partner Manager (Business Development)
CEVA Logistics provides global supply chain solutions to connect people, products, and providers all around the world. Present in 170+ countries and with more than 110,000 employees spread over 1,500…
Physical Therapist - Inpatient Part-time
Overview: Physical Therapist: Part-time Hospital Inpatien t "Interstate has provided me with the foundation and support to grow not only as a therapist but as a leader in my field. From mentorsh…
Full Time Primary Care Physician Job San Fernando, CA
The Inline Group - Per Diem Hours:Flexible Schedule | Per deim with possible opportunity for full-time Employed New Graduates Average Patients seen: 16-24 Call Schedule: Very Light |…