Design and develop data pipelines and etl processes using apache airflow and google cloud dataflow
extract and analyze large datasets from relational databases (bigquery, redshift, oracle, teradata) using sql
perform data analysis using python and sql with advanced statistical methods
create data visualizations using tableau, power bi, or r-shiny
conduct hypothesis testing and a/b testing for business insights
use git for version control and collaborative development
optimize database queries and create views across multiple tables
report analytical findings to business stakeholders
skills and qualifications:
* 5+ years of experience in data analysis
* advanced english
* proficiency in python and sql
* experience with apache airflow for workflow orchestration
* experience with google cloud dataflow and etl processes
* experience with relational databases (bigquery, redshift, oracle, or teradata)
* proficiency with data visualization tools (tableau, power bi, or r-shiny)
* experience with git version control
* knowledge of statistical analysis and hypothesis testing
* bachelor's degree in quantitative field (statistics, computer science, data science, or related)
#j-18808-ljbffr