
Simson Chiu

Full Stack Data Engineer

I'm a full stack data engineer who loves building useful products.

Personal Projects

LocoStudio
Privacy-focused, offline-first AI chat for desktop. I now use this every day as my main chat app. My favorite feature is chatting with YouTube videos.

Tech Stack

Electron, Vite, React, TypeScript, Tailwind CSS, Dexie.js

Kidzbook
Make picture books with AI on the web and iOS. I built this for my daughter to have fun with AI, and to learn full-stack development.

Tech Stack

Next.js, React, Tailwind CSS, TypeScript, React Native, Expo, PostgreSQL, Google Cloud Run, Cloud Storage, Cloud Build, Cloud CDN, Docker

GrowBites
Food journal app for parents. I built this for my wife to help her track our daughter's food allergy reactions.

Tech Stack

React Native, Expo, TypeScript, SQLite

FocusBlock
Block distracting websites without getting in the way of your work. This was the first project I deployed into the wild. I still use it every day to stay focused while I work.

Tech Stack

JavaScript, HTML, CSS

Work Experience

Google

New York City, NY
Data Engineer
Sep 2020 - May 2024

Skills & Technologies

Data Pipelines, Data Visualization, Data Migration, SQL, JavaScript, BigQuery, Airflow, Teradata, SQL Server
  • Developed data pipelines and dashboards that provide visibility into Google Ads and YouTube business metrics, following best practices for Google's data warehouse infrastructure and tooling.
  • Created a unique solution for extracting and cleaning data from Google Sheets and loading it into the internal data warehouse using Google Apps Script (JavaScript), SQL, and internal tools, saving 90+ person-hours every month.
  • Helped customers migrate their data warehouses into Google Cloud by advising on data architecture, migration strategy, data security, and code development.
  • Created pipelines for migrating TBs of data from on-prem databases (Teradata, Oracle, SQL Server) to BigQuery.
  • Led technical architecture discussions with the development team and prioritized work for each sprint.

Instagram

Menlo Park, CA
Data Engineer, Analytics
Jun 2018 - Sep 2020

Skills & Technologies

Data Pipelines, Data Visualization, SQL, Python, Presto, Spark SQL, Hive
  • Designed end-to-end data solutions, from logging and pipelines to dashboards and metrics, for monitoring and measuring the effectiveness of security and integrity products on Instagram.
  • Took ownership of the data warehouse, creating data quality alerts and optimizing pipeline performance using SQL, Python, Presto, Spark, Hive, and internal tools.
  • Collaborated with data scientists, software engineers, and product managers to understand data needs, communicate timelines, create metrics, and influence product decisions.

Advantage Solutions

El Segundo, CA
Senior Data Engineer
Feb 2016 - May 2018

Skills & Technologies

Data Warehousing, Data Pipelines, T-SQL, SSIS, SSAS, Power BI, C#, DevOps, PowerShell
  • Developed a data warehouse from the ground up that integrated and processed billions of rows of sales, marketing, and product data from the largest grocery retailers in the US using SSIS, T-SQL, and C#.
  • Architected and created SSAS Tabular models and trained users to work with them in Power BI and Excel.
  • Improved DevOps practices using continuous integration with PowerShell deployments.

Principal Development Group Consulting

Los Angeles, CA
Business Intelligence Developer
Aug 2012 - Jan 2016

Skills & Technologies

Business Intelligence, OLAP, SQL Server, Oracle, .NET, ETL, SSIS, SSAS, SSRS, T-SQL
  • Sony Pictures Entertainment - Business Intelligence Developer for the International TV Distribution Department
      • Designed a finance reporting OLAP cube for revenue and sales of TV distribution, integrating multiple back-end technologies including SQL Server, Oracle, and .NET.
      • Created a data mart and SSIS ETL packages for automated processing and partitioning of SSAS models, delivering near-real-time data refreshed every 15 minutes.
  • CBS Corporation - Business Intelligence Developer for CBS's sales planning and financial systems
      • Developed complex reports for finance and intellectual property rights using SSRS and T-SQL stored procedures, functions, and views.