The Evolving Role of the Data Engineer
豆瓣
Change and Continuity in Data Practices
Andy Oram
简介
Companies working to become data driven often view data scientists as heroes, but that overlooks the vital role that data engineers play in the process. While data scientists focus on finding new insights from datasets, data engineers deal with preparation—obtaining, cleaning, and creating enhanced versions of the data an organization needs. In this report, Andy Oram examines how the role of data engineer has quickly evolved.
DBAs, software engineers, developers, and students will explore the responsibilities of modern data engineers and the skills and tools necessary to do the job. You’ll learn how to deal with software engineering concepts such as rapid and continuous development, automation and orchestration, modularity, and traceability. Decision makers considering a move to the cloud will also benefit from the in-depth discussion this report provides.
This report covers:
Major tasks of data engineers today
The different levels of structure in data and ways to maximize its value
Capabilities of third-party cloud options
Tools for ingestion, transfer, and enrichment
Using containers and VMs to run the tools
Software engineering development
Automation and orchestration of data engineering