IBM
IBM Data Architect with SQL, Spark & Kafka Professional Certificate

Launch your career as a data architect. Learn in-demand skills like data engineering, database management, and data architecture in less than 5 months.

Instructors: Muhammad Yahya

Included with Coursera Plus

Earn a career credential that demonstrates your expertise

(112 reviews)

Beginner level

4 months at 10 hours a week
Flexible schedule
Earn a career credential
Share your expertise with employers

What you'll learn

  • Build the job-ready skills you need to succeed as a data architect, including database design, data engineering, and database management.

  • Learn how to use tools such as Airflow, Kafka, Spark, and Hadoop to build ETL workflows and process big data for analytics.

  • Manage relational and non-relational databases, data pipelines, and data warehouses.

  • Implement data privacy measures and governance and regulatory compliance protocols.

What’s included

Shareable certificate

Add to your LinkedIn profile

Taught in English
152 practice exercises

Professional Certificate - 13 course series

What you'll learn

  • List basic skills required for an entry-level data engineering role.

  • Discuss various stages and concepts in the data engineering lifecycle.

  • Describe data engineering technologies such as Relational Databases, NoSQL Data Stores, and Big Data Engines.

  • Summarize concepts in data security, governance, and compliance.

Skills you'll gain

Data Pipelines, Extract, Transform, Load, Data Warehousing, Data Architecture, Data Security, Relational Databases, Data Governance, Data Store, SQL, Big Data, NoSQL, Apache Spark, Data Lakes, Apache Hadoop, Databases, and Data Science

What you'll learn

  • Describe data, databases, relational databases, and cloud databases.

  • Describe information and data models, relational databases, and relational model concepts (including schemas and tables). 

  • Explain an Entity Relationship Diagram and design a relational database for a specific use case.

  • Develop a working knowledge of popular DBMSes, including MySQL, PostgreSQL, and IBM DB2.

Skills you'll gain

Relational Databases, Database Design, SQL, PostgreSQL, MySQL, Database Architecture and Administration, Data Manipulation, Data Modeling, IBM DB2, Database Management Systems, Data Integrity, Databases, Command-Line Interface, and Data Management

What you'll learn

  • Analyze data within a database using SQL.

  • Create a relational database in the cloud and work with tables.

  • Write SQL statements including SELECT, INSERT, UPDATE, and DELETE.

  • Build more powerful queries with advanced SQL techniques such as views, transactions, stored procedures, and joins.

Skills you'll gain

SQL, Stored Procedure, Transaction Processing, Data Manipulation, Relational Databases, Data Analysis, Microsoft SQL Server, MySQL, Query Languages, Database Systems, Databases, IBM DB2, and Database Management
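
As a rough illustration of the statements named in this course (SELECT, INSERT, UPDATE, DELETE, plus a transaction), here is a minimal sketch using Python's built-in `sqlite3` module. The `employees` table and its rows are hypothetical and not taken from the course; the course itself may use a different database such as MySQL or IBM DB2.

```python
import sqlite3

# Hypothetical in-memory database and table, used only for illustration.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employees (id INTEGER PRIMARY KEY, name TEXT, salary REAL)"
)

# INSERT: add three rows.
conn.executemany(
    "INSERT INTO employees (id, name, salary) VALUES (?, ?, ?)",
    [(1, "Ada", 5000.0), (2, "Linus", 4000.0), (3, "Grace", 4500.0)],
)

# UPDATE: change one row.
conn.execute(
    "UPDATE employees SET salary = salary + 500 WHERE name = ?", ("Linus",)
)

# DELETE: remove one row.
conn.execute("DELETE FROM employees WHERE id = ?", (3,))
conn.commit()

# Transaction: the connection used as a context manager commits on success
# and rolls back if an exception escapes the block.
try:
    with conn:
        conn.execute("UPDATE employees SET salary = 0")
        raise RuntimeError("simulated failure")
except RuntimeError:
    pass  # the UPDATE above has been rolled back

# SELECT: read the remaining rows.
rows = conn.execute(
    "SELECT id, name, salary FROM employees ORDER BY id"
).fetchall()
print(rows)
```

The parameter placeholders (`?`) are specific to `sqlite3`; other database drivers may use a different placeholder style.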

What you'll learn

  • Describe the Linux architecture and common Linux distributions, and update and install software on a Linux system.

  • Perform common informational, file, content, navigational, compression, and networking commands in Bash shell.

  • Develop shell scripts using Linux commands, environment variables, pipes, and filters.

  • Schedule cron jobs in Linux with crontab and explain the cron syntax. 

Skills you'll gain

Linux Commands, Shell Script, Linux, File Management, Scripting, Unix, Unix Commands, Software Installation, Network Protocols, Bash (Scripting Language), Linux Servers, Command-Line Interface, Operating Systems, Ubuntu, Scripting Languages, and Automation
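
To make the cron syntax mentioned above concrete: a standard crontab entry has five schedule fields (minute, hour, day of month, month, day of week) followed by the command to run. The sketch below splits such a line into named fields. The helper `parse_cron_line` and the script path are hypothetical, chosen only for illustration.

```python
# Names of the five schedule fields in a standard crontab entry, in order.
FIELDS = ("minute", "hour", "day_of_month", "month", "day_of_week")


def parse_cron_line(line):
    """Split a crontab entry into its schedule fields and its command.

    Hypothetical helper for illustration; it does not validate field values.
    """
    parts = line.split(None, 5)  # at most five splits: 5 fields + command
    if len(parts) < 6:
        raise ValueError("expected five schedule fields followed by a command")
    schedule = dict(zip(FIELDS, parts[:5]))
    return schedule, parts[5]


# "30 2 * * 1" means 02:30 every Monday (day_of_week 1).
schedule, command = parse_cron_line("30 2 * * 1 /usr/local/bin/backup.sh --full")
print(schedule)
print(command)
```

An asterisk in a field means "every value", so this entry runs regardless of the day of month or the month.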

What you'll learn

  • Create, query, and configure databases, and access and build system objects such as tables.

  • Perform basic database management including backing up and restoring databases as well as managing user roles and permissions. 

  • Monitor and optimize important aspects of database performance. 

  • Troubleshoot database issues such as connectivity, login, and configuration and automate functions such as reports, notifications, and alerts. 

Skills you'll gain

Database Management, Database Architecture and Administration, Relational Databases, Database Design, Database Systems, Database Administration, Encryption, MySQL, Disaster Recovery, Role-Based Access Control (RBAC), Data Storage Technologies, User Accounts, IBM DB2, Performance Tuning, PostgreSQL, System Monitoring, and Operational Databases

What you'll learn

  • Build job-ready data warehousing skills in just 6 weeks, supported by practical experience and an IBM credential.

  • Design and populate a data warehouse, and model and query data using CUBE, ROLLUP, and materialized views.

  • Identify popular data analytics and business intelligence tools and vendors and create data visualizations using IBM Cognos Analytics.

  • Design a data warehouse and load data into it, write aggregation queries, create materialized query tables, and create an analytics dashboard.

Skills you'll gain

Data Warehousing, Data Lakes, Snowflake Schema, Star Schema, Data Mart, IBM DB2, SQL, Extract, Transform, Load, Database Systems, Data Integration, Database Design, Data Modeling, Data Validation, Data Cleansing, Query Languages, Data Architecture, PostgreSQL, and Data Quality

What you'll learn

  • Differentiate among the four main categories of NoSQL repositories.

  • Describe the characteristics, features, benefits, limitations, and applications of the more popular Big Data processing tools.

  • Perform common MongoDB tasks, including create, read, update, and delete (CRUD) operations.

  • Execute keyspace, table, and CRUD operations in Cassandra.

Skills you'll gain

NoSQL, MongoDB, Apache Cassandra, Data Modeling, Query Languages, Distributed Computing, Scalability, Database Architecture and Administration, Data Manipulation, Databases, JSON, Database Management, and IBM Cloud

What you'll learn

  • Describe and contrast Extract, Transform, Load (ETL) processes and Extract, Load, Transform (ELT) processes.

  • Explain batch vs concurrent modes of execution.

  • Implement ETL workflows using Bash and Python functions.

  • Describe data pipeline components, processes, tools, and technologies.

Skills you'll gain

Extract, Transform, Load, Data Pipelines, Apache Airflow, Apache Kafka, Shell Script, Command-Line Interface, Data Integration, Data Migration, Data Mart, Data Transformation, Unix Shell, Scalability, Big Data, Web Scraping, Performance Tuning, Data Warehousing, and Data Processing
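
An ETL workflow built from Python functions, as described in this course, might look roughly like the sketch below. It is only an assumption about the general shape: the sample CSV data, the `people` table, and the function names `extract`, `transform`, and `load` are hypothetical, and an in-memory SQLite database stands in for the target system.

```python
import csv
import io
import sqlite3

# Hypothetical source data; a real pipeline would read from files or APIs.
RAW_CSV = "name,height_in\nAda,65\nLinus,70\n"


def extract(text):
    """Extract: parse CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: convert heights from inches to meters, rounded to 2 places."""
    return [(r["name"], round(float(r["height_in"]) * 0.0254, 2)) for r in rows]


def load(records, conn):
    """Load: write the transformed records into the target table."""
    conn.execute("CREATE TABLE IF NOT EXISTS people (name TEXT, height_m REAL)")
    conn.executemany("INSERT INTO people VALUES (?, ?)", records)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(transform(extract(RAW_CSV)), conn)

result = conn.execute("SELECT name, height_m FROM people ORDER BY name").fetchall()
print(result)
```

In an ELT process, by contrast, the raw rows would be loaded first and the unit conversion would run inside the target system.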

What you'll learn

  • Explain the impact of big data, including use cases, tools, and processing methods.

  • Describe Apache Hadoop architecture, ecosystem, practices, and user-related applications, including Hive, HDFS, HBase, Spark, and MapReduce.

  • Apply Spark programming basics, including parallel programming basics for DataFrames, Datasets, and Spark SQL.

  • Use Spark’s RDDs and Datasets, optimize Spark SQL using Catalyst and Tungsten, and use Spark’s development and runtime environment options.

Skills you'll gain

Apache Spark, Distributed Computing, Big Data, Apache Hadoop, IBM Cloud, Debugging, Apache Hive, Scalability, Kubernetes, Data Transformation, Data Processing, PySpark, Docker (Software), and Performance Tuning

What you'll learn

  • Build valuable applied data storage, integration, and migration skills employers need. 

  • Gain hands-on experience using industry-specific data tools.

  • Demonstrate you understand data-related best practices and can apply methodologies through industry-standard processes. 

  • Showcase your ability to solve problems related to data processes that you can talk about in interviews.

Skills you'll gain

Data Migration, Extract, Transform, Load, Data Integration, Cloud Storage, Data Storage, Data Security, Data Infrastructure, Data Pipelines, Data Architecture, Data Management, and Disaster Recovery

What you'll learn

  • Develop and implement effective data privacy and security strategies.

  • Understand and apply security measures to protect and govern organizational data.

  • Conduct risk assessments and implement appropriate risk management practices.

  • Navigate and comply with relevant legal and regulatory compliance requirements.

Skills you'll gain

Data Architecture, Data Security, Compliance Management, Encryption, Data Governance, Incident Response, Data Quality, Risk Management, Cybersecurity, Personally Identifiable Information, Threat Detection, Data Integrity, Information Privacy, Security Controls, and Law, Regulation, and Compliance

What you'll learn

  • Explain the importance, benefits, and core components of Enterprise Data Architecture (EDA), and identify popular data architecture frameworks.

  • Design and implement Enterprise Data Architectures for specific use cases.

  • Develop and implement policies and procedures, such as data retention policies and operational standards.

  • Plan and execute data system migrations and modernizations.

Skills you'll gain

Data Governance, Data Migration, Data Architecture, Enterprise Architecture, Scalability, Data Management, Data Modeling, Emerging Technologies, Database Architecture and Administration, Data Warehousing, Data Storage, Data Processing, Extract, Transform, Load, Data Integration, Technology Strategies, Dataflow, and Application Frameworks

What you'll learn

  • Gain hands-on experience working with data architecture that you can showcase in your portfolio and talk about in interviews.

  • Analyze and assess existing data architecture in alignment with business objectives.

  • Implement data migration and integration solutions, including building data pipelines.

  • Apply data governance and security protocols, ensuring compliance and data protection.

Skills you'll gain

Data Migration, Data Integration, Case Studies, Enterprise Architecture, Data Integrity, and Compliance Management

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

  • Muhammad Yahya (IBM), 5 courses, 93,028 learners
  • SkillUp (SkillUp), 107 courses, 324,226 learners
  • Romeo Kienzler (IBM), 10 courses, 793,903 learners
  • Rav Ahuja (IBM), 56 courses, 4,372,168 learners
  • Sandip Saha Joy (IBM), 5 courses, 649,987 learners
  • Priya Kapoor (IBM), 1 course, 228,209 learners
  • Steve Ryan (IBM), 12 courses, 726,073 learners
  • Lavanya Thiruvali Sunderarajan (SkillUp), 8 courses, 228,149 learners
  • Aije Egwaikhide (IBM), 6 courses, 754,435 learners
  • Yan Luo (IBM), 7 courses, 379,108 learners
  • Ramesh Sannareddy (IBM), 15 courses, 451,032 learners
  • Sabrina Spillner (IBM), 1 course, 63,447 learners

Offered by

IBM
SkillUp
