Qu'elles soient, big ou small, structurées ou semi structurées, relationnelles ou pas, nous vous proposons un tour des solutions data dans Azure.
Vidéo disponible ici : https://youtu.be/6D4jAcxfWCQ.
This document provides an overview of cloud computing and the top 6 cloud service providers:
1. It defines cloud, cloud computing, and cloud services as computing resources, data storage, and services available over the internet.
2. The top 6 cloud service providers are identified as Amazon Web Services, Microsoft Azure, Google Cloud, Alibaba Cloud, IBM Cloud, and Oracle.
3. Each provider is briefly described, highlighting their service categories including compute, storage, databases, analytics, AI/ML, security, and networking.
The document discusses the intelligent edge and hybrid cloud computing. It defines the intelligent edge as where data is created and processed outside traditional centralized data centers. It predicts that by 2025, 75% of enterprise data will be created and processed at the edge. It then provides an overview of different Azure products and solutions for intelligent edge computing, including Azure Sphere, IoT Edge, Stack Edge, and Stack Hub. It discusses how these products bring cloud services and capabilities to the edge through appliances, gateways, and on-premises servers to enable hybrid cloud solutions.
What are the Business Benefits of Microsoft AzureChris Roche
This document outlines the business benefits of Microsoft Azure, including its strong security features with data centers like spy movie facilities, cost savings from no longer needing to replace servers, scalability to flexibly adjust needs, ability to use hybrid cloud and on-premise resources, fast speed, disaster recovery by backing up to the cloud, and compliance with security and privacy demands.
Cassandra at eBay - Cassandra Summit 2013Jay Patel
"Buy It Now! Cassandra at eBay" talk at Cassandra Summit 2013
This session will cover various use cases for Cassandra at eBay. It’ll start with overview of eBay’s heterogeneous data platform comprised of SQL & NoSQL databases, and where Cassandra fits into that. For each use case, Jay will go into detail of system design, data model & multi-datacenter deployment. To conclude, Jay will summarize the best practices that guide Cassandra utilization at eBay.
http://www.datastax.com/company/news-and-events/events/cassandrasummit2013
This document provides an overview of Google Cloud Platform (GCP) services. It discusses computing services like App Engine and Compute Engine for hosting applications. It covers storage options like Cloud Storage, Cloud Datastore and Cloud SQL. It also mentions big data services like BigQuery and machine learning services like Prediction API. The document provides brief descriptions of each service and highlights their key features. It includes code samples for using Prediction API to train a model and make predictions on new data.
The document discusses Microsoft Azure storage solutions and services, highlighting key capabilities like Azure Files for file shares, Premium Storage for high performance workloads, and integration with hybrid solutions like StorSimple. It also provides an overview of Azure Storage APIs and compares Azure storage features to competitive offerings from AWS. The document is aimed at helping customers understand how Azure storage can meet their needs for scalability, reliability, security and hybrid cloud capabilities.
ITCamp 2018 - Thomas Maurer - Azure Stack - Everything you need to know!ITCamp
Microsoft released Azure Stack as a Azure appliance for your datacenter. Learn what Azure Stack is, what challenges it solves, how you deploy, manage and operate a Azure Stack in your datacenter. Learn about the features and services you will get by offering Azure Stack to your customers and how you can build a true Hybrid Cloud experience.
In this presentation Thomas Maurer (Microsoft MVP) will guide you through the highly anticipated innovations and experience during the Azure Stack Early Adaption Program and Azure Stack Technology Adoption Program (TAP).
This is a brief introduction to Microsoft Azure cloud. I used these slides in an intro session for developers. I did few demos during the session that not included in the slide. Brand name and logos are properties of their respective owners.
Data saturday Oslo Azure Purview Erwin de KreukErwin de Kreuk
Azure Purview provides unified data governance capabilities including automated data discovery, classification, and lineage visualization. It helps organizations overcome data governance silos, comply with regulations, and increase data agility. The key components of Azure Purview include the Data Map for automated metadata extraction and lineage, the Data Catalog for data discovery and governance, and Insights for monitoring data usage. It supports governance of data across cloud and on-premises environments in a serverless and fully managed platform.
Google Cloud Storage | Google Cloud Platform Tutorial | Google Cloud Architec...Edureka!
(Google Cloud Certification Training - Cloud Architect: https://www.edureka.co/google-cloud-architect-certification-training)
This tutorial on Google Cloud Storage will provide you with a detailed introduction to the various Cloud Storage Services provided by Google. You will also get hands-on on each of the storage options.
Azure templates can be used to deploy and manage Azure resources in a declarative and repeatable way. They define the resources to deploy, including virtual machines, databases, and networking components, as well as the relationships between resources. Azure templates allow for idempotent deployments, simplified orchestration of rollbacks and upgrades, and cross-resource configuration and updates. They are stored as JSON or ARM template files in source control and can be deployed via the Azure CLI, PowerShell, or REST APIs. A wide range of community-created quickstart templates are available on GitHub for common workload deployments.
Mastering azure devOps - Dot Net TricksGaurav Singh
DevOps is the combination of "development and operations" where the Collaboration of software development (Dev) and information-technology operations (Ops) aims to to deliver applications and software services at high speed and high velocity using combination of cultural philosophies, practices, and tools.
Amazon Web Services (AWS) provides a set of cloud computing services including compute, storage, databases, analytics, and application services. AWS is the market leader in cloud services and offers virtual machines (EC2), file storage (S3), relational databases (RDS), data warehousing (Redshift), streaming data (Kinesis), and other services. This document demonstrates several AWS services including EC2, S3, RDS, Redshift, DynamoDB, and Kinesis. It provides guidance on choosing the appropriate AWS services for different use cases and discusses best practices for managing costs when using AWS.
The cloud is all the rage. Does it live up to its hype? What are the benefits of the cloud? Join me as I discuss the reasons so many companies are moving to the cloud and demo how to get up and running with a VM (IaaS) and a database (PaaS) in Azure. See why the ability to scale easily, the quickness that you can create a VM, and the built-in redundancy are just some of the reasons that moving to the cloud a “no brainer”. And if you have an on-prem datacenter, learn how to get out of the air-conditioning business!
This document discusses Google Cloud Platform and how Google powers its own services. It notes that Google is the fourth largest server manufacturer and would be the second largest internet service provider by traffic. It describes how Google builds customized hardware from cheap commodity parts and manages vast numbers of homogeneous servers at scale with software resilience and horizontal layers rather than hardware resilience and vertical stacks. The document also provides an overview of how Google's global data centers, communications network, data storage and distribution, services and APIs, and compute platforms can be utilized to build and scale applications. It includes several customer stories about how companies have used Google Cloud Platform for applications experiencing peak traffic, global data storage, crowd-sourcing weather data, and syncing notes across devices.
MSHOWTO ile Tech Summit 1'de Bende Özgür Çebi ile birlikte Citrix on Azure oturumunu gerçekleştirdim. Bu oturuma ait sunumu bu adresten inceleyebilirsiniz.
Azure Stack - Azure in your own Data CenterAdnan Hashmi
This document summarizes a presentation on Azure Stack. Azure Stack allows organizations to run Azure services on-premises, providing a consistent experience with the public Azure cloud. It builds on cloud-inspired hybrid infrastructure using cloud-consistent delivery of infrastructure as a service (IaaS) and platform as a service (PaaS). Azure Stack enables development of applications that are cloud-native and cloud-optimized, taking advantage of features like Azure resource groups and Resource Manager templates both on-premises and in the public cloud. The presentation covered the components, evolution, and use cases of Azure Stack.
This document provides an overview of Microsoft Azure cloud services and why businesses use the cloud. It discusses Infrastructure as a Service (IaaS), Platform as a Service (PaaS), and Software as a Service (SaaS) models. Key Azure services are mentioned, including Virtual Machines, SQL Database, storage, and web apps. The cloud allows businesses to rapidly setup environments, scale as needed, and increase efficiency at a lower cost compared to on-premises infrastructure.
Architecting Enterprise Applications in the Cloud presentation by Matt Tavis, AWS Solutions Architect, and the Cloud for the Enterprise Event in NY on October 19, 2009
The document discusses hybrid cloud applications using Azure and Azure Stack. It describes Azure Stack as an extension of Azure that allows using Azure services on-premises. Data and applications can be shared between private and public clouds using a hybrid cloud environment. The document also covers data migration to Azure SQL Database Managed Instance, hybrid identity using Azure AD Connect, and hybrid CI/CD pipelines that allow deploying applications to both Azure and Azure Stack.
Technical session on Databases as Service in Azure
Technical session - Azure SQL DB on Dec 20, 2020
https://youtu.be/Cl4IDpc_0yc
Technical session - 2 on Azure SQL DB - Dec 27, 2020
https://youtu.be/_4lZ54eI3F0
Technical session on Azure Cosmos DB -Dec 27, 2020
https://youtu.be/rtDwX1K_64k
This document provides an overview and summary of the author's background and expertise. It states that the author has over 30 years of experience in IT working on many BI and data warehouse projects. It also lists that the author has experience as a developer, DBA, architect, and consultant. It provides certifications held and publications authored as well as noting previous recognition as an SQL Server MVP.
Rising Interest in Open Source Relational DatabasesChristopher Foot
We discuss the rising interest in cloud and on-premises open source relational databases and why we should consider them as viable replacements for their commercial counterparts. We analyze the causal factors driving their increasing popularity, learn how the cloud is accelerating that growth and evaluate the perennial favorites as well as some of the more recent open source offerings. We end with a discussion on recommended commercial DBMS product replacement strategies and open source vendor selection best practices.
In this session, Sergio covered the Lakehouse concept and how companies implement it, from data ingestion to insight. He showed how you could use Azure Data Services to speed up your Analytics project from ingesting, modelling and delivering insights to end users.
This document provides an overview of using open source databases on Microsoft Azure. It discusses trends in open source databases and how Azure supports popular open source databases like MySQL, MariaDB, and PostgreSQL as fully managed database services. It covers benefits of migrating on-premises or third party databases to Azure databases, including cost savings, global scale, built-in high availability, security, and integration with other Azure services. Migration from commercial databases like Oracle to open source databases on Azure like PostgreSQL is also discussed.
If you are seeking ways to improve your cloud database environment with EDB Postgres, this presentation reviews how you can create a Database-as-a-Service (DBaaS) with EDB Postgres on AWS.
This presentation outlines how EDB Ark can play a key role in your digital transformation with more agility and speed.
It highlights:
● How EDB Ark can integrate with your existing AWS environment and other clouds
● How you can automate your database deployments to instantly spin up new databases
● How to manage your database environment easier using the same GUI for all clouds
● How to boost developer efficiency and satisfaction
Whether your database is currently in the cloud or you are considering the cloud as an option, this presentation will provide you with the information you need to evaluate EDB Postgres and EDB Ark.
The recording of this presentation includes a demonstration. Visit www.edbpostgres.com > resources > webcasts
This document summarizes key components of Microsoft Azure's data platform, including SQL Database, NoSQL options like Azure Tables, Blob Storage, and Azure Files. It provides an overview of each service, how they work, common use cases, and demos of creating resources and accessing data. The document is aimed at helping readers understand Azure's database and data storage options for building cloud applications.
2014.10.22 Building Azure Solutions with Office 365Marco Parenzan
This document discusses building Azure solutions with Office 365. It provides an overview of Microsoft Azure services including compute, storage, networking and identity services. It also discusses Office 365 APIs for integrating with calendar, mail and contacts. Code samples are shown for accessing these APIs through REST calls and a library that abstracts away the REST requests. The document concludes with a demonstration of an application that integrates Office 365 and Azure services.
This document provides an introduction to Cloudant, which is a fully managed NoSQL database as a service (DBaaS) that provides a scalable and flexible data layer for web and mobile applications. The presentation discusses NoSQL databases and why they are useful, describes Cloudant's features such as document storage, querying, indexing and its global data presence. It also provides examples of how companies like FitnessKeeper and Fidelity Investments use Cloudant to solve data scaling and management challenges. The document concludes by outlining next steps for signing up and exploring Cloudant.
A sharing in a meetup of the AWS Taiwan User Group.
The registration page: https://bityl.co/7yRK
The promotion page: https://www.facebook.com/groups/awsugtw/permalink/4123481584394988/
What is in a modern BI architecture? In this presentation, we explore PaaS, Azure Active Directory and Storage options including SQL Database and SQL Datawarehouse.
Azure SQL Database is a relational database-as-a-service hosted in the Azure cloud that reduces costs by eliminating the need to manage virtual machines, operating systems, or database software. It provides automatic backups, high availability through geo-replication, and the ability to scale performance by changing service tiers. Azure Cosmos DB is a globally distributed, multi-model database that supports automatic indexing, multiple data models via different APIs, and configurable consistency levels with strong performance guarantees. Azure Redis Cache uses the open-source Redis data structure store with managed caching instances in Azure for improved application performance.
Clash of Technologies Google Cloud vs Microsoft AzureMihail Mateev
This document compares Google Cloud and Microsoft Azure on various features. It discusses their pricing models, infrastructure as a service and platform as a service capabilities. Some key findings are that Azure has better coverage in Asia while Google Cloud has better coverage in the US. AWS leads the cloud market currently. The document also analyzes storage performance, virtual machine pricing and types, database offerings, microservices support, load balancing options and example use cases for each provider.
* Use cases of MySQL as well as edge cases of MySQL topologies using real-life examples and "war" stories
* How scalability and proxy wars make MySQL topologies more robust to serve webscale shops
* Open-source tools, utilities, and surrounding MySQL Ecosystem.
A Tour of Azure SQL Databases (NOVA SQL UG 2020)Timothy McAliley
This document provides information about upcoming webinars on Azure SQL and AI/ML hosted by various user groups. It lists the experience of the person running the user groups and provides an agenda for upcoming webinars in May and June 2020 that will cover various Azure database and analytics services. It also includes references and links for further learning about Azure SQL Database, Azure SQL Managed Instance, high availability and disaster recovery options.
Similar to 20210427 azure lille_meetup_azure_data_stack (20)
Dive into the world of CosmosDB with this in-depth class support material! 🚀
📚 In this presentation, I covered key concepts, best practices, and hands-on insights into working with Azure CosmosDB. Whether you're a beginner or seeking to deepen your understanding, this guide provides a valuable resource to navigate the complexities of CosmosDB.
🔍 Topics Covered:
- Introduction to CosmosDB
- Data Modeling Strategies
- Querying and Indexing
- Performance Optimization
- Scalability and Partitioning
- Real-world Use Cases
💡 Why CosmosDB?
Discover why CosmosDB is a game-changer in the world of NoSQL databases, offering unparalleled scalability, global distribution, and multi-model flexibility.
🚀 Who Should Explore This?
- Developers
- Database Administrators
- Data Engineers
- Cloud Enthusiasts
comparatifs des familles NoSQL & concepts de modélisationAlexandre BERGERE
Support de présentation présentée au Connected Week d'Angers lors de la data day.
Le thème abordé est le suivant: "comparatifs des familles NoSQL & concepts de modélisation"
With this support you would be able to have the basic of Azure Data slack and it will help you to pass the DP-200 and DP-201. If you need some basics on Azure, you can download this support : https://www.slideshare.net/AlexandreBERGERE/azure-fundamentals-153339148.
This support is a summary from the paths:
Azure for the Data Engineer
Store data in Azure
Work with relational data in Azure
Large Scale Data Processing with Azure Data Lake Storage Gen2
Implement a Data Streaming Solution with Azure Streaming Analytics
Implement a Data Warehouse with Azure SQL Data Warehouse
in Microsoft Learn.
This document contains the planning and schedule for a 5 day Big Data class being taught by Alexandre Bergere. The schedule outlines the topics to be covered each day, including What is Big Data, NoSQL, Cloud Architecture, Spark, data storage options, MongoDB, and more. Presentation slides are included that provide more detail on MongoDB, its flexibility, types of data it can store, CRUD operations, and other tools like MongoDB Compass.
Iot streaming with Azure Stream Analytics from IotHub to the full data slackAlexandre BERGERE
In this article I'm going to explain how to push data from iot devices through Azure Stream Analytics into multiples channels: Azure Blob Storage (as a cold database), Azure Cosmos DB (as a hot database), Power BI (for data visualization) and Azure Service Bus & Azure Logic App (data processing & user interaction).
This document provides information about a MongoDB class taught by Alexandre Bergere. The class covers topics including Big Data, NoSQL, MongoDB architecture and modeling, CRUD operations, replication, security, and aggregation. It includes Alexandre's background and credentials, as well as sources and use cases for MongoDB.
DefCamp_2016_Chemerkin_Yury-publish.pdf - Presentation by Yury Chemerkin at DefCamp 2016 discussing mobile app vulnerabilities, data protection issues, and analysis of security levels across different types of mobile applications.
"Building Future-Ready Apps with .NET 8 and Azure Serverless Ecosystem", Stan...Fwdays
.NET 8 brought a lot of improvements for developers and maturity to the Azure serverless container ecosystem. So, this talk will cover these changes and explain how you can apply them to your projects. Another reason for this talk is the re-invention of Serverless from a DevOps perspective as a Platform Engineering trend with Backstage and the recent Radius project from Microsoft. So now is the perfect time to look at developer productivity tooling and serverless apps from Microsoft's perspective.
Finetuning GenAI For Hacking and DefendingPriyanka Aash
Generative AI, particularly through the lens of large language models (LLMs), represents a transformative leap in artificial intelligence. With advancements that have fundamentally altered our approach to AI, understanding and leveraging these technologies is crucial for innovators and practitioners alike. This comprehensive exploration delves into the intricacies of GenAI, from its foundational principles and historical evolution to its practical applications in security and beyond.
Keynote : AI & Future Of Offensive SecurityPriyanka Aash
In the presentation, the focus is on the transformative impact of artificial intelligence (AI) in cybersecurity, particularly in the context of malware generation and adversarial attacks. AI promises to revolutionize the field by enabling scalable solutions to historically challenging problems such as continuous threat simulation, autonomous attack path generation, and the creation of sophisticated attack payloads. The discussions underscore how AI-powered tools like AI-based penetration testing can outpace traditional methods, enhancing security posture by efficiently identifying and mitigating vulnerabilities across complex attack surfaces. The use of AI in red teaming further amplifies these capabilities, allowing organizations to validate security controls effectively against diverse adversarial scenarios. These advancements not only streamline testing processes but also bolster defense strategies, ensuring readiness against evolving cyber threats.
TrustArc Webinar - Innovating with TRUSTe Responsible AI CertificationTrustArc
In a landmark year marked by significant AI advancements, it’s vital to prioritize transparency, accountability, and respect for privacy rights with your AI innovation.
Learn how to navigate the shifting AI landscape with our innovative solution TRUSTe Responsible AI Certification, the first AI certification designed for data protection and privacy. Crafted by a team with 10,000+ privacy certifications issued, this framework integrated industry standards and laws for responsible AI governance.
This webinar will review:
- How compliance can play a role in the development and deployment of AI systems
- How to model trust and transparency across products and services
- How to save time and work smarter in understanding regulatory obligations, including AI
- How to operationalize and deploy AI governance best practices in your organization
"Making .NET Application Even Faster", Sergey Teplyakov.pptxFwdays
In this talk we're going to explore performance improvement lifecycle, starting with setting the performance goals, using profilers to figure out the bottle necks, making a fix and validating that the fix works by benchmarking it. The talk will be useful for novice and seasoned .NET developers and architects interested in making their application fast and understanding how things work under the hood.
The History of Embeddings & Multimodal EmbeddingsZilliz
Frank Liu will walk through the history of embeddings and how we got to the cool embedding models used today. He'll end with a demo on how multimodal RAG is used.
Top 12 AI Technology Trends For 2024.pdfMarrie Morris
Technology has become an irreplaceable component of our daily lives. The role of AI in technology revolutionizes our lives for the betterment of the future. In this article, we will learn about the top 12 AI technology trends for 2024.
Retrieval Augmented Generation Evaluation with RagasZilliz
Retrieval Augmented Generation (RAG) enhances chatbots by incorporating custom data in the prompt. Using large language models (LLMs) as judge has gained prominence in modern RAG systems. This talk will demo Ragas, an open-source automation tool for RAG evaluations. Christy will talk about and demo evaluating a RAG pipeline using Milvus and RAG metrics like context F1-score and answer correctness.
Discovery Series - Zero to Hero - Task Mining Session 1DianaGray10
This session is focused on providing you with an introduction to task mining. We will go over different types of task mining and provide you with a real-world demo on each type of task mining in detail.
Generative AI technology is a fascinating field that focuses on creating comp...Nohoax Kanont
Generative AI technology is a fascinating field that focuses on creating computer models capable of generating new, original content. It leverages the power of large language models, neural networks, and machine learning to produce content that can mimic human creativity. This technology has seen a surge in innovation and adoption since the introduction of ChatGPT in 2022, leading to significant productivity benefits across various industries. With its ability to generate text, images, video, and audio, generative AI is transforming how we interact with technology and the types of tasks that can be automated.
The Challenge of Interpretability in Generative AI Models.pdfSara Kroft
Navigating the intricacies of generative AI models reveals a pressing challenge: interpretability. Our blog delves into the complexities of understanding how these advanced models make decisions, shedding light on the mechanisms behind their outputs. Explore the latest research, practical implications, and ethical considerations, as we unravel the opaque processes that drive generative AI. Join us in this insightful journey to demystify the black box of artificial intelligence.
Dive into the complexities of generative AI with our blog on interpretability. Find out why making AI models understandable is key to trust and ethical use and discover current efforts to tackle this big challenge.
2. dataredkite.com
premiseo.com
Who are we ?
26/02/2021 2
I'm a data and cloud Architect and Spark lover.
I worked many years as an Oracle consultant and
expert, and now I work with Cloud solutions devoted to
solve complex problems with high volumes of data.
I am a Data Analyst & Solution Architect indepedent -
☁️ MCSE, Cosmos DB & Delta lover.
I developed my skills through various clients' projects,
teaching at the University and personal proof of
concepts.
I’m also the Co-Founder of DataRedkite, a product which
can quickly give to its user a good management of data
in Microsoft Azure DataLake.
Laurent Leturgez Alexandre Bergere
Meetup Azure Lille
5. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 5
Relational Databases
Managed relational SQL Database
as a service
Azure SQL Database
Managed MariaDB database
service for app developers
Azure Database for MariaDB
Managed MySQL database service
for app developers
Azure Database for MySQL
Managed Postgres database
service for app developers
Azure Database for PostGres
6. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 6
Relational Databases
Managed relational SQL Database
as a service
Azure SQL Database
Managed MySQL database service
for app developers
Azure Database for MySQL
Managed MariaDB database
service for app developers
Azure Database for MariaDB
Managed Postgres database
service for app developers
Azure Database for PostGres
8. dataredkite.com
premiseo.com
Azure SQL Database
27/04/2021 Meetup Azure Lille 8
• Azure SQL
• SQL Server Paas service
• Managed upgrades, patches, backups and monitoring
• Latest Stable version of SQL Server
• 99,99% availability
• Deployment model
• Single Database : database runs on non shared resources
• Elastic Pool : database runs with a collection of databases that share set of resources at a
predictable price
9. dataredkite.com
premiseo.com
Azure SQL Database
27/04/2021 Meetup Azure Lille 9
• Azure SQL
• Purchasing model
• DTU (Database Transaction Unit) : https://docs.microsoft.com/en-us/azure/azure-sql/database/service-tiers-
dtu
• Basic tier
• Standard Tier
• Premium Tier
• vCore model
• Serverless
• Service Tier
• General Purpose (vCore) / Standard (DTU) : Common workloads
• Business Critical (vCore) / Premium (DTU) : High transaction and availability / low latency IO
• HyperScale (vCore) :
• Up to 100Tb Database
• Rapid Scale up (compute resources)
• Rapid Scale out (read only nodes : read workload / hot-standby)
10. dataredkite.com
premiseo.com
Azure SQL Database
27/04/2021 Meetup Azure Lille 10
• Azure SQL Managed Instance
• Features
• Paas platform for lift and shift at scale
• Broadest SQL Server engine compatibility (network integration, features etc.)
• With perservation of all Paas capabilities (patching, updates, backups, HA etc.)
• vCore purchase model only
• BYOL available
• SQL Virtual Machine
• SQL Server deployment on VM (Linux and Windows)
• Can choice SQL Server version
• From 2008 R2
• Up to 2019
11. dataredkite.com
premiseo.com
Azure SQL Database
27/04/2021 Meetup Azure Lille 11
Azure SQL Database
Managed Instance
Instance scoped model with
high compatibility to SQL Server
Best for modernisation at scale
with low cost effort (lift & shift)
Single
Standalone managed database
for predictable and stable
workloads
Elastic Pool
Shared resources model :
multitenant
12. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 12
Relational Databases
Managed relational SQL Database
as a service
Azure SQL Database
Managed MySQL database service
for app developers
Azure Database for MySQL
Managed MariaDB database
service for app developers
Azure Database for MariaDB
Managed Postgres database
service for app developers
Azure Database for PostGre
13. dataredkite.com
premiseo.com
Azure Database for PostgreSQL
27/04/2021 Meetup Azure Lille 13
• Paas Service for PostgreSQL
• Runs on Windows
• Single Server
• v9.5 to 11
• Up to 64 vCores depending on SKU (https://docs.microsoft.com/en-us/azure/postgresql/concepts-pricing-
tiers)
• Up to 2 for Basic SKU
• Up to 64 for General Purpose SKU
• Up to 32 for Memory Optimized SKU
• Bunch of PG Extensions available
• Automated Backup (retention up to 35days)
• Backup frequency and backup types depend on database size
• Geo-redundant backup option (General Purpose & Memory Optimized)
14. dataredkite.com
premiseo.com
Azure Database for PostgreSQL
27/04/2021 Meetup Azure Lille 14
• Paas Service for PostgreSQL
• HyperScale (Citus)
• High performance and analytical workloads beyond 100Gb
• Hyperscale delivers
• Horizontal scaling across multiple machine (with Sharding)
• Query parallelization across these servers
• High performance for analytics
• Based on server groups
• Design approach required for table distribution and performance
• Distributed tables (based on distribution column)
• Reference tables (content concentrated into a single shard replicated on every worker node)
• Local tables (ordinary unsharded tables. Perfect for small tables not involded into joins)
• Automated backup through storage snapshots
15. dataredkite.com
premiseo.com
Azure Database for PostgreSQL
27/04/2021 Meetup Azure Lille 15
• Paas Service for PostgreSQL
• Flexible Server (Preview)
• Automated patching
• Automatic backups
• Performance adjustment in three switchable compute tiers : Burstable, GP, Memory Optimized
High Availability Zone Redundant HA (Optional)
16. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 16
Relational Databases
Managed relational SQL Database
as a service
Azure SQL Database
Managed MariaDB database
service for app developers
Azure Database for MariaDB
Managed Postgres database
service for app developers
Azure Database for PostGre
Managed MySQL database service
for app developers
Azure Database for MySQL
17. dataredkite.com
premiseo.com
Azure Database for MariaDB
27/04/2021 Meetup Azure Lille 17
• Paas Service for MariaDB
• Runs on Windows
• Single Server
• V10.2 and 10.3
• Up to 64 vCores depending on SKU (https://docs.microsoft.com/en-us/azure/mariadb/concepts-pricing-tiers)
• Up to 2 for Basic SKU
• Up to 64 for General Purpose SKU
• Up to 32 for Memory Optimized SKU
• Automated Backup (retention up to 35days)
• Backup frequency and backup types depend on database size
• Geo-redundant backup option (General Purpose & Memory Optimized)
18. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 18
Relational Databases
Managed relational SQL Database
as a service
Azure SQL Database
Managed MariaDB database
service for app developers
Azure Database for MariaDB
Managed Postgres database
service for app developers
Azure Database for PostGre
Managed MySQL database service
for app developers
Azure Database for MySQL
19. dataredkite.com
premiseo.com
Azure Database for MySQL
27/04/2021 Meetup Azure Lille 19
• Paas Service for MySQL
• Runs on Windows
• Single Server
• V5.6, 5.7, and 8.0
• Up to 64 vCores depending on SKU (https://docs.microsoft.com/en-us/azure/mysql/concepts-pricing-tiers)
• Up to 2 for Basic SKU
• Up to 64 for General Purpose SKU
• Up to 32 for Memory Optimized SKU
• Automated Backup (retention up to 35days)
• Backup frequency and backup types depend on database size
• Geo-redundant backup option (General Purpose & Memory Optimized)
20. dataredkite.com
premiseo.com
Azure Database for MySQL
27/04/2021 Meetup Azure Lille 20
• Paas Service for MySQL
Flexible Server (Preview)
• V5.7
• Automated patching
• Automatic backups
• Performance adjustment in three switchable compute tiers : Burstable, GP, Memory Optimized
• Network Isolation
• Private Access through Vnet integration
• Public Access
25. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 25
Big Data
Storage
REST-based object storage for
unstructured data
Storage Account
Massively scalable, secure data
lake functionality built on Azure
Blob Storage
Azure Data Lake Storage
26. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 26
Big Data
Storage
REST-based object storage for
unstructured data
Storage Account
Massively scalable, secure data
lake functionality built on Azure
Blob Storage
Azure Data Lake Storage
27. dataredkite.com
premiseo.com
Storage Account
26/02/2021 27
o Azure Blobs : A scalable object store for text and binary data
o Azure Files : Managed file shares for cloud or on-premises deployments
o Azure Queues : A messaging store for reliable messaging between application components
o Azure Tables : A NoSQL store for no-schema storage of structured data
Azure Storage accounts are the base storage type within Azure. Azure Storage offers a very scalable object store for data
objects and file system services in the cloud. It can also provide a messaging store for reliable messaging, or it can act as a
NoSQL store.
Azure selected four of these data services and placed them together under the name Azure Storage. The four services are
Azure Blobs, Azure Files, Azure Queues, and Azure Tables. The following illustration shows the elements of Azure Storage
28. dataredkite.com
premiseo.com
Storage Account
26/02/2021 28
Type of Storage Account
Storage account type Services Redundancy options
General-purpose V2 Basic storage account type for blobs, files, queues, and tables. Recommended
for most scenarios using Azure Storage.
LRS, GRS, RA-GRS, ZRS, GZRS,
RA-GZRS
General-purpose V1 Legacy account type for blobs, files, queues, and tables. Use general-purpose
v2 accounts instead when possible.
LRS, GRS, RA-GRS
BlockBlobStorage Storage accounts with premium performance characteristics for block blobs
and append blobs. Recommended for scenarios with high transactions rates, or
scenarios that use smaller objects or require consistently low storage latency.
LRS, ZRS
FileStorage Files-only storage accounts with premium performance characteristics.
Recommended for enterprise or high performance scale applications.
LRS, ZRS
BlobStorage Legacy Blob-only storage accounts. Use general-purpose v2 accounts instead
when possible.
LRS, GRS, RA-GRS
31. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 31
Big Data
Storage
REST-based object storage for
unstructured data
Storage Account
Massively scalable, secure data
lake functionality built on Azure
Blob Storage
Azure Data Lake Storage
32. dataredkite.com
premiseo.com
Azure Datalake Store
26/02/2021 32
Azure Data Lake Storage is a Hadoop-compatible data repository that can store any size or type of data. This storage
service is available as Generation 1 (Gen1) or Generation 2 (Gen2).
Key features of Data Lake Storage:
o Unlimited scalability
o Hadoop compatibility
o Security support for both access control lists (ACLs) & RBAC (for Gen 2 only)
o POSIX compliance
o An optimized Azure Blob File System (ABFS) driver that's designed for big-data analytics
o Zone-redundant storage
o Geo-redundant storage
Azure Datalake Gen 1 Azure Datalake Gen 2
33. dataredkite.com
premiseo.com
Choose a storage solution on Azure
26/02/2021 33
Data classification Operations Latency & throughput Transactional support Recommended service
Product catalog data Semi-structured because of
the need to extend or modify
the schema for new products
o Customers require a high
number of read operations,
with the ability to query on
many fields within the
database.
o The business requires a
high number of write
operations to track the
constantly changing
inventory.
High throughput and low
latency
Required Azure Cosmos DB
Photos and videos Unstructured o Only need to be retrieved
by ID.
o Customers require a high
number of read operations
with low latency.
o Creates and updates will be
somewhat infrequent and
can have higher latency
than read operations.
Retrievals by ID need to
support low latency and high
throughput. Creates and
updates can have higher
latency than read operations.
Not required Azure Blob storage
Business data Structured Read-only, complex analytical
queries across multiple
databases
Some latency in the results is
expected based on the
complex nature of the queries
Required Azure SQL Database
Azure Database for MariaDB
Azure Database for PostGre
Azure Database for MySQL
37. dataredkite.com
premiseo.com
Azure Function
37
Azure Functions is the serverless compute service from Microsoft. Functions are event-driven: each function defines a
trigger — the exact definition of the event source, for instance, the name of a storage queue.
Uses cases:
If you want to... then...
Build a web API Implement an endpoint for your web applications using the HTTP trigger
Process file uploads Run code when a file is uploaded or changed in blob storage
Build a serverless workflow Chain a series of functions together using durable functions
Respond to database changes Run custom logic when a document is created or updated in Cosmos DB
Run scheduled tasks Execute code at set times
Create reliable message queue systems Process message queues using Queue Storage, Service Bus, or Event Hubs
38. dataredkite.com
premiseo.com
Azure Function
38
Consumption Plan Functions
Consumption Plan (B1, B2, B3, S1, S2, S3
Scale automatically and only pay for compute resources when your functions are running. On
the Consumption plan, instances of the Functions host will be dynamically added and
removed based on the number of incoming events.
Premium plan (P1v2, P2v2, P3v3)
While automatically scaling based on demand, use prewarmed workers to run applications
with no delay after being idle, run on more powerful instances and connect to VNETs.
Azure App Service plan
Run Functions within an App Service plan at regular App Service plan rates. Good fit for long-
running operations, as well as when more predictive scaling and costs are required.
Azure Functions hosting options : Azure Plan
39. dataredkite.com
premiseo.com
27/04/2021 39
Durable Functions is a library that brings workflow orchestration abstractions to Azure Functions. It introduces a number of idioms and tools
to define stateful, potentially long-running operations, and manages a lot of mechanics of reliable communication and state management
behind the scenes.
Log of events in the course of orchestrator
progression
3 steps of a workflow executed in sequence
https://medium.com/hackernoon/making-sense-of-azure-durable-functions-
645ecb3c1d58
Azure Function
Azure Durable Functions
41. dataredkite.com
premiseo.com
Azure Data Factory
27/04/2021 Meetup Azure Lille 41
• Serverless Data Integration service
• Data Pipeline : logical group of activities
• Data Flow : Data Transformation activity
• Data Copy : Data Transfer activity
• SSIS Integration
• Git integration
42. dataredkite.com
premiseo.com
Azure Data Factory
27/04/2021 Meetup Azure Lille 42
• Serverless Data Integration service
• Job scheduling
• Automatically through internal Scheduler
• Manually
• SDK : .NET, Python
• REST API
• PowerShell
43. dataredkite.com
premiseo.com
Azure Data Factory
27/04/2021 Meetup Azure Lille 43
• Serverless Data Integration service
• Integration runtime
• Compute infrastructure used by ADF to provide data integration
• Azure : Serverless
• Self Hosted : Onprem or Azure Virtual Machine (Windows)
• SSIS
Activity Features
Azure Data Flow
Data Copy
Dispatch Activity (HDI, Databricks, SQL …)
Cloud to Cloud data transfer/flows
Self-Hosted Data Flow
Data Copy
Dispatch Activity (HDI, Databricks, SQL …)
OnPrem or Virtual Machine deployment (Windows)
OnPrem <-> Cloud data transfer/flows
When connectors are not available
SSIS SSIS Package execution Private or public Network
44. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 44
Big Data
Compute
Fast, easy, and collaborative
Apache Spark-based analytics
platform
Azure Databricks
HDInsight supports the latest open
source projects from the Apache
Hadoop and Spark ecosystems.
Azure HDInsight
Managed Enterprise
Datawarehouse and BigData
Analytics service
Azure Synapse Analytics
45. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 45
Big Data
Compute
Fast, easy, and collaborative
Apache Spark-based analytics
platform
Azure Databricks
HDInsight supports the latest open
source projects from the Apache
Hadoop and Spark ecosystems.
Azure HDInsight
Managed Enterprise
Datawarehouse and BigData
Analytics service
Azure Synapse Analytics
47. dataredkite.com
premiseo.com
Azure Databricks
26/02/2021 47
Azure Databricks is a data analytics platform optimized for the Microsoft Azure cloud services platform. Azure Databricks
offers two environments for developing data intensive applications:
o Azure Databricks Workspace: provides an interactive workspace that enables collaboration between data engineers,
data scientists, and machine learning engineers.
o Azure Databricks SQL Analytics: provides an easy-to-use platform for analysts who want to run SQL queries on their
data lake, create multiple visualization types to explore query results from different perspectives, and build and share
dashboards.
48. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 48
Big Data
Compute
Fast, easy, and collaborative
Apache Spark-based analytics
platform
Azure Databricks
HDInsight supports the latest open
source projects from the Apache
Hadoop and Spark ecosystems.
Azure HDInsight
Managed Enterprise
Datawarehouse and BigData
Analytics service
Azure Synapse Analytics
49. dataredkite.com
premiseo.com
Azure HDInsights
27/04/2021 Meetup Azure Lille 49
• Managed Hadoop distribution for Azure
• Based on Cloudera Hortonworks hadoop distribution
• Comes in various flavours / shapes (VM shapes and number)
• Hadoop : General purpose (HDFS, Yarn, MapReduce, Hive, Pig, Sqoop, Oozie)
• Spark
• Kafka
• HBase
• Hive / LLAP (Interactive Query)
• Storm (Stream processing)
• ML Services with R
51. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 51
Big Data
Compute
Fast, easy, and collaborative
Apache Spark-based analytics
platform
Azure Databricks
HDInsight supports the latest open
source projects from the Apache
Hadoop and Spark ecosystems.
Azure HDInsigth
Managed Enterprise
Datawarehouse and BigData
Analytics service
Azure Synapse Analytics
53. dataredkite.com
premiseo.com
Azure Synapse Analytics
27/04/2021 Meetup Azure Lille 53
MPP
Datawarehou
se
Choice of language (T-SQL, Spark
SQL, Python, Scala, .Net)
Analytics ready (Analysis Services,
Power BI)
Data Science and AI Ready (Azure
Machine Learning integration)
58. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 58
Big Data
Streaming
Real-time data stream processing
from millions of IoT devices
Azure Stream Analytics
Connect, monitor and manage
billions of IoT assets
Azure IoT Hub
Real-time data stream with Kafka
Azure HDInsigth & Kafka
Use Spark Streaming with
Databricks
Spark Streaming with Databricks
59. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 59
Big Data
Streaming
Real-time data stream processing
from millions of IoT devices
Azure Stream Analytics
Connect, monitor and manage
billions of IoT assets
Azure IoT Hub
Real-time data stream with Kafka
Azure HDInsigth & Kafka
Use Spark Streaming with
Databricks
Spark Streaming with Databricks
61. dataredkite.com
premiseo.com
Azure Streaming Analytics
26/02/2021 61
o Azure Stream Analytics supports user-defined functions (UDF) or user-defined aggregates (UDA) in JavaScript for cloud jobs and C# for IoT
Edge jobs
UDFs, UDAs, and custom deserializers:
o Analyze real-time telemetry streams from IoT devices
o Web logs/clickstream analytics
o Geospatial analytics for fleet management and driverless vehicles
o Remote monitoring and predictive maintenance of high value assets
o Real-time analytics on Point of Sale data for inventory control and anomaly detection
Examples scenarios:
62. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 62
Big Data
Streaming
Real-time data stream processing
from millions of IoT devices
Azure Stream Analytics
Connect, monitor and manage
billions of IoT assets
Azure IoT Hub
Real-time data stream with Kafka
Azure HDInsigth & Kafka
Use Spark Streaming with
Databricks
Spark Streaming with Databricks
63. dataredkite.com
premiseo.com
Azure Iot Hub
63
Azure IoT Hub :
o The cloud gateway that connects IoT devices to gather data and drive business insights and automation.
o The big data streaming service of Azure. It is designed for high throughput data streaming scenarios where customers
may send billions of requests per day.
o Bi-directional communication capabilities
64. dataredkite.com
premiseo.com
Iot Hub or Event Hubs
64
IoT Hub was developed to address the unique requirements of connecting IoT devices to the Azure cloud while Event Hubs
was designed for big data streaming. Microsoft recommends using Azure IoT Hub to connect IoT devices to Azure.
IoT Capability IoT Hub standard tier IoT Hub basic tier Event Hubs
Device-to-cloud messaging
Protocols: HTTPS, AMQP, AMQP over webSockets
Protocols: MQTT, MQTT over webSockets
Per-device identity
File upload from devices
Device Provisioning Service
Cloud-to-device messaging
Device twin and device management
Device streams (preview)
IoT Edge
65. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 65
Big Data
Streaming
Real-time data stream processing
from millions of IoT devices
Azure Stream Analytics
Connect, monitor and manage
billions of IoT assets
Azure IoT Hub
Real-time data stream with Kafka
Azure HDInsigth & Kafka
Connect, monitor and manage
billions of IoT assets
Spark Streaming with Databricks
67. premiseo.com dataredkite.com
26/02/2021 Meetup Azure Lille 67
Big Data
Streaming
Real-time data stream processing
from millions of IoT devices
Azure Stream Analytics
Connect, monitor and manage
billions of IoT assets
Azure IoT Hub
Real-time data stream with Kafka
Azure HDInsigth & Kafka
Use Spark Streaming with
Databricks
Spark Streaming with Databricks
68. dataredkite.com
premiseo.com
Azure Databricks
26/02/2021 68
o Apache Spark Streaming is a scalable fault-tolerant streaming processing system that natively supports both batch and
streaming workloads.
o Spark Streaming is an extension of the core Spark API
70. dataredkite.com
premiseo.com
Azure Data Studio
26/02/2021 70
Azure Data Studio is a cross-platform database tool that you can run on Windows, macOS, and Linux. You'll use it to
connect to SQL Data Warehouse and Azure SQL Database.
Previously released under the preview name SQL Operations Studio, Azure Data Studio offers a modern editor experience
with IntelliSense, code snippets, source control integration, and an integrated terminal. It is engineered with the data
platform user in mind, with built in charting of query result sets and customizable dashboards.
71. dataredkite.com
premiseo.com
Storage Explorer
26/02/2021 71
Begin by downloading and installing Storage Explorer. You can use Storage Explorer to do several operations against data
in your Azure Storage account and data lake:
o Upload files or folders from your local computer into Azure Storage.
o Download cloud-based data to your local computer.
o Copy or move files and folders around in the storage account.
o Delete data from the storage account.
72. dataredkite.com
premiseo.com
Visual Studio Code
26/02/2021 72
Visual Studio Code is a lightweight source code editor which runs on your desktop and is available for Windows, macOS
and Linux. It comes with built-in support for JavaScript, TypeScript and Node.js and has a rich ecosystem of extensions for
other languages (such as C++, C#, Java, Python, PHP, Go) and runtimes (such as .NET and Unity).
74. dataredkite.com
premiseo.com
Summary
26/02/2021 74
Scenario Some recommended solutions
Disaster Recovery Azure geo-redundant backups
Read Scale Use read-only replicas to load balance read-only query
workloads (preview)
ETL (OLTP to OLAP) Azure Data Factory or SQL Server Integration Services or
Databricks
Migration from on-premises SQL Server to Azure SQL
Database
Azure Database Migration Service
Kept up-to-date across several Azure SQL databases or SQL
Server database
Azure SQL Data Sync
Detecting compatibility issues that can impact database
functionality in your new version of SQL Server or Azure SQL
Database
Data Migration Assistant (DMA)
77. premiseo.com dataredkite.com
26/02/2021 77
Just few sources in Microsoft Learn:
o Azure for the Data Engineer
o Store data in Azure
o Work with relational data in Azure
o Large Scale Data Processing with Azure Data Lake Storage Gen2
o Implement a Data Streaming Solution with Azure Streaming Analytics
o Implement a Data Warehouse with Azure SQL Data Warehouse
Sources