Cloud Analytics & ML with Sam Taha |

Adventures in Big Data, Analytics and ML

Tuesday, November 22, 2022

The Web3 and Blockchain Wasteland

›
  Bubbles happen. But you hope what is left behind after the bubble supernova is at least useful to science or society. Like the dot.com bub...
Friday, December 31, 2021

Machine Learning is Not AI

›
We need to stop referring to today's machine learning as AI. It is marketing techno spin no more and no less. There is nothing intellige...
Friday, November 26, 2021

Modern Cloud Data Lake/Warehouse: Don't get Locked-in All Over Again

›
When relational databases, data warehouses, and data marts took root in the late 1990s, our data and our database systems were just down the...
Thursday, April 15, 2021

Data Driven vs Data Model Driven Company

›
Somehow along the way data lakes got the rap that you can dump "anything" into them. I think this is carry over from the failed hi...
Thursday, January 21, 2021

The AI Lesson for All of Us

›
There is no doubt that the brute force ML (aka deep learning) approach to achieve general AI or some level of human decision making by using...
Wednesday, December 16, 2020

2021 Data and Analytics Predictions

›
Cloud data platforms really gained momentum in 2020. It has been a real breakout year for both cloud data lakes and cloud data warehouses (y...
Thursday, November 26, 2020

Are Open Cloud Data Lakes the Future?

›
  Building a cloud data platform? First question: open Data Lake or proprietary DW or maybe a mix of both? Not a simple question or architec...
Wednesday, August 12, 2020

The Lost Art of Data Lineage

›
  Maybe it is more of mangled and ill defined art than a lost art. Data lineage is one of the aspects of data governance that gets lost in ...
Thursday, July 2, 2020

Tuning the Snowflake Data Cloud

›
To be clear, I do not classify Snowflake as an OLAP or MPP database. It has these capabilities for sure, but being born in the cloud and onl...
Thursday, June 11, 2020

Is ML Curve Fitting The Best We Got?

›
Curve Fitting is for the most part what most machine learning boils down to, not that that is a bad thing. How do go be beyond the correlati...
Friday, June 5, 2020

Choosing an ML Cloud Platform: GCP vs AWS

›
ML cloud services are evolving fast and furious. GCP and AWS are the leading players. Here is a quick visual peak at both ML tech stacks. AW...
Tuesday, June 2, 2020

Cloud OLAP: Choosing between Redshift, Snowflake, BigQuery or other?

›
Which to choose for your cloud OLAP engine? There are a lot of choices when it comes to cloud based analytics engines. All the major clouds ...
Friday, January 24, 2020

Why Spark is the Wrong Abstraction

›
Is the sun setting on Spark? I don't want to knock Spark and frameworks like it, they have had their moment in the sun. Spark was a r...
Tuesday, January 21, 2020

Data Lakes before AI/ML/Analytics (cart before horse thing)

›
Don't start or continue your AI and predictive analytics journey without building the necessary data infrastructure underpinnings. An...
Monday, September 9, 2019

Know Where Your Data Lake Has Been?

›
The foundation of a good data management strategy is based on number skills including data policies, best practices, and technology/...
›
Home
View web version
Powered by Blogger.