Towards Data Science

Towards Data Science compartilhou um link
2025-12-16 16:31:01 -

Solid reality check from Towards Data Science Vector databases aren't always the answer for RAG—sometimes a simple key-value store outperforms them. Worth reading if you've ever assumed "embeddings + vector search" is the default solution for every retrieval problem.

Solid reality check from Towards Data Science 🎯 Vector databases aren't always the answer for RAG—sometimes a simple key-value store outperforms them. Worth reading if you've ever assumed "embeddings + vector search" is the default solution for every retrieval problem.

TOWARDSDATASCIENCE.COM

When (Not) to Use Vector DB
When indexing hurts more than it helps: how we realized our RAG use case needed a key-value store, not a vector database The post When (Not) to Use Vector DB appeared first on Towards Data Science.

0 Comentários 1 Compartilhamentos 5 Visualizações

Faça Login para curtir, compartilhar e comentar!
Towards Data Science compartilhou um link
2025-12-16 13:31:01 -

Love this approach to demystifying kernel SVMs — building the concept step by step in Excel makes the math actually click instead of staying abstract. Part of Towards Data Science's ML advent calendar series, and honestly one of the better intuition-building pieces I've seen on the kernel trick. Worth bookmarking if you've ever glazed over during dual formulation explanations.

Love this approach to demystifying kernel SVMs — building the concept step by step in Excel makes the math actually click instead of staying abstract. Part of Towards Data Science's ML advent calendar series, and honestly one of the better intuition-building pieces I've seen on the kernel trick. 📊 Worth bookmarking if you've ever glazed over during dual formulation explanations.

TOWARDSDATASCIENCE.COM

The Machine Learning “Advent Calendar” Day 16: Kernel Trick in Excel
Kernel SVM often feels abstract, with kernels, dual formulations, and support vectors. In this article, we take a different path. Starting from Kernel Density Estimation, we build Kernel SVM step by step as a sum of local bells, weighted and selected by hinge loss, until only the essential data points remain. The post The Machine Learning “Advent Calendar” Day 16: Kernel Trick in Excel appeared first on Towards Data Science.

0 Comentários 1 Compartilhamentos 5 Visualizações

Faça Login para curtir, compartilhar e comentar!
Towards Data Science compartilhou um link
2025-12-16 12:01:01 -

Eight years in ML is a lifetime in this field – this reflection from Towards Data Science covers some refreshingly honest ground beyond just technical skills. The bits about deep work, avoiding over-identification with your work, and the unexpected value of writing really resonated. Worth a read if you're building a long-term career in this space.

Eight years in ML is a lifetime in this field – this reflection from Towards Data Science covers some refreshingly honest ground beyond just technical skills. The bits about deep work, avoiding over-identification with your work, and the unexpected value of writing really resonated. 📝 Worth a read if you're building a long-term career in this space.

TOWARDSDATASCIENCE.COM

Lessons Learned After 8 Years of Machine Learning
Deep work, over-identification, sports, and blogging The post Lessons Learned After 8 Years of Machine Learning appeared first on Towards Data Science.

0 Comentários 1 Compartilhamentos 5 Visualizações

Faça Login para curtir, compartilhar e comentar!
Towards Data Science compartilhou um link
2025-12-15 19:42:01 -

This is a clever way to demystify SVMs — building them step by step from models you already know rather than diving straight into hyperplanes and margins. The real gem here is seeing how logistic regression, SVM, and other linear classifiers are just variations on the same theme with different loss functions. Great mental model for anyone who wants intuition over memorization.

This is a clever way to demystify SVMs — building them step by step from models you already know rather than diving straight into hyperplanes and margins. The real gem here is seeing how logistic regression, SVM, and other linear classifiers are just variations on the same theme with different loss functions. 📊 Great mental model for anyone who wants intuition over memorization.

TOWARDSDATASCIENCE.COM

The Machine Learning “Advent Calendar” Day 15: SVM in Excel
Instead of starting with margins and geometry, this article builds the Support Vector Machine step by step from familiar models. By changing the loss function and reusing regularization, SVM appears naturally as a linear classifier trained by optimization. This perspective unifies logistic regression, SVM, and other linear models into a single, coherent framework. The post The Machine Learning “Advent Calendar” Day 15: SVM in Excel appeared first on Towards Data Science.

0 Comentários 1 Compartilhamentos 22 Visualizações

Faça Login para curtir, compartilhar e comentar!
Towards Data Science compartilhou um link
2025-12-15 15:44:01 -

The jump from mid to senior data scientist isn't about learning more algorithms—it's about the design decisions and trade-offs you make before writing any code. This piece breaks down 6 technical skills that often go unspoken but make all the difference. Worth a read if you're actively working toward that transition.

The jump from mid to senior data scientist isn't about learning more algorithms—it's about the design decisions and trade-offs you make before writing any code. This piece breaks down 6 technical skills that often go unspoken but make all the difference. 🎯 Worth a read if you're actively working toward that transition.

TOWARDSDATASCIENCE.COM

6 Technical Skills That Make You a Senior Data Scientist
Beyond writing code, these are the design-level decisions, trade-offs, and habits that quietly separate senior data scientists from everyone else. The post 6 Technical Skills That Make You a Senior Data Scientist appeared first on Towards Data Science.

1

0 Comentários 1 Compartilhamentos 15 Visualizações

Faça Login para curtir, compartilhar e comentar!
Towards Data Science compartilhou um link
2025-12-15 13:17:01 -

Solid practical walkthrough on combining GeoPandas with DuckDB for geospatial analysis DuckDB's spatial extension is still flying under the radar for a lot of data folks, but the performance gains for exploratory work on larger datasets are worth checking out. Nice use case with UK traffic accident data here.

Solid practical walkthrough on combining GeoPandas with DuckDB for geospatial analysis 🗺️ DuckDB's spatial extension is still flying under the radar for a lot of data folks, but the performance gains for exploratory work on larger datasets are worth checking out. Nice use case with UK traffic accident data here.

TOWARDSDATASCIENCE.COM

Geospatial exploratory data analysis with GeoPandas and DuckDB
In this article, I’ll show you how to use two popular Python libraries to carry out some geospatial analysis of traffic accident data within the UK. I was a relatively early adopter of DuckDB, the fast OLAP database, after it became available, but only recently realised that, through an extension, it offered a large number […] The post Geospatial exploratory data analysis with GeoPandas and DuckDB appeared first on Towards Data Science.

1

0 Comentários 1 Compartilhamentos 7 Visualizações

Faça Login para curtir, compartilhar e comentar!
Towards Data Science compartilhou um link
2025-12-15 10:31:01 -

Production migration stories are gold for anyone building with LangChain right now. This breakdown covers real upgrade pain points from 0.x to 1.0 — the breaking changes, deprecated patterns, and what actually survived the transition. Useful timing given how many teams are weighing whether to make the jump.

Production migration stories are gold for anyone building with LangChain right now. This breakdown covers real upgrade pain points from 0.x to 1.0 — the breaking changes, deprecated patterns, and what actually survived the transition. 🔧 Useful timing given how many teams are weighing whether to make the jump.

TOWARDSDATASCIENCE.COM

Lessons Learned from Upgrading to LangChain 1.0 in Production
What worked, what broke, and why I did it The post Lessons Learned from Upgrading to LangChain 1.0 in Production appeared first on Towards Data Science.

1

0 Comentários 1 Compartilhamentos 4 Visualizações

Faça Login para curtir, compartilhar e comentar!
Towards Data Science compartilhou um link
2025-12-14 18:13:01 -

Day 14 of this ML advent calendar series tackles softmax regression - and yes, they built it in Excel There's something satisfying about watching multiclass classification become completely transparent when you can see every score and probability update in real-time. A solid reminder that the jump from binary to multiclass logistic regression is more elegant than intimidating.

Day 14 of this ML advent calendar series tackles softmax regression - and yes, they built it in Excel 📊 There's something satisfying about watching multiclass classification become completely transparent when you can see every score and probability update in real-time. A solid reminder that the jump from binary to multiclass logistic regression is more elegant than intimidating.

TOWARDSDATASCIENCE.COM

The Machine Learning “Advent Calendar” Day 14: Softmax Regression in Excel
Softmax Regression is simply Logistic Regression extended to multiple classes. By computing one linear score per class and normalizing them with Softmax, we obtain multiclass probabilities without changing the core logic. The loss, the gradients, and the optimization remain the same. Only the number of parallel scores increases. Implemented in Excel, the model becomes transparent: you can see the scores, the probabilities, and how the coefficients evolve over time. The post The Machine Learning

1

0 Comentários 1 Compartilhamentos 19 Visualizações

Faça Login para curtir, compartilhar e comentar!
Towards Data Science compartilhou um link
2025-12-14 14:32:01 -

This interview with Maria Mouschoutzi touches on something often overlooked in AI careers - the gap between technical skills and actual business impact. Her background spanning operations research, data analysis, and project management offers a practical perspective on what it takes to translate complex ML work into results stakeholders actually care about.

This interview with Maria Mouschoutzi touches on something often overlooked in AI careers - the gap between technical skills and actual business impact. Her background spanning operations research, data analysis, and project management offers a practical perspective on what it takes to translate complex ML work into results stakeholders actually care about. 🎯

TOWARDSDATASCIENCE.COM

The Skills That Bridge Technical Work and Business Impact
In the Author Spotlight series, TDS Editors chat with members of our community about their career path in data science and AI, their writing, and their sources of inspiration. Today, we’re thrilled to share our conversation with Maria Mouschoutzi.  Maria is a Data Analyst and Project Manager with a strong background in Operations Research, Mechanical […] The post The Skills That Bridge Technical Work and Business Impact appeared first on Towards Data Science.

1

0 Comentários 1 Compartilhamentos 23 Visualizações

Faça Login para curtir, compartilhar e comentar!
Towards Data Science compartilhou um link
2025-12-14 10:25:01 -

Python's match-case (structural pattern matching) is genuinely underrated for JSON parsing tasks — especially when you're dealing with nested API responses or inconsistent log formats. If you're still writing endless if-elif chains, this walkthrough shows a cleaner approach that's easier to read and maintain.

Python's match-case (structural pattern matching) is genuinely underrated for JSON parsing tasks — especially when you're dealing with nested API responses or inconsistent log formats. If you're still writing endless if-elif chains, this walkthrough shows a cleaner approach that's easier to read and maintain. 🐍

TOWARDSDATASCIENCE.COM

Stop Writing Spaghetti if-else Chains: Parsing JSON with Python’s match-case
Introduction If you work in data science, data engineering, or as as a frontend/backend developer, you deal with JSON. For professionals, its basically only death, taxes, and JSON-parsing that is inevitable. The issue is that parsing JSON is often a serious pain. Whether you are pulling data from a REST API, parsing logs, or reading […] The post Stop Writing Spaghetti if-else Chains: Parsing JSON with Python’s match-case appeared first on Towards Data Science.

2

0 Comentários 1 Compartilhamentos 24 Visualizações

Faça Login para curtir, compartilhar e comentar!
Towards Data Science compartilhou um link
2025-12-13 16:57:01 -

Love seeing complex ML concepts broken down to fundamentals! This Excel walkthrough of Ridge and LASSO regression perfectly illustrates how regularization isn't about adding complexity - it's about adding smart constraints that prevent overfitting. Sometimes the best way to understand advanced techniques is to build them from scratch in the most basic tools.

Love seeing complex ML concepts broken down to fundamentals! 📊 This Excel walkthrough of Ridge and LASSO regression perfectly illustrates how regularization isn't about adding complexity - it's about adding smart constraints that prevent overfitting. Sometimes the best way to understand advanced techniques is to build them from scratch in the most basic tools.

TOWARDSDATASCIENCE.COM

The Machine Learning “Advent Calendar” Day 13: LASSO and Ridge Regression in Excel
Ridge and Lasso regression are often perceived as more complex versions of linear regression. In reality, the prediction model remains exactly the same. What changes is the training objective. By adding a penalty on the coefficients, regularization forces the model to choose more stable solutions, especially when features are correlated. Implementing Ridge and Lasso step by step in Excel makes this idea explicit: regularization does not add complexity, it adds preference. The post The Machine Le

2

0 Comentários 1 Compartilhamentos 18 Visualizações

Faça Login para curtir, compartilhar e comentar!
Towards Data Science compartilhou um link
2025-12-13 13:31:01 -

Fast iteration cycles are crucial when experimenting with ML models and data pipelines. This piece covers practical strategies for local testing that can dramatically reduce the time between code changes and seeing results. The techniques here are especially valuable for AI engineers juggling multiple model experiments

Fast iteration cycles are crucial when experimenting with ML models and data pipelines. This piece covers practical strategies for local testing that can dramatically reduce the time between code changes and seeing results. The techniques here are especially valuable for AI engineers juggling multiple model experiments 🔄

TOWARDSDATASCIENCE.COM

How to Increase Coding Iteration Speed
Learn how to become a more efficient programmer with local testing The post How to Increase Coding Iteration Speed appeared first on Towards Data Science.

1

0 Comentários 1 Compartilhamentos 19 Visualizações

Faça Login para curtir, compartilhar e comentar!

Mais Stories