Official Page · This page represents an organization and can be claimed by its official representatives.
Claim this Page
Towards Data Science is a Medium publication sharing concepts, ideas and codes about data science.
-
2 pessoas curtiram isso
-
13 Publicações
-
0 fotos
-
0 Vídeos
-
Anterior
-
Neural Networks & Deep Learning
Recent Posts
-
Solid reality check from Towards Data Science Vector databases aren't always the answer for RAG—sometimes a simple key-value store outperforms them. Worth reading if you've ever assumed "embeddings + vector search" is the default solution for every retrieval problem.Solid reality check from Towards Data Science 🎯 Vector databases aren't always the answer for RAG—sometimes a simple key-value store outperforms them. Worth reading if you've ever assumed "embeddings + vector search" is the default solution for every retrieval problem.
TOWARDSDATASCIENCE.COMWhen (Not) to Use Vector DBWhen indexing hurts more than it helps: how we realized our RAG use case needed a key-value store, not a vector database The post When (Not) to Use Vector DB appeared first on Towards Data Science.0 Comentários 1 Compartilhamentos 5 VisualizaçõesFaça Login para curtir, compartilhar e comentar! -
Love this approach to demystifying kernel SVMs — building the concept step by step in Excel makes the math actually click instead of staying abstract. Part of Towards Data Science's ML advent calendar series, and honestly one of the better intuition-building pieces I've seen on the kernel trick. Worth bookmarking if you've ever glazed over during dual formulation explanations.Love this approach to demystifying kernel SVMs — building the concept step by step in Excel makes the math actually click instead of staying abstract. Part of Towards Data Science's ML advent calendar series, and honestly one of the better intuition-building pieces I've seen on the kernel trick. 📊 Worth bookmarking if you've ever glazed over during dual formulation explanations.
TOWARDSDATASCIENCE.COMThe Machine Learning “Advent Calendar” Day 16: Kernel Trick in ExcelKernel SVM often feels abstract, with kernels, dual formulations, and support vectors. In this article, we take a different path. Starting from Kernel Density Estimation, we build Kernel SVM step by step as a sum of local bells, weighted and selected by hinge loss, until only the essential data points remain. The post The Machine Learning “Advent Calendar” Day 16: Kernel Trick in Excel appeared first on Towards Data Science.0 Comentários 1 Compartilhamentos 5 Visualizações -
Eight years in ML is a lifetime in this field – this reflection from Towards Data Science covers some refreshingly honest ground beyond just technical skills. The bits about deep work, avoiding over-identification with your work, and the unexpected value of writing really resonated. Worth a read if you're building a long-term career in this space.Eight years in ML is a lifetime in this field – this reflection from Towards Data Science covers some refreshingly honest ground beyond just technical skills. The bits about deep work, avoiding over-identification with your work, and the unexpected value of writing really resonated. 📝 Worth a read if you're building a long-term career in this space.
TOWARDSDATASCIENCE.COMLessons Learned After 8 Years of Machine LearningDeep work, over-identification, sports, and blogging The post Lessons Learned After 8 Years of Machine Learning appeared first on Towards Data Science.0 Comentários 1 Compartilhamentos 5 Visualizações -
This is a clever way to demystify SVMs — building them step by step from models you already know rather than diving straight into hyperplanes and margins. The real gem here is seeing how logistic regression, SVM, and other linear classifiers are just variations on the same theme with different loss functions. Great mental model for anyone who wants intuition over memorization.This is a clever way to demystify SVMs — building them step by step from models you already know rather than diving straight into hyperplanes and margins. The real gem here is seeing how logistic regression, SVM, and other linear classifiers are just variations on the same theme with different loss functions. 📊 Great mental model for anyone who wants intuition over memorization.
TOWARDSDATASCIENCE.COMThe Machine Learning “Advent Calendar” Day 15: SVM in ExcelInstead of starting with margins and geometry, this article builds the Support Vector Machine step by step from familiar models. By changing the loss function and reusing regularization, SVM appears naturally as a linear classifier trained by optimization. This perspective unifies logistic regression, SVM, and other linear models into a single, coherent framework. The post The Machine Learning “Advent Calendar” Day 15: SVM in Excel appeared first on Towards Data Science.0 Comentários 1 Compartilhamentos 22 Visualizações -
The jump from mid to senior data scientist isn't about learning more algorithms—it's about the design decisions and trade-offs you make before writing any code. This piece breaks down 6 technical skills that often go unspoken but make all the difference. Worth a read if you're actively working toward that transition.The jump from mid to senior data scientist isn't about learning more algorithms—it's about the design decisions and trade-offs you make before writing any code. This piece breaks down 6 technical skills that often go unspoken but make all the difference. 🎯 Worth a read if you're actively working toward that transition.
TOWARDSDATASCIENCE.COM6 Technical Skills That Make You a Senior Data ScientistBeyond writing code, these are the design-level decisions, trade-offs, and habits that quietly separate senior data scientists from everyone else. The post 6 Technical Skills That Make You a Senior Data Scientist appeared first on Towards Data Science.0 Comentários 1 Compartilhamentos 15 Visualizações1
-
Solid practical walkthrough on combining GeoPandas with DuckDB for geospatial analysis DuckDB's spatial extension is still flying under the radar for a lot of data folks, but the performance gains for exploratory work on larger datasets are worth checking out. Nice use case with UK traffic accident data here.Solid practical walkthrough on combining GeoPandas with DuckDB for geospatial analysis 🗺️ DuckDB's spatial extension is still flying under the radar for a lot of data folks, but the performance gains for exploratory work on larger datasets are worth checking out. Nice use case with UK traffic accident data here.
TOWARDSDATASCIENCE.COMGeospatial exploratory data analysis with GeoPandas and DuckDBIn this article, I’ll show you how to use two popular Python libraries to carry out some geospatial analysis of traffic accident data within the UK. I was a relatively early adopter of DuckDB, the fast OLAP database, after it became available, but only recently realised that, through an extension, it offered a large number […] The post Geospatial exploratory data analysis with GeoPandas and DuckDB appeared first on Towards Data Science.0 Comentários 1 Compartilhamentos 7 Visualizações1
-
Production migration stories are gold for anyone building with LangChain right now. This breakdown covers real upgrade pain points from 0.x to 1.0 — the breaking changes, deprecated patterns, and what actually survived the transition. Useful timing given how many teams are weighing whether to make the jump.Production migration stories are gold for anyone building with LangChain right now. This breakdown covers real upgrade pain points from 0.x to 1.0 — the breaking changes, deprecated patterns, and what actually survived the transition. 🔧 Useful timing given how many teams are weighing whether to make the jump.
TOWARDSDATASCIENCE.COMLessons Learned from Upgrading to LangChain 1.0 in ProductionWhat worked, what broke, and why I did it The post Lessons Learned from Upgrading to LangChain 1.0 in Production appeared first on Towards Data Science.0 Comentários 1 Compartilhamentos 4 Visualizações1
-
Day 14 of this ML advent calendar series tackles softmax regression - and yes, they built it in Excel There's something satisfying about watching multiclass classification become completely transparent when you can see every score and probability update in real-time. A solid reminder that the jump from binary to multiclass logistic regression is more elegant than intimidating.Day 14 of this ML advent calendar series tackles softmax regression - and yes, they built it in Excel 📊 There's something satisfying about watching multiclass classification become completely transparent when you can see every score and probability update in real-time. A solid reminder that the jump from binary to multiclass logistic regression is more elegant than intimidating.
TOWARDSDATASCIENCE.COMThe Machine Learning “Advent Calendar” Day 14: Softmax Regression in ExcelSoftmax Regression is simply Logistic Regression extended to multiple classes. By computing one linear score per class and normalizing them with Softmax, we obtain multiclass probabilities without changing the core logic. The loss, the gradients, and the optimization remain the same. Only the number of parallel scores increases. Implemented in Excel, the model becomes transparent: you can see the scores, the probabilities, and how the coefficients evolve over time. The post The Machine Learning0 Comentários 1 Compartilhamentos 19 Visualizações1
-
This interview with Maria Mouschoutzi touches on something often overlooked in AI careers - the gap between technical skills and actual business impact. Her background spanning operations research, data analysis, and project management offers a practical perspective on what it takes to translate complex ML work into results stakeholders actually care about.This interview with Maria Mouschoutzi touches on something often overlooked in AI careers - the gap between technical skills and actual business impact. Her background spanning operations research, data analysis, and project management offers a practical perspective on what it takes to translate complex ML work into results stakeholders actually care about. 🎯
TOWARDSDATASCIENCE.COMThe Skills That Bridge Technical Work and Business ImpactIn the Author Spotlight series, TDS Editors chat with members of our community about their career path in data science and AI, their writing, and their sources of inspiration. Today, we’re thrilled to share our conversation with Maria Mouschoutzi.  Maria is a Data Analyst and Project Manager with a strong background in Operations Research, Mechanical […] The post The Skills That Bridge Technical Work and Business Impact appeared first on Towards Data Science.0 Comentários 1 Compartilhamentos 23 Visualizações1
-
Python's match-case (structural pattern matching) is genuinely underrated for JSON parsing tasks — especially when you're dealing with nested API responses or inconsistent log formats. If you're still writing endless if-elif chains, this walkthrough shows a cleaner approach that's easier to read and maintain.Python's match-case (structural pattern matching) is genuinely underrated for JSON parsing tasks — especially when you're dealing with nested API responses or inconsistent log formats. If you're still writing endless if-elif chains, this walkthrough shows a cleaner approach that's easier to read and maintain. 🐍
TOWARDSDATASCIENCE.COMStop Writing Spaghetti if-else Chains: Parsing JSON with Python’s match-caseIntroduction If you work in data science, data engineering, or as as a frontend/backend developer, you deal with JSON. For professionals, its basically only death, taxes, and JSON-parsing that is inevitable. The issue is that parsing JSON is often a serious pain. Whether you are pulling data from a REST API, parsing logs, or reading […] The post Stop Writing Spaghetti if-else Chains: Parsing JSON with Python’s match-case appeared first on Towards Data Science.0 Comentários 1 Compartilhamentos 24 Visualizações2
-
Love seeing complex ML concepts broken down to fundamentals! This Excel walkthrough of Ridge and LASSO regression perfectly illustrates how regularization isn't about adding complexity - it's about adding smart constraints that prevent overfitting. Sometimes the best way to understand advanced techniques is to build them from scratch in the most basic tools.Love seeing complex ML concepts broken down to fundamentals! 📊 This Excel walkthrough of Ridge and LASSO regression perfectly illustrates how regularization isn't about adding complexity - it's about adding smart constraints that prevent overfitting. Sometimes the best way to understand advanced techniques is to build them from scratch in the most basic tools.
TOWARDSDATASCIENCE.COMThe Machine Learning “Advent Calendar” Day 13: LASSO and Ridge Regression in ExcelRidge and Lasso regression are often perceived as more complex versions of linear regression. In reality, the prediction model remains exactly the same. What changes is the training objective. By adding a penalty on the coefficients, regularization forces the model to choose more stable solutions, especially when features are correlated. Implementing Ridge and Lasso step by step in Excel makes this idea explicit: regularization does not add complexity, it adds preference. The post The Machine Le0 Comentários 1 Compartilhamentos 18 Visualizações2
-
Fast iteration cycles are crucial when experimenting with ML models and data pipelines. This piece covers practical strategies for local testing that can dramatically reduce the time between code changes and seeing results. The techniques here are especially valuable for AI engineers juggling multiple model experimentsFast iteration cycles are crucial when experimenting with ML models and data pipelines. This piece covers practical strategies for local testing that can dramatically reduce the time between code changes and seeing results. The techniques here are especially valuable for AI engineers juggling multiple model experiments 🔄
TOWARDSDATASCIENCE.COMHow to Increase Coding Iteration SpeedLearn how to become a more efficient programmer with local testing The post How to Increase Coding Iteration Speed appeared first on Towards Data Science.0 Comentários 1 Compartilhamentos 19 Visualizações1
Mais Stories