How can you compare and select the best ML algorithms?
This is a common question faced by many data scientists and machine learning practitioners when working on a project. Many factors can influence the performance and suitability of different algorithms, such as the characteristics of the data, the problem domain, the evaluation metrics, and the computational resources. In this article, you will learn some general steps and tips to help you compare and select the best ML algorithms for your task.
The first step is to define your goal and what you want to achieve with your ML model. This will help you narrow down the algorithms that can address your problem and the criteria you will use to evaluate them. For example, if your goal is to classify images into different categories, you will need an algorithm that can handle complex, high-dimensional data, such as a convolutional neural network (CNN). If your goal is to predict the price of a house based on its features, you will need an algorithm that can perform regression, such as linear regression or a decision tree.
-
There is no "one size fits all" approach for selecting the best model or assessing its performance. In many cases, statistical metrics based on error rates will be good enough. In most cases, it is worth digging deeper and analyzing the consequences of the decision-making that relies on the machine learning tool. This is particularly true when there is a significant imbalance between the consequences of errors, some errors being benign while others trigger severe negative consequences. This is the case in various domains, like computing the maximum load of a network, predicting probability of default, ... A good knowledge of the field for which ML is applied is the best safeguard to avoid choosing a wrong performance metric.
-
Selecting the best machine learning (ML) model for a particular task involves a combination of understanding the problem, exploring the data, and experimenting with different algorithms. 1. Clearly understand the problem you are trying to solve. 2. Understand the distribution of classes in classification tasks or of the target variable in regression tasks. 3. Select appropriate evaluation metrics based on the nature of your problem (accuracy, precision, recall, F1 score for classification; mean squared error, R-squared for regression, etc.), and consider business-specific metrics if applicable. 4. Consider the trade-offs between different models, such as model complexity and interpretability.
-
In general, we compare learning algorithms based on their error rates, but it is important to remember that in reality, error is just one of the factors that influence our decision. Other criteria are: risks when errors are weighted with loss functions instead of 0/1 loss; training time and space complexity; testing time and space complexity; interpretability, namely, whether the method allows knowledge extraction that can be checked and validated by experts; and easy programmability.
-
The initial step in machine learning is defining your goal and desired outcomes for your model. This process helps narrow down suitable algorithms and criteria for evaluation. For instance, classifying images might require a convolutional neural network (CNN) for complex data, while predicting house prices might involve regression methods like linear regression or decision trees.
-
Generally, comparing algorithms is guided by the following steps: 1. Identify the type of problem to be solved (classification, regression, clustering, etc.). 2. Determine the success metric (both technical and business). 3. Shortlist a number of ML algorithms to train and benchmark against each other. 4. Measure how well these algorithms perform on an out-of-fold set using the metrics you identified in step 2. 5. Pick the ML algorithm that satisfies the success requirement. 6. Optimize the chosen algorithm through hyperparameter optimization. 7. Test and understand why the model makes its choices; here you can use frameworks like LIME/SHAP (model explainability).
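As a rough sketch of steps 3 to 5 above, the benchmark loop might look like the following. It assumes scikit-learn is available; the synthetic dataset, candidate list, and F1 metric are illustrative placeholders, not a prescription.

```python
# Sketch: benchmark several candidate algorithms with cross-validation.
# Dataset, candidates, and metric are placeholders for your own choices.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=42)

candidates = {
    "logistic_regression": LogisticRegression(max_iter=1000),
    "decision_tree": DecisionTreeClassifier(random_state=42),
    "random_forest": RandomForestClassifier(random_state=42),
}

# Step 4: measure each algorithm on held-out folds with the chosen metric.
scores = {
    name: cross_val_score(model, X, y, cv=5, scoring="f1").mean()
    for name, model in candidates.items()
}

# Step 5: pick the algorithm that best satisfies the success requirement.
best = max(scores, key=scores.get)
print(best, round(scores[best], 3))
```

The same loop extends naturally to step 6 by wrapping the winner in a hyperparameter search.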
The second step is to explore your data and understand its characteristics, such as its size, shape, distribution, quality, and features. This will help you choose algorithms that are compatible with your data and can handle its challenges, such as missing values, outliers, imbalances, or noise. For example, if your data is large and sparse, you may want to use an algorithm that scales well and can reduce dimensionality, such as a support vector machine (SVM) or principal component analysis (PCA). If your data is small and noisy, you may want to use an algorithm that can avoid overfitting and regularize the model, such as logistic regression or a random forest.
-
Data engineering is the process of preparing and transforming the data for the machine learning model. - A data quality assessment report evaluates the quality of the data and identifies and quantifies data issues. - A data strategy defines the criteria and logic for selecting or excluding data sources based on relevance, availability, reliability, and diversity. - A data engineering design specifies the steps and rules for performing data engineering tasks, such as data cleansing and data transformation, and ensures the pipeline is scalable.
-
Whenever data is limited (e.g., labelled health data can be expensive), it is worth checking which samples the model considers more "uncertain" before obtaining their annotations. With which level of confidence was a given sample classified? By selecting low-confidence samples, we can induce a more significant impact on the learning process, which is equivalent to achieving the same performance with less data, because we are discarding samples that are considered "redundant". For those, we opt not to obtain annotations, reducing cost.
-
To effectively compare and select ML algorithms, it's essential to deeply understand your data. Start by analyzing data characteristics using tools like matplotlib or seaborn. This analysis will highlight important features such as distribution, outliers, or missing values, guiding your algorithm choice. For example, tree-based models are great for non-linear data, while simpler models suffice for linear relationships. The data’s complexity and size also influence this choice; large datasets might need robust methods like ensemble models or deep learning. The key is a comprehensive data understanding, guiding you to the right algorithm.
-
By analyzing various aspects of your data, like its size, shape, distribution, quality, and features, you can identify challenges in your data, such as missing values, outliers, imbalances, or noise. The characteristics of your data will guide you in choosing algorithms that are compatible with its specific nature. For instance, if you're working with large and sparse data, algorithms like support vector machines (SVM) or principal component analysis (PCA) may be suitable due to their ability to scale well and reduce dimensionality. On the other hand, if your data is small and noisy, algorithms like logistic regression or random forest, which can prevent overfitting and provide regularization, might be more appropriate.
-
Don't underestimate this piece. It is true what they say: a large part of the work of being a data analyst or data scientist resides within this step. Understanding whether your data has unique characteristics - such as group imbalances, missing values, outliers, distributional skew - will affect how you clean and treat the data, which in turn plays a big part on the validity and efficacy of your model. Visualize your data, using value tables and histograms to get a sense of the distribution of your variables of interest. Determine your method for dealing with group imbalances, outliers, or missing data, and remember that you have to justify these methods in your final model.
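As a minimal sketch of the exploration described above, the checks below use pandas on a tiny made-up house-price table (the columns and values are purely illustrative). The 1.5 × IQR rule is one common, simple outlier heuristic, not the only option.

```python
# Sketch: quick exploratory checks before choosing an algorithm.
# The toy DataFrame stands in for your real dataset.
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "price": [250_000, 310_000, 180_000, np.nan, 2_000_000],
    "rooms": [3, 4, 2, 3, 10],
    "city": ["A", "A", "B", "B", "A"],
})

print(df.shape)                    # size and shape
print(df.isna().sum())             # missing values per column
print(df["city"].value_counts())   # group imbalance
print(df["price"].describe())      # distribution, possible outliers

# A simple outlier flag: values beyond 1.5 * IQR above the third quartile.
q1, q3 = df["price"].quantile([0.25, 0.75])
iqr = q3 - q1
outliers = df[df["price"] > q3 + 1.5 * iqr]
print(len(outliers))
```

On this toy table the checks surface one missing price and one extreme price, exactly the kind of finding that shapes cleaning and model choice.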
The third step is to select the metrics that will measure the performance and quality of your ML model. This will help you compare algorithms objectively and quantitatively, and choose the one that best fits your expectations and requirements. For example, if your metric is accuracy, you will want to choose the algorithm with the highest percentage of correct predictions. If your metric is precision, you will want the algorithm with the lowest rate of false positives. Other common metrics include recall, F1 score, the ROC curve, MSE, MAE, and R-squared.
-
When defining metrics, control metrics are often the overlooked part of the story. In complex systems with intelligent internal and external interactions, a performance metric should be supported or constrained by proper control metrics to foresee, apply, and monitor the desired effects.
-
Selecting the right metrics is crucial in comparing and selecting ML algorithms. Accuracy is a common starting point, but it's not always sufficient. In classification tasks with imbalanced classes, precision, recall, and F1-score provide deeper insights. For regression models, consider mean squared error or mean absolute error. In complex scenarios, custom metrics tailored to specific business objectives can be more informative. It's also important to consider computational efficiency, especially for large datasets or real-time applications. Ultimately, the choice of metrics should align with your project’s goals and the specific nature of the data at hand.
-
Selecting the best metric also depends on the goal and the data distribution. Accuracy, for example, can be misleading for certain applications. Suppose you have a dataset of 1,000 chest X-ray images, of which 10 are cancerous. If your ML model always gives negative predictions (no cancer), its accuracy is 99%! But it is neither significant nor useful.
-
Having a good validation strategy and solid evaluation metrics is important. Ensure that the model you choose is robust and reliable and correlates with the business metrics you intend to optimize with the machine learning solution. Choosing the correct evaluation schema, whether a simple train-test split or a complex cross-validation strategy, is the crucial first step of building any machine learning solution.
-
In assessing machine learning models, performance isn't solely about accuracy. While metrics like accuracy, precision, and recall matter, practical considerations such as latency and hardware usage are crucial. Achieving optimal accuracy must be balanced with the model's efficiency in terms of response time and resource utilization. This ensures that the selected model is not only accurate but also feasible and scalable for real-world deployment.
The fourth step is to split your data into separate subsets for training, validation, and testing. This will help you avoid overfitting and underfitting, and estimate the generalization ability of your ML model. For example, you can use a 70/15/15 split, where 70% of the data is used for training, 15% for validation, and 15% for testing. The training set is used to fit the model's parameters, the validation set to tune its hyperparameters, and the test set to evaluate the model's final performance.
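One common way to get a 70/15/15 split is two successive `train_test_split` calls (a sketch assuming scikit-learn; the synthetic dataset and ratios are illustrative, and `stratify` keeps class proportions in each subset):

```python
# Sketch: a 70/15/15 train/validation/test split via two splits.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, random_state=0)

# First split off ~15% as the test set, then carve the validation set
# out of the remaining 85% (15/85 of it, so ~15% of the original data).
X_tmp, X_test, y_tmp, y_test = train_test_split(
    X, y, test_size=0.15, stratify=y, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(
    X_tmp, y_tmp, test_size=0.15 / 0.85, stratify=y_tmp, random_state=0)

print(len(X_train), len(X_val), len(X_test))  # roughly 700 / 150 / 150
```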
-
In comparing and selecting ML algorithms, data splitting is crucial. Generally, data is divided into training, validation, and testing sets. Training builds the model, validation tunes parameters, and testing evaluates performance. This method assesses a model's generalization on unseen data. Common split ratios are 60-20-20 or 70-15-15. It's vital to ensure each set represents the whole dataset, especially in classification tasks. Techniques like stratified sampling help maintain class distribution. Proper splitting ensures fair, accurate ML algorithm comparison and robust model selection.
-
Generalization ability: data splitting allows you to estimate the generalization ability of your model by evaluating its performance on unseen data. This helps you assess whether the model has learned the underlying patterns in the data or is just memorizing the training data. Preventing overfitting and underfitting: by having separate training, validation, and testing sets, you can prevent overfitting and underfitting. The validation set helps in tuning the model's hyperparameters, while the testing set provides an unbiased evaluation of the model's performance on unseen data. Model selection: insights from the validation set can guide you in selecting the best model architecture and hyperparameters that result in optimal performance.
-
In the vast realm of machine learning, data splitting is a crucial choreography. However, with colossal datasets like 10 million images, the conventional 70/15/15 split may need adaptation. In this scenario, even a mere 1% for testing or validation yields substantial subsets for meaningful insights. Scaling the split percentage to the dataset's magnitude ensures resource efficiency and statistical robustness. Let the data split be a harmonious composition, finely tuned to unravel the grand symphony of machine learning in an optimal and judicious manner.
-
1. Training data: used to train the machine learning model; often around 60-80% of the total dataset. The model learns patterns, relationships, and features from this subset. 2. Validation data: used to assess and tune the model during training; usually around 10-20% of the total dataset (can vary). It supports hyperparameter tuning and helps prevent overfitting by providing feedback to adjust the model. 3. Testing data: used to evaluate the model's performance; typically around 10-20% of the total dataset (can vary). It is completely unseen by the model during training and validation, and is used to assess how well the model generalizes to new data.
-
If you're dealing with data with timestamps, which is common if your data has transactions or interactions or events, it's critical to use out-of-time validation (split the data into different non-overlapping periods for train, test, and validation), to avoid any potential leakage.
The fifth step is to train and test the algorithms using the data subsets and metrics you have selected. This will help you see how your algorithms perform under different data scenarios and how they compare to each other. For example, you can use a loop or a function to iterate over different algorithms, apply them to the same datasets, and store the results in a table or a plot. You can also use libraries or tools that automate this process, such as scikit-learn, TensorFlow, or AutoML.
-
Algorithm performance comparison: by testing multiple algorithms on the same dataset, you can compare their performances and identify the one that best suits your specific problem. Comparing metrics such as accuracy, precision, recall, F1 score, and ROC-AUC can help you determine which algorithm performs better under different data scenarios. Model selection and tuning: training and testing allow you to identify the best-performing algorithm and its associated hyperparameters. Understanding model behavior: training and testing provide insights into how different algorithms behave with respect to your dataset. You can gain an understanding of which models are prone to overfitting or underfitting and how they handle different data scenarios.
-
Perhaps one of the most underrated tools for model selection are learning curves, which are crucial in evaluating the performance of machine learning models. These curves depict the relationship between a chosen performance metric (such as accuracy or loss) and the amount of training data or training iterations. If underfitting is detected, one might consider increasing model complexity, while overfitting may necessitate the use of regularization techniques or a reduction in complexity. The curves also guide decisions on whether a model would benefit from additional data or training iterations. Ultimately, learning curves offer a systematic approach to refining model architecture, enhancing training strategies, and optimizing performance.
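Learning curves of the kind described above can be computed directly with scikit-learn's `learning_curve` (a sketch; the dataset, model, and grid of training sizes are illustrative):

```python
# Sketch: learning curves to diagnose over- and underfitting.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import learning_curve

X, y = make_classification(n_samples=600, random_state=0)

sizes, train_scores, val_scores = learning_curve(
    LogisticRegression(max_iter=1000), X, y,
    train_sizes=np.linspace(0.1, 1.0, 5), cv=5, scoring="accuracy")

# A persistent gap between the two curves suggests overfitting; two low,
# converged curves suggest underfitting.
for n, tr, va in zip(sizes, train_scores.mean(axis=1),
                     val_scores.mean(axis=1)):
    print(n, round(tr, 3), round(va, 3))
```

Plotting the two mean curves against `sizes` (e.g. with matplotlib) gives the visual described in the paragraph above.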
-
Train Multiple Algorithms: Utilize various machine learning models like decision trees, SVMs, and neural networks on your training dataset, tailoring each to the specific nature of your problem. Test on Validation Set: Evaluate these models on a separate validation set using relevant metrics to determine effectiveness, ensuring they have not been exposed to this data during training. Create Similar New Data: Synthesize new data with the same distribution as the validation set to replicate real-world scenarios, using methods like bootstrapping or synthetic data generation. Introduce Bias and Stress-Test: Inject biases into this new data and re-test the models to assess their robustness and ability to handle data variations and challenges.
-
Understanding the use case and the model is of high importance. For example, a model detecting a disease must have higher recall than precision, and is considered better even if it has a lower F1 score. So it depends on the use case. But, in general: F1 score, accuracy, RMSE, binary cross-entropy loss, ROC-AUC curve.
-
The machine learning model selection process involves training and testing algorithms on selected data subsets using chosen metrics. This involves implementing a loop or function to iterate over different algorithms, applying them to the same datasets, and storing and comparing results using tables or plots. Automation tools like scikit-learn, TensorFlow, or AutoML can streamline this process, enhancing efficiency and consistency in model evaluation.
The sixth and final step is to analyze and select the algorithm based on the results and insights obtained in the previous steps. This will help you make an informed, rational decision that fits your goal and your data. For example, you can look at the metric values, learning curves, confusion matrices, feature importances, or model complexity, and see which algorithm strikes the best balance between accuracy, efficiency, and simplicity. You can also take other factors into account, such as the algorithm's interpretability, robustness, or scalability.
-
Analyzing and selecting the most suitable algorithm involves the following considerations. Metrics evaluation: assess the performance of different algorithms based on metrics like accuracy, precision, recall, F1 score, or ROC-AUC to understand how well each performs on your specific dataset. Learning curves and model complexity: examine learning curves to gauge how algorithms handle training data and whether they tend to overfit or underfit; understanding model complexity can help prevent issues like overfitting, ensuring better generalization. Confusion matrices and feature importances: analyze confusion matrices to understand how well an algorithm classifies different classes and where it might be making errors.
-
When evaluating model performance and selecting algorithms in a scenario with various subgroups or segments in the data, it is advisable to also consider the performance for each group/segment separately in addition to the overall model performance. It is for instance possible that the overall performance is relatively high, but that there are groups/segments for which the performance is significantly lower.
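The per-segment check described above is a one-liner with pandas (a sketch; the segments and labels are made up to show how a good overall score can hide a failing segment):

```python
# Sketch: overall vs per-segment accuracy. The model below looks fine
# overall (0.8) but is completely wrong on segment B.
import pandas as pd

df = pd.DataFrame({
    "segment": ["A"] * 8 + ["B"] * 2,
    "y_true":  [1, 1, 1, 1, 1, 1, 1, 1, 0, 0],
    "y_pred":  [1, 1, 1, 1, 1, 1, 1, 1, 1, 1],
})

overall = (df["y_true"] == df["y_pred"]).mean()
per_segment = (df.assign(correct=df["y_true"] == df["y_pred"])
                 .groupby("segment")["correct"].mean())

print(overall)       # 0.8 overall...
print(per_segment)   # ...1.0 on segment A, 0.0 on segment B
```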
-
In the research for my master's thesis, focused on classification (10 years ago by now), I concluded that a good combination can be to use advanced but difficult-to-interpret algorithms (such as Random Tree) to establish what a good baseline performance for a model looks like, and then to use a more understandable model (e.g., logistic regression, regular decision trees) on well pre-processed data and compare how close it gets to that established reference point.
-
In the process of algorithm selection, it's akin to trying on different outfits to find the one that fits you the best. Each algorithm is like a unique outfit, offering different styles and fits. Evaluating them involves checking how well each outfit complements your figure – in the ML world, it's about metrics like accuracy, precision, recall, and F1 score. The chosen algorithm should not only look good on the training data but also suit unseen data, demonstrating its versatility and reliability. Just like finding that perfect outfit, the selected algorithm should make you feel confident across various situations.
-
Heavy cross validation using the chosen metrics will help better understand the candidate models. Multiple comparisons must be considered when doing this, as they are likely to give overly optimistic estimates if one isn't careful.
-
Selecting the best algorithm is a trade-off between: 1. Training and deployment cost: classical supervised learning algorithms are usually more cost-effective than deep learning. If the data is tabular (numerical/categorical), it is often best to choose classical supervised learning algorithms. 2. High accuracy/AUC (static metrics): leverage search-space tools such as Keras Tuner or AutoML solutions to get a baseline of the best algorithms. 3. High-performing algorithms: the success metrics can be static (e.g., accuracy/precision) or business metrics. In the A/B testing world, we could select a few of the best algorithms, run the experiment, and select the one that gives the best metrics (revenue/user engagement/latency).
-
As machine learning practitioners, we often focus on fine-tuning algorithms, tweaking hyperparameters, and experimenting with complex models. However, achieving optimal performance goes beyond just the algorithm itself and to the entire ML pipeline. But how do we optimize this pipeline? While hyperparameter tuning is crucial, consider other dimensions of optimization. Sometimes, a simple rule-based step can significantly enhance model performance. For instance, incorporating domain-specific knowledge or business rules can lead to better predictions. Don’t hesitate to experiment with such techniques.
-
Machine learning's adaptability spans all industries, with data quality taking precedence over a singular focus on technical intricacies. Rather than fixating solely on technical aspects, a prudent approach involves surveying the environment to identify or obtain relevant data and discern its potential value, embracing a problem-centric methodology: what problem needs resolution, and what enhancement is envisioned? It is essential to recognize that the goal of a machine learning model transcends mere speed or accuracy; the measure of success lies in its ability to effectively address substantial human challenges.
-
ML is a highly iterative process in which a lot depends on the quality of the data. Moreover, given the pace of ML research, new techniques come out every day, so constant research into methods is key to selecting the best-suited ML algorithms.
-
Once you have selected the best algorithm, you can try to improve its performance by tuning its hyperparameters using grid search, random search, or Bayesian optimization methods. You can also validate your model on a new or unseen data set to check its generalization ability and robustness.
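As a sketch of the grid-search option mentioned above (scikit-learn assumed; the model and the small parameter grid are illustrative, and `RandomizedSearchCV` or Bayesian tools would slot in the same way):

```python
# Sketch: tune the chosen algorithm's hyperparameters with grid search.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

X, y = make_classification(n_samples=300, random_state=0)

grid = GridSearchCV(
    RandomForestClassifier(random_state=0),
    param_grid={"n_estimators": [50, 100], "max_depth": [3, None]},
    cv=3, scoring="f1")
grid.fit(X, y)

print(grid.best_params_, round(grid.best_score_, 3))
```

The fitted `grid.best_estimator_` can then be evaluated once on the held-out test set to check generalization.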