Data Science (DS)
Browse by
Recent Submissions

On solving cycle problems with BranchandCut: extending shrinking and exact subcycle elimination separation algorithms
(20210101)In this paper, we extend techniques developed in the context of the Travelling Salesperson Problem for cycle problems. Particularly, we study the shrinking of support graphs and the exact algorithms for subcycle elimination ... 
Statistical assessment of experimental results: a graphical approach for comparing algorithms
(20210825)Nondeterministic measurements are common in realworld scenarios: the performance of a stochastic optimization algorithm or the total reward of a reinforcement learning agent in a chaotic environment are just two examples ... 
Simulation approach for assessing the performance of the γEWMA control chart
(20210222)i) Purpose: The purpose of this paper is to evaluate the performance of a modified EWMA control chart ($\gamma$EWMA control chart), which considers data distribution and incorporate its correlation structure, simulating ... 
Altered effective connectivity in sensorimotor cortices: a novel signature of severity and clinical course in depression
(2021)Functional neuroimaging research on depression has traditionally targeted neural networks associated with the psychological aspects of depression. In this study, instead, we focus on alterations of sensorimotor function ... 
From habitat to management: a simulation framework for improving statistical methods in fisheries science
(20210707)Monte Carlo simulation consists of computer experiments that involve creating data by pseudorandom sampling and has shown to be a powerful tool for studying the performance of statistical methods. In this thesis Monte ... 
A Review on Outlier/Anomaly Detection in Time Series Data
(2021)Recent advances in technology have brought major breakthroughs in data collection, enabling a large amount of data to be gathered over time and thus generating time series. Mining this data has become an important task for ... 
Water leak detection using selfsupervised time series classification
(2021)Leaks in water distribution networks cause a loss of water that needs to be com pensated to ensure a continuous supply for all customers. This compensation is achieved by increasing the flow of the network, which entails ... 
A cheap feature selection approach for the K means algorithm
(202105)The increase in the number of features that need to be analyzed in a wide variety of areas, such as genome sequencing, computer vision or sensor networks, represents a challenge for the Kmeans algorithm. In this regard, ... 
Statistical model for reproducibility in rankingbased feature selection
(20201105)The stability of feature subset selection algorithms has become crucial in realworld problems due to the need for consistent experimental results across different replicates. Specifically, in this paper, we analyze the ... 
On the symmetry of the Quadratic Assignment Problem through Elementary Landscape Decomposition
(202107)When designing metaheuristic strategies to optimize the quadratic assignment problem (QAP), it is important to take into account the specific characteristics of the instance to be solved. One of the characteristics that ... 
Exploring Gaps in DeepFool inSearch of More Effective Adversarial Perturbations
(2021)Adversarial examples are inputs subtly perturbed to produce a wrong prediction in machine learning models, while remaining perceptually similar to the original input. To find adversarial examples, some attack strategies ... 
Delineation of site‐specific management zones using estimation of distribution algorithms
(2021)In this paper, we present a novel methodology to solve the problem of delineating homogeneous sitespecific management zones (SSMZ) in agricultural fields. This problem consists of dividing the field into small regions for ... 
On the fair comparison of optimization algorithms in different machines
(2021)An experimental comparison of two or more optimization algorithms requires the same computational resources to be assigned to each algorithm. When a maximum runtime is set as the stopping criterion, all algorithms need to ... 
Efficient metaheuristics for spacecraft trajectory optimization
(202103)Metaheuristics has a long tradition in computer science. During the past few years, different types of metaheuristics, specially evolutionary algorithms got noticeable attention in dealing with realworld optimization ... 
Algorithms for Large Orienteering Problems
(202101)In this thesis, we have developed algorithms to solve largescale Orienteering Problems. The Orienteering Problem is a combinatorial optimization problem were given a weighted complete graph with vertex profits and a maximum ... 
Simulation Framework for Orbit Propagation and Space Trajectory Visualization
(2021)In this paper, an interactive tool for simulation of satellites dynamics and autonomous spacecraft guidance is presented. Different geopotential models for orbit propagation of Earthorbiting satellites are provided, which ... 
Minimax Classification with 01 Loss and Performance Guarantees
(20201201)Supervised classification techniques use training samples to find classification rules with small expected 01 loss. Conventional methods achieve efficient learning and outofsample generalization by minimizing surrogate ... 
Analysis of the sensitivity of the EndOfTurn Detection task to errors generated by the Automatic Speech Recognition process.
(2021)An EndOfTurn Detection Module (EOTDM) is an essential component of au tomatic Spoken Dialogue Systems. The capability of correctly detecting whether a user’s utterance has ended or not improves the accuracy in interpreting ... 
Probabilistic Load Forecasting Based on Adaptive Online Learning
(2020)Load forecasting is crucial for multiple energy management tasks such as scheduling generation capacity, planning supply and demand, and minimizing energy trade costs. Such relevance has increased even more in recent ... 
Identifying common treatments from Electronic Health Records with missing information. An application to breast cancer.
(20201229)The aim of this paper is to analyze the sequence of actions in the health system associated with a particular disease. In order to do that, using Electronic Health Records, we define a general methodology that allows us ...