Research Article
Opinion Mining of Student Regarding Educational System Using Online Platform
Muhammad Irfan*,
Khadija Bibi,
Adeeba Aslam,
Saima Bibi,
Anwar Khan
Issue:
Volume 10, Issue 2, December 2025
Pages:
91-109
Received:
27 January 2025
Accepted:
19 May 2025
Published:
4 August 2025
Abstract: Covid-19 is new virus that is spreading rapidly in all over the world. It is a communicable disease. World Health Organization announced social distancing to control the spread of that virus. All institutions are closed in Pakistan. Education was also effecting with this shutdown. In the age of computing, social computing has emerged as a means of sharing knowledge, conveying ideas, and forming academic discussion groups, to name a few. Social websites or apps are also used for online study due to some critical situation as if nowadays we are facing many problems due to COVID-19. Due to the COVID-19 educational system is disturbed for that purpose we are introducing a different online platform for delivering knowledge and continue the educational system many data mining techniques are applied to social network data for online analysis due to a large number of users and widespread use. This paper describes a method for extracting and analyzing master’s student comments from the online survey that which platform is better for online study and also giving the opinion about most used platform. The proposed technique is implemented using different models or algorithms. By providing various proformas and analyzing vary- iOS student opinions, the said system may assist the administration in improving the learning environment.
Abstract: Covid-19 is new virus that is spreading rapidly in all over the world. It is a communicable disease. World Health Organization announced social distancing to control the spread of that virus. All institutions are closed in Pakistan. Education was also effecting with this shutdown. In the age of computing, social computing has emerged as a means of ...
Show More
Research Article
Comparative Analysis of Machine Learning Algorithms for Predicting Under-Five Mortality: Evidence from Tanzania Demographic and Health Survey
Issue:
Volume 10, Issue 2, December 2025
Pages:
110-123
Received:
9 July 2025
Accepted:
24 July 2025
Published:
20 August 2025
Abstract: Under-five mortality remains a global health challenge with the rates of 43 deaths per every 1000 live births in Tanzania and 37 deaths per every 1000 live births globally. Although child mortality has significantly declined in the last twenty years, the current rates are far from reaching the anticipated Sustainable Development Goal of atmost 25 deaths per 1000 live births in 2030. This study intended to find the best performing classifier of under-five mortality status by comparing ten supervised machine learning algorithms. These machine learning algorithms are Decision Trees, Random Forest, Support Vector Machines, SMOTE-Based Boosted Random Forest, XGBoost, LightGBM, CatBoost, Logistic Regression, K-Nearest Neighbors and Stacked Ensemble Methods. The class imbalance of the dataset detected in the pre-processing stage was addressed using weighted categorical cross-entropy and SMOTE with a 5-folds cross validation and data splitting ratio of 80% for training set and 20% for testing set. With 20 experiments for each of the nine algorithms, the average results were reported to ensure that the findings were not by chance. Further, the stacking ensemble model was developed integrating six of the best performing algorithms using an inclusion criterion of AUC > 0.97. The findings revealed that ensemble algorithm consistently outperformed the other nine algorithms by achieving 100%, 100%, 99.97% and 99.24% for AUC, Accuracy, F1-Score and MCC respectively. This implies that stacking ensemble can uncover more insights than the individual algorithms in predicting under-five mortality status. This study recommends designing policies on under-five mortality that integrate insights from the stacking ensemble algorithm which shows the highest predictive performance.
Abstract: Under-five mortality remains a global health challenge with the rates of 43 deaths per every 1000 live births in Tanzania and 37 deaths per every 1000 live births globally. Although child mortality has significantly declined in the last twenty years, the current rates are far from reaching the anticipated Sustainable Development Goal of atmost 25 d...
Show More
Research Article
Statistical Test-Based Feature Selection and Classification Techniques for Breast Cancer Data
Murfia Rahman Muna,
Md. Alamgir Sarder*
Issue:
Volume 10, Issue 2, December 2025
Pages:
124-130
Received:
19 June 2025
Accepted:
7 July 2025
Published:
28 August 2025
DOI:
10.11648/j.mlr.20251002.13
Downloads:
Views:
Abstract: Breast cancer is a disease that affects the majority of women and it is the second most common cause of death among women globally. Medical scientists have proven that there are a vast number of genes that are responsible for breast cancer. Among them, all genes are not equally responsible. Therefore, the most relevant and informative genes are needed to find out to control the disease. The objectives of our study are: (i) To find the most informative and significant genes using different statistical test-based feature selection techniques (FST) as well as find the best classifier and (ii) To validate our experimental results using a simulated dataset. The breast cancer dataset is a benchmark dataset provided by Kent Ridge Biomedical Data Repository, USA. In our study, we have used different statistical test-based feature selection techniques such as the t-test and Wilcoxon signed rank sum (WCSRS) test. Naïve Bayes (NB), Adaboost (AB), linear discriminant analysis (LDA), artificial neural network (ANN), k-nearest neighbor (KNN), and random forest (RF) are treated as classification techniques. Our analysis included 24,188 genes and 97 patients. Among them, 46 patients were with cancer and 51 were in control. We considered 70% of the dataset as a training set and the rest is a test set and repeated this procedure about 1000 times. Among all the combinations of FST and classification techniques t-test-based Naive Bayes classifier gives us the highest classification accuracy. The analysis of our study indicates that the integration of t-test-based FST and Naïve Bayes classifier produces the maximum classification accuracy.
Abstract: Breast cancer is a disease that affects the majority of women and it is the second most common cause of death among women globally. Medical scientists have proven that there are a vast number of genes that are responsible for breast cancer. Among them, all genes are not equally responsible. Therefore, the most relevant and informative genes are nee...
Show More