About
🔗 GitHub: github.com/gpsyrou
As a Data Scientist, I’m passionate about leveraging…
Activity
2K followers
Experience
Education
-
University of Michigan
-
-
Activities and Societies: Kasper Space Lab
Coursework:
Statistical Computing (Python, C++)
Multivariate Analysis (Machine Learning)
Statistical Learning
Computational Statistics & Algorithms
Statistical Inference
Network Analysis
Applied Bayesian Inference -
-
-
-
-
-
Licenses & Certifications
Volunteer Experience
-
Volunteer
TEDxUniversityofPiraeus
- 2 months
Education
Organizing Committee Member
• Active member of a team of 29 volunteers with a job rotation role
• Immersed in crowd management techniques in order to ensure an effective event
Courses
-
Applied Bayesian Inference
BIOSTAT 682
-
Computational Methods and Tools in Statistics
STATS 506
-
Networks
SI 608
-
Probability and Distribution Theory
STATS 510
-
Real Analysis
ME406
-
Special Topics in Applied Statistics (Data Analysis in Python)
STATS 700
-
Special Topics in Engineering
ENGR 599
-
Statistical Computing
STATS 607
-
Statistical Computing
BIOSTAT 615
-
Statistical Inference
STATS 511
-
Statistical Learning I: Regression
STATS 500
-
Statistical Learning II: Multivariate Analysis
STATS 503
Projects
-
Binary Prediction of Edge Presence Using ERGM's For the International Coffee Trade Network
-
Used Exponential Random Graph Models and Network Analysis techniques in order to analyze the global coffee trade network for three different periods (2005, 2010, 2015).For this purpose we used three types of variables in our analysis: General model variables (name of exporter country, name of importer country etc), Economic/Demographic variables (GDP, population), as well as graph statistics like number of edges, mutual trade relationship between pairs of countries, edge distribution. Our main…
Used Exponential Random Graph Models and Network Analysis techniques in order to analyze the global coffee trade network for three different periods (2005, 2010, 2015).For this purpose we used three types of variables in our analysis: General model variables (name of exporter country, name of importer country etc), Economic/Demographic variables (GDP, population), as well as graph statistics like number of edges, mutual trade relationship between pairs of countries, edge distribution. Our main goal was edge prediction (existence of trade) between countries for different years, along with some Exploratory Data Analysis (EDA).Our final results indicate that wealthy countries have high import-export activity at the coffee trade, while poor countries (which are the lead exporters of coffee) seemed to focus only in exporting (low importing activity).Our final model achieved a prediction accuracy of approximately 86, 5%, which we believe that we can improve by using more complex graph statistics.
For this project we used the Python and R programming languages.Other creators -
Classification of Polish Bankruptcy Data via Support Vector Machines,Random Forests and Neural Networks
-
Project used Principal Component Analysis, Neural Networks, Random Forests, and Support Vector Machines in order to classify a dataset of Polish corporate bankruptcy data, finding significant differences in the classification error rates across methods. We followed the whole data analysis procedure: Collect data, Handle missing cases through imputation (Predictive Mean Matching), Dimension Reduction (PCA) and then compare the 3 different Machine Learning Classification methods. We concluded…
Project used Principal Component Analysis, Neural Networks, Random Forests, and Support Vector Machines in order to classify a dataset of Polish corporate bankruptcy data, finding significant differences in the classification error rates across methods. We followed the whole data analysis procedure: Collect data, Handle missing cases through imputation (Predictive Mean Matching), Dimension Reduction (PCA) and then compare the 3 different Machine Learning Classification methods. We concluded that a business analyst, who is more averse on False Negative errors, should choose the SVM method, while an individual who is more averse on False Positive errors, should follow the Random Forests method.
Finally, the project is completed by using the programming language R .Other creators
Languages
-
Greek
Native or bilingual proficiency
-
English
Native or bilingual proficiency
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore More