Software | Code

My team and I develop open-source statistical and machine learning software for large-scale data running on the Apache Spark distributed computing
platform. Detailed software information is available at my code repository and my KLLAB repository

Statistical packages

Package      DescriptionLanguage  Environment  Link   
riseforecastGeneral Recovery Forecasting with RISEPythonAllGitHub
gratisEfficient algorithms for generating time series with diverse and controllable characteristicsR
Python
AllCRAN
GitHub
febamaFeature-based Bayesian Forecasting Model AveragingR
Python
AllGitHub
fideFeature-based Intermittent DEmand forecastingRAllGitHub
fumaForecast uncertainty based on model averagingRAllGitHub
fformppFFORMPP: Feature-based FORecast Model Performance PredictionR
Python
AllGitHub
dngDistribution and Gradients for Skewed DistributionsR
Python
AllCRAN
GitHub
pyhtsA python package for hierarchical forecasting, inspired by the hts package in RPythonAllGitHub
PyPi
dlsaDistributed Least Squares Approximation implemented with Apache SparkPythonSparkGitHub
darimaDistributed ARIMA models implemented with Apache SparkPythonSparkGitHub
dqrDistributed Quantile Regression by Pilot Sampling and One-Step UpdatingPythonSparkGitHub
cdcopulaCovariate-dependent copula modelsRAllGitHub
movingknotsEfficient Bayesian Multivariate Surface RegressionPythonAllGitHub
GSMFlexible Modeling of Conditional Distributions using Smooth MixturesPythonAllGitHub

Miscellaneous 🔓

I have some fun stuffs as well