Risk Modeling & Decision Safety
Underwriting and fraud-risk workflows with calibration, threshold policies, abstention, validation, explainability, and review-focused reporting.
Data Scientist · Machine Learning · Decision Intelligence
I’m Amir Honardoust, a Data Scientist focused on explainable machine learning, forecasting, NLP, analytics, and practical AI systems people can understand and use.
What I do
My work connects statistical thinking, machine learning, product sense, and clear communication. I care about models that are evaluated, explainable, and useful beyond a notebook.
Underwriting and fraud-risk workflows with calibration, threshold policies, abstention, validation, explainability, and review-focused reporting.
Retrieval pipelines, knowledge-graph augmentation, text classification, recommender evaluation, and AI systems built for traceability.
Synthetic tabular-data evaluation, business prediction tools, reproducible model workflows, dashboard outputs, and portfolio-grade documentation.
Featured projects
Decision Safety · Risk ML
GitHub ↗A loan-approval safety lab with probability calibration, abstention policies, coverage–quality tradeoffs, a triage UI, and data-quality checks.
Fraud Detection · Risk ML
GitHub ↗A cost-sensitive fraud-detection system with SHAP explanations, threshold optimization, batch scoring, and an interactive review dashboard.
RAG · LLM Systems
GitHub ↗An explainable graph + vector RAG system with FAISS retrieval, knowledge-graph reasoning paths, a FastAPI backend, and a Streamlit UI.
Synthetic Data · Generative ML
GitHub ↗A research-style comparison of Gaussian Copula and VAE methods with distribution checks, correlation analysis, PCA diagnostics, and visual reports.
NLP · Responsible AI
GitHub ↗A TF-IDF + Logistic Regression style-risk detector with a Streamlit app, CLI prediction, uncertainty handling, leakage analysis, tests, and CI.
SQL · Machine Learning
GitHub ↗An end-to-end site-selection workflow with SQL feature engineering, regression modeling, model comparison, candidate ranking, tests, and CI.
Technical notes, project breakdowns, reproducible workflows, and deeper implementation details.
Visit technical labAbout
I focus on practical data science: understanding the problem, shaping the data, building the right model, evaluating it honestly, and communicating the result clearly.
My strongest interests are risk modeling, retrieval-augmented generation, synthetic data evaluation, recommender systems, explainability, and analytics systems that help people make better decisions.
“Good data science is not just a model. It is a reliable path from messy evidence to a decision someone can trust.”
Skills
Python, SQL, Solidity, MQL5.
pandas, NumPy, SQLite, SQLAlchemy.
scikit-learn, XGBoost, LightGBM, joblib.
PyTorch, TensorFlow / Keras, Hugging Face Transformers, BERT.
FAISS, Sentence Transformers, FastAPI, Streamlit.
matplotlib, Plotly, Streamlit dashboards.
SHAP, feature importance, calibration, threshold analysis.
Tests, CI, reproducible outputs, model artifacts, documentation.
Contact
The fastest way to reach me is through LinkedIn or GitHub. For technical details, visit my lab at honardoust.codes.