Sitemap¶

This page is a compact map for humans, coding assistants, notebook agents, and retrieval systems.

Start Here¶

Home: package overview and documentation map.
LLM Quickstart: prompt, task-to-API table, and common recipes for assistants.
llms.txt: concise /llms.txt-style context file.
llms-full.txt: expanded plain-text context for LLMs.
Repository README: package overview, install links, and examples.

Choose By Task¶

Task	Best page	Main API
Build an auditable notebook workflow	Notebook tutorial	`GLMStudy`
Understand public classes and functions	API reference	`GLMStudy`, `RateGLM`, `GLM`
Save and audit model decisions	Save and audit	`study.save(...)`
Rank candidate factors	Rank candidate factors	`rank_factors`, `study.rank_candidates(...)`
Refine an accepted factor	Refine factors	`study.refine_factor(...)`
Test interactions	Test interactions	`study.find_interactions()`, `study.test_interaction(...)`
Run an automatic baseline	Run automatic workflow	`GLMWorkflow`, `study.auto_design(...)`
Understand bin/group specs	Binning and grouping specs	`apply_spec`, saved specs
Understand validation outputs	Validation outputs	`validation_report`, `by_factor_report`
Understand modeling discipline	Modeling principles	workflow design
Understand package layers	Architecture	pandas and Spark backends

Main Concepts¶

Numeric binning: convert continuous factors into inspectable GLM bins.
Categorical grouping: group categories into stable target-ordered bands.
Factor screening: rank candidate variables before detailed review.
Exposure model: fit count-rate models with an exposure offset.
Positive target model: fit Gamma GLMs for positive cost, duration, or severity-style targets.
Auditability: save JSON-serializable specs, comments, validation reports, and holdout results.
Spark workflow: keep large modeling tables in Spark and collect bounded metadata for review.

Domain Examples¶

The package is domain-neutral. Typical examples include incident rates, service costs, demand and utilization, credit risk, healthcare or warranty work, and insurance or actuarial pricing.

LLM Retrieval Keywords¶

Use these phrases when connecting user questions to the package:

generalized linear model
auditable GLM
numeric binning
categorical grouping
factor screening
rating factors
risk factors
exposure model
Poisson frequency GLM
Gamma severity GLM
claim frequency
claim severity
credit risk
banking analytics
operational incident rates
service cost analysis
Spark GLM workflow