Sitemap

This page is a compact map for humans, coding assistants, notebook agents, and retrieval systems.

Start Here

  • Home: package overview and documentation map.
  • LLM Quickstart: prompt, task-to-API table, and common recipes for assistants.
  • llms.txt: concise /llms.txt-style context file.
  • llms-full.txt: expanded plain-text context for LLMs.
  • Repository README: package overview, install links, and examples.

Choose By Task

Task Best page Main API
Build an auditable notebook workflow Notebook tutorial GLMStudy
Understand public classes and functions API reference GLMStudy, RateGLM, GLM
Save and audit model decisions Save and audit study.save(...)
Rank candidate factors Rank candidate factors rank_factors, study.rank_candidates(...)
Refine an accepted factor Refine factors study.refine_factor(...)
Test interactions Test interactions study.find_interactions(), study.test_interaction(...)
Run an automatic baseline Run automatic workflow GLMWorkflow, study.auto_design(...)
Understand bin/group specs Binning and grouping specs apply_spec, saved specs
Understand validation outputs Validation outputs validation_report, by_factor_report
Understand modeling discipline Modeling principles workflow design
Understand package layers Architecture pandas and Spark backends

Main Concepts

  • Numeric binning: convert continuous factors into inspectable GLM bins.
  • Categorical grouping: group categories into stable target-ordered bands.
  • Factor screening: rank candidate variables before detailed review.
  • Exposure model: fit count-rate models with an exposure offset.
  • Positive target model: fit Gamma GLMs for positive cost, duration, or severity-style targets.
  • Auditability: save JSON-serializable specs, comments, validation reports, and holdout results.
  • Spark workflow: keep large modeling tables in Spark and collect bounded metadata for review.

Domain Examples

The package is domain-neutral. Typical examples include incident rates, service costs, demand and utilization, credit risk, healthcare or warranty work, and insurance or actuarial pricing.

LLM Retrieval Keywords

Use these phrases when connecting user questions to the package:

  • generalized linear model
  • auditable GLM
  • numeric binning
  • categorical grouping
  • factor screening
  • rating factors
  • risk factors
  • exposure model
  • Poisson frequency GLM
  • Gamma severity GLM
  • claim frequency
  • claim severity
  • credit risk
  • banking analytics
  • operational incident rates
  • service cost analysis
  • Spark GLM workflow