Corpus Intelligence Regression Analysis 2026-04-26 02:13 UTC
Regression Analysis
OLS: Occupancy Rate ~ 21 features | R² = 63.8% | n = 4,858 | 20 significant
🛡️ Public data only — no PHI permitted on this instance.
63.8%
63.6%
Adj R²
4,858
Observations
21
Features
405.9
F-Statistic
0.1454
RMSE (avg error)

R² = fraction of variance explained (1.0 = perfect, 0 = no signal). RMSE = root mean squared error (average prediction miss in target units). F-statistic tests whether the model explains more than random chance.

Intercept Interpretation

Value: 0.5240
SE: 0.0021
Target Mean: 0.5240
Target Range: 0.0066 — 2.3918

When all features are at their mean values, the predicted occupancy rate is 0.52

Coefficients (Standardized)

Target: Occupancy Rate. Standardized coefficients (-1.0 to +1.0): a one-SD increase in the feature produces this fraction of the strongest effect. *** p<0.001, ** p<0.01, * p<0.05.

VariableStrengthtp-valueSig95% CIImpact
Bed Days Available-1.00034.00.0000***[-0.556, -0.496]
Medicare Days+0.71317.60.0000***[0.334, 0.417]
Beds+0.65113.70.0000***[0.294, 0.392]
Medicare Intensity-0.62915.80.0000***[-0.372, -0.290]
Total Patient Days+0.46610.40.0000***[0.199, 0.291]
Size Quartile+0.24535.70.0000***[0.122, 0.136]
Revenue Per Bed+0.1387.60.0000***[0.054, 0.091]
Net Patient Revenue-0.08713.80.0000***[-0.052, -0.039]
Revenue Per Day-0.08116.90.0000***[-0.047, -0.038]
Operating Expenses-0.07413.70.0000***[-0.045, -0.034]
Expense Per Bed-0.0553.30.0011**[-0.046, -0.012]
Operating Margin+0.0489.70.0000***[0.020, 0.030]
Medicaid Day Pct+0.0326.80.0000***[0.012, 0.022]
Net Income-0.0285.50.0000***[-0.020, -0.010]
Payer Diversity-0.0223.90.0001***[-0.017, -0.006]
Medicaid Days-0.0223.30.0009***[-0.018, -0.005]
Gross Patient Revenue-0.0195.00.0000***[-0.014, -0.006]
Net To Gross Ratio+0.0142.60.0092**[0.002, 0.013]
Medicare Day Pct-0.0113.10.0022**[-0.009, -0.002]
Commercial Pct-0.0092.40.0182*[-0.009, -0.001]
Contractual Allowances+0.0020.30.7834ns[-0.005, 0.007]

Univariate Correlations with Target

Pearson r for each feature vs Occupancy Rate. Shows raw linear relationship before controlling for other variables.

FeaturerStrength
Size Quartile0.539Moderate
Total Patient Days0.509Moderate
Medicare Days0.488Moderate
Bed Days Available0.431Moderate
Beds0.427Moderate
Net Patient Revenue0.392Weak
Operating Expenses0.382Weak
Gross Patient Revenue0.380Weak
Medicare Intensity0.375Weak
Contractual Allowances0.363Weak
Medicare Day Pct-0.352Weak
Medicaid Days0.342Weak
Commercial Pct0.296Weak
Operating Margin0.193Weak
Revenue Per Day-0.176Weak
Net To Gross Ratio-0.175Weak
Payer Diversity-0.115Weak
Net Income-0.115Weak
Revenue Per Bed0.099Weak
Medicaid Day Pct0.092Weak
Expense Per Bed0.081Weak

Hospital Outliers (Residual Analysis)

Hospitals with the largest standardized residuals. >2σ = model underpredicts/overpredicts — investigate for deal opportunities or data quality issues.

HospitalStateActualPredictedResidual
IZARD REGIONAL HOSPITALAR2.34580.2404+14.48σ
FIRSTHEALTH MONTGOMERY MEMORIAL CAHNC2.39180.5653+12.56σ
PARKLAND HLTH CTR - BONNE TERREMO0.0493-0.9369+6.78σ
SPECTRUM HEALTH REED CITYMI0.0066-0.9264+6.41σ
MEMORIAL HERMANN MEMORIAL CITY MEDITX2.24451.4439+5.50σ
BAPTIST HEALTH SYSTEMTX0.5249-0.1885+4.90σ
NORTH SHORE UNIVERSITY HOSPITALNY1.00261.7036-4.82σ
BRIGHAM AND WOMENS HOSPITALMA1.01331.6682-4.50σ
CHRISTIANA CARE HEALTH SYSTEMDE0.88161.5128-4.34σ
RIVERSIDE HOSPITAL OF LOUISIANALA1.09980.4720+4.32σ
ATLANTA MEDICAL CENTERGA0.40141.0178-4.24σ
ST. JOSEPHS HOSPITALFL0.5616-0.0443+4.17σ
MASSACHUSETTS GENERAL HOSPITALMA0.92621.4940-3.90σ
CLEVELAND CLINIC HOSPITALOH0.75920.2042+3.82σ
HENRICO DOCTORS HOSPITALVA0.4116-0.1286+3.71σ

Variance Inflation Factors

VIF > 10 = severe multicollinearity (coefficient estimates unreliable). VIF > 5 = moderate. Consider removing high-VIF features.

FeatureVIFStatus
Net Patient Revenue999.0High
Operating Expenses999.0High
Medicare Day Pct999.0High
Medicaid Day Pct999.0High
Net Income999.0High
Gross Patient Revenue999.0High
Contractual Allowances999.0High
Commercial Pct999.0High
Beds142.7High
Total Patient Days127.1High
Medicare Days103.9High
Medicare Intensity100.5High

Model Fit by State

How well the national model predicts within each state. Low R² states have unique market dynamics not captured by national features — consider state-specific models.

StatenMean Residual
CT3175.2%0.0000
IL19172.9%-0.0000
ND4071.4%0.0000
TN7271.4%-0.0000
MN12271.3%-0.0000
CO10370.8%-0.0000
TX32070.7%-0.0000
WA9569.7%0.0000
NH2769.4%0.1000
IA9169.2%-0.1000
NV4967.8%0.0000
GA14965.4%0.0000
PA17465.4%0.0000
MI13764.3%-0.0000
OR5764.2%0.0000

Top Pairwise Correlations

Variable 1Variable 2r
Gross Patient RevenueContractual Allowances0.992
BedsBed Days Available0.984
Net Patient RevenueOperating Expenses0.982
Total Patient DaysBed Days Available0.976
BedsTotal Patient Days0.962
Medicare DaysMedicare Intensity0.955
Revenue Per BedExpense Per Bed0.953
Net Patient RevenueGross Patient Revenue0.912
Total Patient DaysMedicare Days0.901
Operating ExpensesGross Patient Revenue0.891
BedsMedicare Intensity0.881
Net Patient RevenueTotal Patient Days0.879
Medicare DaysBed Days Available0.874
Operating ExpensesTotal Patient Days0.869
Total Patient DaysGross Patient Revenue0.863