Help for package MEsreg

Type:

Package

Title:

Generalized Maximum Entropy Estimation for Smooth Transition and Kink Regression Models

Version:

0.1.0

Maintainer:

Woraphon Yamaka <woraphon.econ@gmail.com>

Description:

Implements generalized maximum entropy estimation for linear regression, kink regression, and smooth transition kink regression models. The approach represents unknown parameters and disturbances as probability distributions over discrete support spaces and estimates them by maximizing entropy subject to model constraints. It is particularly suited to ill-posed problems and does not require distributional assumptions on the error term. The methods have been applied in empirical studies such as Tarkhamtham and Yamaka (2019) https://thaijmath.com/index.php/thaijmath/article/view/867/870 and Maneejuk, Yamaka, and Sriboonchitta (2022) <doi:10.1080/03610918.2020.1836214>.

License:

GPL-3

Encoding:

UTF-8

RoxygenNote:

7.3.3

Imports:

Rsolnp, stats

Language:

en-US

NeedsCompilation:

Packaged:

2026-04-02 06:50:39 UTC; Acer

Author:

Woraphon Yamaka [aut, cre], Paravee Maneejuk [aut], Nuttaphong Kaewtathip [aut]

Repository:

CRAN

Date/Publication:

2026-04-07 07:50:07 UTC

Generalized Maximum Entropy Estimation for Kink Regression with Multiple Covariates

Description

Estimates a kink (threshold) regression model using the Generalized Maximum Entropy (GME) framework. The model allows for a nonlinear relationship in a primary regressor and linear effects from additional covariates.

The kink regression model is specified as:

y_i = \beta_0 + \beta_1 (x_i - \tau)^{-} + \beta_2 (x_i - \tau)^{+} + \mathbf{Z}_i' \gamma + \varepsilon_i,

where (x_i - \tau)^{-} = (x_i - \tau)\mathbf{1}(x_i < \tau) and (x_i - \tau)^{+} = (x_i - \tau)\mathbf{1}(x_i \geq \tau) represent the negative and positive parts of the kink function, respectively. The parameter \tau denotes the threshold (kink point).

Usage

MEkink(y, x, Z = NULL,
       number = "3",
       entropy = "shannon",
       s1 = 10,
       s2 = 3,
       boot = FALSE, B = 100)

Arguments

y

numeric vector; the dependent variable.

x

numeric vector; the primary explanatory variable exhibiting a kink (threshold) effect.

Z

optional numeric vector or matrix; additional explanatory variables entering the model linearly. If NULL, no additional covariates are included.

number

character; the number of support points used in the entropy estimation. Valid options are "3", "5", and "7".

entropy

character; the type of entropy measure used in estimation. Options include "shannon", "tsallis", and "renyi".

s1

numeric; the bound for the coefficient support space.

s2

numeric; the bound for the error support space.

boot

logical; if TRUE, bootstrap resampling is used to compute standard errors of the estimated coefficients.

B

integer; the number of bootstrap replications used when boot = TRUE.

Details

The Generalized Maximum Entropy (GME) approach estimates the unknown parameters by representing them as probability distributions over discrete support spaces. These probabilities are obtained by maximizing an entropy measure (e.g., Shannon entropy) subject to the consistency constraints imposed by the regression model.

The argument number controls the number of support points used for discretization, while s1 and s2 define the bounds for the coefficient and error supports, respectively.

When boot = TRUE, standard errors are computed using bootstrap resampling, which is particularly useful in nonlinear and nonstandard estimation settings such as kink regression.

Value

A list with the following components:

beta

numeric vector; estimated coefficients, including the intercept, kink slopes, and coefficients for additional covariates.

threshold

numeric; the estimated kink (threshold) point.

se

numeric vector; standard errors of the estimated coefficients (bootstrap-based if boot = TRUE).

pvalue

numeric vector; p-values associated with the estimated coefficients.

RMSE

numeric; root mean squared error of the fitted model.

RSS

numeric; root of the sum of squared residuals.

R2

numeric; coefficient of determination.

Author(s)

Woraphon Yamaka, Paravee Maneejuk, and Nuttaphong Kaewtathip

References

Golan, A., Judge, G. G., & Miller, D. (1996). Maximum entropy econometrics. Iowa State University Press.

Jaynes, E. T. (1957). Information theory and statistical mechanics. Physical Review, 106(4), 620–630.

Shannon, C. E. (1948). A mathematical theory of communication. Bell System Technical Journal, 27, 379–423.

Examples

## Simulated data
set.seed(1)
n <- 15
thres <- 3
x <-rnorm(n, thres, 5)
z1 <- rnorm(n)
z2 <- rnorm(n)

Z <- cbind(z1, z2)

y <- 0.5 +
     1*(x*(x < 3)) +
    -1*(x*(x >= 3)) +
     rnorm(n, sd=0.2)

## Estimate model
fit <- MEkink(y, x, Z,number="5", entropy="shannon",s1=10,s2=3,boot=FALSE)
fit

Generalized Maximum Entropy for estimating linear regression

Description

Estimates a linear regression model using the Generalized Maximum Entropy (GME) framework.

The model is given by:

y_i = \beta_0 + \sum_{j=1}^{k} \beta_j x_{ij} + \varepsilon_i

The parameters are estimated by representing unknown coefficients and disturbances as probability distributions over discrete support spaces.

Usage

MEregress(y,x,number,entropy,s1,s2,boot,B)

Arguments

y

numeric vector; the dependent variable.

x

numeric vector or matrix; the explanatory variable(s).

number

character; the number of support points used in the estimation. Valid options are "3", "5", and "7".

entropy

character; the type of entropy measure used in estimation. Options include "shannon", "tsallis", and "renyi".

s1

numeric; the upper bound for the coefficient support space.

s2

numeric; the upper bound for the error support space.

boot

logical; if TRUE, bootstrap resampling is performed to compute standard errors of the estimated coefficients.

B

integer; the number of bootstrap replications used when boot = TRUE.

Details

The GME approach represents unknown parameters and disturbances as probability distributions over discrete support spaces, and estimates them by maximizing Shannon entropy subject to the model constraints.

The number of support points is controlled by the number argument, which determines the discretization of the support space (e.g., "3", "5", or "7" points). A larger number of support points provides greater flexibility in approximating the underlying distributions, but increases computational complexity.

The bounds of the support spaces are specified by s1 for the coefficients and s2 for the error term. These bounds incorporate prior information and play an important role in shaping the estimated distributions.

Compared to traditional methods such as ordinary least squares, the GME estimator is particularly useful in ill-posed problems and settings with limited data, where distributional assumptions may be unreliable.

Value

beta

numeric vector; estimated coefficients including the intercept.

Maxent

numeric; the value of the maximized entropy.

se

numeric vector; standard errors of the estimated coefficients (bootstrap-based if boot = TRUE).

Author(s)

Woraphon Yamaka, Paravee Maneejuk, and Nuttaphong Kaewtathip

References

Golan, A., Judge, G. G., and Miller, D. (1996). Maximum Entropy Econometrics. Wiley.

Jaynes, E. T. (1957). Information theory and statistical mechanics. Physical Review, 106(4), 620–630.

Maneejuk, P., Yamaka, W., and Sriboonchitta, S. (2019). Does the Kuznets curve exist in Thailand? Annals of Operations Research.

Maneejuk, P. and Yamaka, W. (2020). Entropy inference in smooth transition kink regression.

Examples

set.seed(1)
n=20
e=rnorm(n)
x0=rnorm(n)
x1=rnorm(n)
y=1+2*x0+3*x1+e
x=cbind(x0,x1)
MEregress(y,x,number="3",entropy = "shannon",s1=10,s2=5,boot=FALSE)

Generalized Maximum Entropy Estimation for Smooth Transition Kink Regression

Description

Estimates a smooth transition kink regression model using the Generalized Maximum Entropy (GME) framework.

The model is defined as:

y_i = \beta_0 + \beta_1 x_i (1 - G(x_i; \gamma, \tau)) + \beta_2 x_i G(x_i; \gamma, \tau) + \varepsilon_i,

where G(x_i; \gamma, \tau) = \frac{1}{1 + \exp(-\gamma (x_i - \tau))} is a logistic transition function.

Unknown parameters and disturbances are represented as probability distributions over discrete support spaces, providing robust estimation under ill-posed or ill-conditioned settings.

Usage

MEskink(y,x,Z,number,entropy,s1,s2,boot,B)

Arguments

y

numeric vector; the dependent variable.

x

numeric vector or matrix; the explanatory variable(s).

Z

optional numeric vector or matrix; additional explanatory variables entering the model linearly. If NULL, no additional covariates are included.

number

character; the number of support points used in the estimation. Valid options are "3", "5", and "7".

entropy

character; the type of entropy measure used in estimation. Options include "shannon", "tsallis", and "renyi".

s1

numeric; the upper bound for the coefficient support space.

s2

numeric; the upper bound for the error support space.

boot

logical; if TRUE, bootstrap resampling is performed to compute standard errors of the estimated coefficients.

B

integer; the number of bootstrap replications used when boot = TRUE.

Details

The Generalized Maximum Entropy (GME) approach provides a flexible framework for estimating models in ill-posed or underdetermined settings. The method is based on the principle of maximum entropy introduced by Jaynes (1957), which selects probability distributions that are maximally non-committal with respect to missing information.

In this framework, unknown parameters and disturbances are represented as discrete probability distributions over predefined support spaces. These probabilities are estimated by maximizing Shannon entropy (Shannon, 1948) subject to the consistency constraints implied by the regression model.

The approach was further developed by Golan, Judge, and Miller (1996) for econometric applications, allowing for estimation without imposing strict distributional assumptions on the error term. As a result, the GME estimator is robust to multicollinearity, small sample sizes, and ill-conditioned problems.

This implementation focuses on a smooth transition kink regression model with a single explanatory variable, where nonlinear behavior is captured through a logistic transition function.

Value

beta

numeric vector; estimated coefficients including the intercept.

Maxent

numeric; the value of the maximized entropy.

threshold

numeric; the estimated kink (threshold) point that determines where the slope of the relationship changes.

smooth

numeric; the smoothness parameter controlling the speed of transition between regimes. Larger values imply a sharper transition, while smaller values indicate a smoother change.

Author(s)

Woraphon Yamaka, Paravee Maneejuk, and Nuttaphong Kaewtathip

References

Golan, A., Judge, G. G., and Miller, D. (1996). Maximum Entropy Econometrics. Wiley.

Jaynes, E. T. (1957). Information theory and statistical mechanics. Physical Review, 106(4), 620–630.

Maneejuk, P., Yamaka, W., and Sriboonchitta, S. (2019). Does the Kuznets curve exist in Thailand? Annals of Operations Research.

Maneejuk, P. and Yamaka, W. (2020). Entropy inference in smooth transition kink regression.

Examples

set.seed(1)
n <- 20
thres <- 3
gam <- 1.2
e <- rnorm(n)
x <-rnorm(n, thres, 5)
z1 <- rnorm(n)
z2 <- rnorm(n)

Z <- cbind(z1, z2)
alpha <- c(0.5, 1, -1)

g <- 1/(1+exp(-gam*(x-thres)))

y <- alpha[1] +
     alpha[2]*(x*(1-g)) +
     alpha[3]*(x*g) + e

fit2=MEskink(y, x, Z,number = "7",entropy="tsallis", s1 = 10, s2 = 5,boot=FALSE)
fit2

Plot Method for Smooth Transition and Kink Regression Models

Description

Plots the observed data together with the fitted regression curve from a Generalized Maximum Entropy kink or smooth transition kink regression model.

Usage

## S3 method for class 'MEskink'
plot(x, type = c("smooth", "kink"), main = NULL, ...)

Arguments

x

An object of class "MEskink" returned by MEskink.

type

Character string indicating the type of fitted curve to plot. Use "smooth" for smooth transition kink regression or "kink" for standard kink regression.

main

Character string specifying the title of the plot. If NULL, a default title is automatically selected based on type.

...

Additional graphical arguments passed to plot.

Details

This function provides a graphical representation of the relationship between the dependent variable and the explanatory variable.

For type = "smooth", the fitted curve is based on a smooth transition kink regression model using a logistic transition function, allowing the slope to change gradually around the estimated threshold.

For type = "kink", the fitted curve corresponds to a standard kink (piecewise linear) regression, where the slope changes abruptly at the estimated threshold.

The function constructs a grid over the range of the explanatory variable and evaluates the fitted model to generate the regression curve.

For smooth models, the input object must contain a smooth parameter.

Value

This function returns NULL. It is used for its side effect of producing a plot.

Author(s)

Woraphon Yamaka, Paravee Maneejuk, and Nuttaphong Kaewtathip

Examples

set.seed(1)
n <- 20
thres <- 3
gam <- 1.2
e <- rnorm(n)
x <- rnorm(n, thres, 5)
z1 <- rnorm(n)
z2 <- rnorm(n)

Z <- cbind(z1, z2)
alpha <- c(0.5, 1, -1)

g <- 1 / (1 + exp(-gam * (x - thres)))

y <- alpha[1] +
     alpha[2] * (x * (1 - g)) +
     alpha[3] * (x * g) + e

fit2 <- MEskink(y, x, Z,
                number = "7",
                entropy = "tsallis",
                s1 = 10, s2 = 5,
                boot = FALSE)

plot(fit2, type = "smooth")

Package {MEsreg}

Generalized Maximum Entropy Estimation for Kink Regression with Multiple Covariates

Description

Usage

Arguments

Details

Value

Author(s)

References

Examples

Generalized Maximum Entropy for estimating linear regression

Description

Usage

Arguments

Details

Value

Author(s)

References

Examples

Generalized Maximum Entropy Estimation for Smooth Transition Kink Regression

Description

Usage

Arguments

Details

Value

Author(s)

References

Examples

Plot Method for Smooth Transition and Kink Regression Models

Description

Usage

Arguments

Details

Value

Author(s)

Examples