Fit a multinomial regression model (via
multinom
, Ripley 1996, Venables and Ripley 2002)
to a defined chunk of time (a.k.a. segment)
[chunk$start, chunk$end]
within a time series.
multinom_TS_chunk(data, formula, chunk, timename = "time",
weights = NULL, control = list())
Arguments
data |
Class data.frame object including the predictor and
response variables. |
formula |
Formula as a formula or
character object describing the chunk. |
chunk |
Length-2 vector of times: [1] start , the start time
for the chunk and [2] end , the end time for the chunk. |
timename |
character element indicating the time variable
used in the time series. Defaults to "time" . The variable must be
integer-conformable or a Date . If the variable named
is a Date , the input is converted to an integer, resulting in the
timestep being 1 day, which is often not desired behavior.
|
weights |
Optional class numeric vector of weights for each
document. Defaults to NULL , translating to an equal weight for
each document. When using multinom_TS in a standard LDATS
analysis, it is advisable to weight the documents by their total size,
as the result of LDA is a matrix of
proportions, which does not account for size differences among documents.
For most models, a scaling of the weights (so that the average is 1) is
most appropriate, and this is accomplished using document_weights . |
control |
A list of parameters to control the fitting of the
Time Series model including the parallel tempering Markov Chain
Monte Carlo (ptMCMC) controls. Values not input assume defaults set by
TS_control . |
Value
Fitted model object for the chunk, of classes multinom
and
nnet
.
References
Ripley, B. D. 1996. Pattern Recognition and Neural Networks. Cambridge.
Venables, W. N. and B. D. Ripley. 2002. Modern Applied Statistics with S.
Fourth edition. Springer.
Examples
data(rodents)
dtt <- rodents$document_term_table
lda <- LDA_set(dtt, 2, 1, list(quiet = TRUE))
dct <- rodents$document_covariate_table
dct$gamma <- lda[[1]]@gamma
weights <- document_weights(dtt)
chunk <- c(start = 0, end = 100)
mtsc <- multinom_TS_chunk(dct, formula = gamma ~ 1, chunk = chunk,
timename = "newmoon", weights = weights)