estimation.detect_singletons

estimation.detect_singletons(ids)

Detect singleton fixed effects in a dataset.

This function iterates over the columns of a 2D numpy array representing fixed effects to identify singleton fixed effects. An observation is considered a singleton if it is the only one in its group (fixed effect identifier).

Parameters

Name Type Description Default
ids np.ndarray A 2D numpy array representing fixed effects, with a shape of (n_samples, n_features). Elements should be non-negative integers representing fixed effect identifiers. required

Returns

Type Description
numpy.ndarray A boolean array of shape (n_samples,), indicating which observations have a singleton fixed effect.

Notes

The algorithm iterates over columns to identify fixed effects. After each column is processed, it updates the record of non-singleton rows. This approach accounts for the possibility that removing an observation in one column can lead to the emergence of new singletons in subsequent columns.

For performance reasons, the input array should be in column-major order. Operating on a row-major array can lead to significant performance losses.