Abstract

Abstract A new variant ‘PMF’ of factor analysis is described. It is assumed that X is a matrix of observed data and σ is the known matrix of standard deviations of elements of X . Both X and σ are of dimensions n × m . The method solves the bilinear matrix problem X = GF + E where G is the unknown left hand factor matrix (scores) of dimensions n × p , F is the unknown right hand factor matrix (loadings) of dimensions p × m , and E is the matrix of residuals. The problem is solved in the weighted least squares sense: G and F are determined so that the Frobenius norm of E divided (element‐by‐element) by σ is minimized. Furthermore, the solution is constrained so that all the elements of G and F are required to be non‐negative. It is shown that the solutions by PMF are usually different from any solutions produced by the customary factor analysis (FA, i.e. principal component analysis (PCA) followed by rotations). Usually PMF produces a better fit to the data than FA. Also, the result of PF is guaranteed to be non‐negative, while the result of FA often cannot be rotated so that all negative entries would be eliminated. Different possible application areas of the new method are briefly discussed. In environmental data, the error estimates of data can be widely varying and non‐negativity is often an essential feature of the underlying models. Thus it is concluded that PMF is better suited than FA or PCA in many environmental applications. Examples of successful applications of PMF are shown in companion papers.

Keywords

Principal component analysisMatrix (chemical analysis)MathematicsNorm (philosophy)Matrix decompositionFactorizationFactor analysisStatisticsApplied mathematicsCombinatoricsAlgorithmEigenvalues and eigenvectorsPhysicsChemistry

Affiliated Institutions

Related Publications

Fitting the Factor Analysis Model

When the covariance matrix Σ(p×P) does not satisfy the formal factor analysis model for m factors, there will be no factor matrix Λ(p×m) such that γ=(Σ-ΛΛ′) is diagonal. The fac...

1969 Psychometrika 32 citations

Publication Info

Year
1994
Type
article
Volume
5
Issue
2
Pages
111-126
Citations
6053
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

6053
OpenAlex

Cite This

Pentti Paatero, Unto Tapper (1994). Positive matrix factorization: A non‐negative factor model with optimal utilization of error estimates of data values. Environmetrics , 5 (2) , 111-126. https://doi.org/10.1002/env.3170050203

Identifiers

DOI
10.1002/env.3170050203