What if factor rotations hide multiple perfect solutions?

Factor analysis is a curious craft. It pretends data have hidden drivers that, when teased apart, reveal why people behave the way they do, which words they choose, or which products they buy. In practice, researchers estimate a loading matrix that links observed variables to a handful of latent factors. But once you have that matrix, a trick remains: you can spin it around with a rotation and still describe the same data. The goal of rotation is interpretability, a human-friendly form that makes the math look almost like a map of a landscape rather than a tangle of numbers. This is where the new algebraic approach steps in, promising not one best view but all the possible clear-cut views that could exist. The work comes from Kyushu Universitys Faculty of Mathematics, led by Ryoya Fukasaku, with collaborators from Osaka University, Nagasaki University, and other institutions in Japan. Their paper dives into how to find every equality-constrained stationary point of an orthomax rotation, a family that includes the familiar varimax and quartimax criteria, among others, and what that means for how we interpret factor analyses.

Think of factor rotation as rearranging the furniture in a room so that the purpose of each piece becomes obvious. If you choose the right arrangement, you can tell at a glance which items share a common theme. The catch is that there is no single unique rearrangement that is best in every sense. Different rotation criteria prioritize different flavors of simplicity. And when the math is nonlinear, the standard optimization methods can get stuck in local optima, missing the global best view or ignoring other equally valid viewpoints. Fukasaku and coauthors propose an algebraic detour around those pitfalls: instead of chasing a single optimum from an arbitrary starting point, why not compute all stationary points that satisfy the constraints, then pick the one that serves the analyst best? Their approach leans on the idea that many rotation problems can be expressed as algebraic equations, so we can use the tools of computational algebra to solve them exactly. The lead author is from Kyushu University, and the team includes researchers from Osaka University, Nagasaki University, Kyushu Universitys Institute of Mathematics for Industry, and RIKEN Center for Advanced Intelligence.

A puzzle in plain sight

The core trick, in plain terms, is to recast the rotation problem as an equality-constrained optimization. You want to maximize a rotation criterion Q, evaluated on the rotated loading matrix AT, under the rule that T is orthogonal so that the factors stay independent. The mathematics hides in the fact that Q can be written as a polynomial in the entries of T. That is the bridge to algebraic geometry: once you have a polynomial equation with real coefficients, you can study its stationary points not by marching along a hill but by solving a system of polynomial equations. This is where the paper leans on a toolbox of polynomial rings, ideals, and the idea that you can coax all solutions out of the algebra rather than chase a single path with a gradient descent.

What matters here is not a single best rotation but a complete map of all candidates that satisfy the necessary conditions for an extremum. The authors show that when you express orthomax rotation criteria as polynomials, you can apply an algebraic algorithm to extract all algebraic solutions to the stationarity conditions. This yields every possible orthogonal rotation that could maximize the criterion, not just the one your software happens to stumble upon. The practical upshot is a new kind of transparency: you can inspect multiple equally valid loadings and pick the one that makes the data most intelligible for your domain, whether you care about tight clusters of items under a single factor or about spreading loadings more evenly across factors.

Crucially, the method is not a purely theoretical parade. It leverages modern computer algebra systems to enumerate connected components of the solution space and to classify stationary points using second-order conditions. The result is a finite, explicit set of candidate rotations for a given initial solution. If A is your initial loading, the algorithm computes every T that makes AT a stationary point of Q under the orthogonality constraint. The study even provides a concrete framework to distinguish global optima from merely local optima, a subtle but important distinction in real data where interpretability trumps a narrow numeric peak.

Quartimax outshines expectation

One of the paper’s most striking twists is a result that will feel counterintuitive to many practitioners. Varimax has long reigned as the default orthogonal rotation in many software packages. It’s fast, familiar, and historically associated with achieving a simple structure in which items load strongly on one factor and weakly on others. In practice, that appealing crispness has often been interpreted as highly interpretable structure. Yet the authors’ Monte Carlo simulations—driven by the algebraic machinery they developed—reveal a surprising twist: quartimax rotation can produce simpler, more interpretable patterns than varimax in many scenarios.

In the paper’s framework, a loading matrix Lambda has a perfect simple structure if each row contains at most one nonzero element. That is the gold standard of interpretability in some circles, sometimes called unifactoriality. The simulations show quartimax frequently achieves more rows with this property than varimax, challenging a consensus about which rotation is most user-friendly. In other words, the best-looking map of the data might not be the one you’d expect if you only ever run varimax.

The algebraic lens also reveals a nuanced truth about how interpretability should be measured. For a given initial solution, if you consider all stationary points—global optima and local optima alike—you may find another rotation that is, in a precise sense, even simpler in a particular aspect. That means a single numeric optimizer may inadvertently hide a more interpretable loading matrix that sits a short distance away in the space of all possible rotations. The authors’ discovery invites analysts to explore alternatives rather than accept the first one software hands you. It’s a reminder that the math of interpretation often rewards a broader search rather than a single pursuit of the top score.

To ground these ideas, the authors compare their algebraic results with GPArotation, a widely used rotation package implemented in R that follows Jennrichs gradient projection approach. Across several criteria including quartimax, varimax, equamax, and parsimax, GPArotation typically converged to the global optimum in their tests, but not always. The algebraic method confirmed when the glow of a software’s answer matches the global optimum and when it simply lands on a different stationary point with its own interpretive advantages. In short, quartimax emerges not as an anti varimax rebellion but as a viable route to a more human-friendly map of data, sometimes outperforming what analysts have come to expect from varimax.

Beyond the numbers, this result matters culturally for how researchers talk about interpretability. It suggests that the conventional wisdom about a single default rotation might be incomplete. If quartimax can deliver crisper, more interpretable clusters in the loadings, then data analysts may want to keep both tools in their cognitive toolkit and, crucially, have access to all stationary points to make an informed choice. The algebraic approach makes that possible in a principled way rather than relying on trial and error or serendipity in software defaults.

From equations to interpretability

The paper is not just about a clever trick. It also sketches a practical frontier for how we might do data analysis differently. The authors lay out an explicit algorithm, built on the Lagrange multiplier formulation, to locate all stationary points of a rotation criterion under the orthogonality constraint. The algorithm first identifies the algebraic solutions to the stationarity equations, then climbs through each connected component of that solution space to sample representative points. For each candidate rotation, it evaluates the criterion value and classifies the point as a local maximum, a local minimum, or indeterminate using what they call bordered Hessians. In the language of geometry and algebra, they convert a slippery, nonlinear optimization problem into a finite, trackable algebraic package.

One key payoff is independence from starting values. Traditional gradient-based methods, including the popular GPArotation approach, depend on where you begin. With the algebraic method, you get a complete census of solutions, and then you can choose the one that best serves your interpretive aims. This is not about discarding numerical methods; it is about complementing them with a parallel, exact alternative that reveals the landscape’s full structure. The authors argue that this algebraic perspective can guide analysts toward more transparent and adaptable rotation choices, especially in fields where interpretability is essential, from psychology to marketing to materials science.

There is a broader methodological impulse here as well. The paper ties the practice of factor rotation to a broader algebraic apparatus. It shows that many rotation criteria can be treated as algebraic functions and that the constraints live in the world of polynomial equations. That opens the door to transferring ideas from computational algebra, like radical ideals and Hilbert’s Nullstellensatz, into the everyday toolkit of exploratory data analysis. It’s the kind of cross-pollination that researchers in math-heavy disciplines often dream about but rarely realize in routine practice.

Of course, the approach is not without challenges. The algebraic route is computationally intense. The authors report hours of computation for moderate sized problems, and scaling to very large data sets or very high numbers of factors remains an open question. Their demonstrations focus on a three-factor, twenty-something-variable setup, which is substantial but still a fairly small corner of typical real-world data sets. The authors are candid about the cost, and they note that this is a necessary tradeoff for exhaustively cataloging all stationary points. The payoff, they argue, is a principled way to choose the rotation that maximizes interpretability for a given scientific question, not simply a numerical peak on a rugged surface.

Looking ahead, the authors propose extending their algebraic assault to oblique rotations, where factors are allowed to correlate. Oblique rotations are common in practice precisely because real-world constructs often overlap. If their framework can adapt to that more tangled setting, it could broaden the reach of exact, exhaustive rotation analysis beyond the orthogonal case. It could also spark new rotation criteria born from the algebraic vantage point, criteria that explicitly optimize interpretability in ways researchers can inspect and justify.

In the end, this work sits at an intersection of math and mind. It treats a familiar data-analytic maneuver as if it were a hidden combinatorial puzzle, solvable not by wandering through an optimization landscape but by unlocking the algebraic rules that govern it. The authors from Kyushu University and their collaborators show that there is more than one way to tell a story about the same data and that sometimes the most human-friendly story is not the one you would have predicted from standard software defaults. The algebraic lens doesn’t replace old methods; it enriches them, offering a catalog of potential rotations that you can choose from, with the confidence that you have not left any interpretability gem behind in the underbrush.

Originating from the mathematics community in Japan, the study is led by Ryoya Fukasaku of Kyushu University, with coauthors including Michio Yamamoto, Yutaro Kabata, Yasuhiko Ikematsu, and Kei Hirose, among others. The work sits at the crossroads of theory and practice, a reminder that the words we use to describe data—loading, rotation, simple structure—are not just jargon but levers we can pull with greater precision when we couple statistical intuition with algebraic insight.

The upshot is not a single recommended recipe but a new way to think about how we interpret factors. By making it possible to compute all stationary points of orthomax rotations, the authors give analysts a richer menu of options, where interpretability can be chosen with intention rather than luck. If you want to know which rotation makes your data story most compelling to your audience, you now have a principled way to explore not just one candidate but the entire constellation of viable candidates. That is a small revolution in how we handle the honest, human side of data storytelling.