Monday, December 23, 2013

Holiday Haze

Your dedicated blogger is about to vanish in the holiday haze, presumably stumbling back sometime early in the new year.

Random thought: Obviously I guessed that I'd enjoy writing this blog, or I wouldn't have started, but I had no idea how truly satisfying it would be, or, for that matter, that anyone would actually read it! Many thanks, my friends. I look forward to returning soon. Meanwhile, all best wishes for the holidays.

Monday, December 16, 2013

FRB St. Louis is Far Ahead of the Data Pack

The email below arrived recently from the Federal Reserve Bank of St. Louis. It reminds me of something that's hardly a secret, but that nevertheless merits applause, namely that FRBSL's Research Department is a wonderful provider of economic and financial data (FRED and much more...), and of related information broadly defined (RePEc and much more...).

FRED, ALFRED, GeoFRED, RePEc, FRASER, etc. -- wow!  FRBSL supplies not only the data, but also intuitive and seamless delivery interfaces. They're very much on the cutting edge, constantly innovating and leading.

Other Feds of course supply some great data as well. To take just one example close to home, the Real-Time Data Research Center within FRB Philadelphia's Research Department maintains a widely respected Real-Time Dataset and Survey of Professional Forecasters (and of course my favorites, the ADS Index and GDPplus).

But FRBSL is in a league of its own. Maybe there's been an implicit decision within the System that FRBSL will be the de facto data guru? Or maybe it's just me, not looking around thoroughly enough? I suspect it's a bit of both.

In any event I applaud FRBSL for a job marvelously well done.

Monday, December 9, 2013

Comparing Predictive Accuracy, Twenty Years Later

I have now posted the final pre-meeting draft of the "Use and Abuse" paper (well, more-or-less "final").

I'll present it as the JBES Lecture, January 2014 ASSA meetings, Philadelphia. Please join if you're around. It's Friday January 3, 2:30, Pennsylvania Convention Center Room 2004-C (I think).

By the way, the 2010 Peter Hansen paper that I now cite in my final paragraph, "A Winner's Curse for Econometric Models: On the Joint Distribution of In-Sample Fit and Out-of-Sample Fit and its Implications for Model Selection," is tremendously insightful. I saw Peter present it a few years ago at a Stanford summer workshop, but I didn't fully appreciate it and had forgotten about it until he reminded me when he visited Penn last week. He's withheld the 2010 and later revisions from general circulation evidently because one section still needs work. Let's hope that he gets it revised and posted soon! (A more preliminary 2009 version remains online from a University of Chicago seminar.) One of Peter's key points is that although split-sample model comparisons can be "tricked" by data mining in finite samples, just as can all model comparison procedures, split-sample comparisons appear to be harder to trick, in a sense that he makes precise. That's potentially a very big deal.

Comparing Predictive Accuracy, Twenty Years Later: A Personal Perspective on the Use and Abuse of Diebold-Mariano Tests

Abstract: The Diebold-Mariano (DM) test was intended for comparing forecasts; it has been, and remains, useful in that regard. The DM test was not intended for comparing models. Much of the large ensuing literature, however, uses DM-type tests for comparing models, in (pseudo-) out-of-sample environments. In that case, simpler yet more compelling full-sample model comparison procedures exist; they have been, and should continue to be, widely used. The hunch that (pseudo-) out-of-sample analysis is somehow the "only," or "best," or even necessarily a "good" way to provide insurance against in-sample over-fitting in model comparisons proves largely false. On the other hand, (pseudo-) out-of-sample analysis remains useful for certain tasks, most notably for providing information about comparative predictive performance during particular historical episodes.

Monday, December 2, 2013

The e-Writing Jungle Part 3: Web-Based e-books Using Python / Sphinx

In the previous Parts 1 and 2, I essentially dealt with two extremes: (1) LaTeX to pdf to web, and (2) raw HTML (however arrived at) with math rendered by MathJax. Now let's look at something of a middle ground: the Python package, Sphinx, for producing e-books.

Part 3: Python / Sphinx

Parts 1 and 2 of Quantitative Economics, by Stachurski and Sargent, are great routes into Python for economists. There's lots of good comparative discussion of Python vs. Matlab or Julia, the benefits of public-domain, open-source code, etc. And it's always up to the minute, because it's an on-line e-book! Just check it out.

Of course we're interested here in e-books, not Python per se. It turns out, however, that Stachurski and Sargent is also a cutting-edge example of a beautiful e-book. It's effectively written in Python using Sphinx, which is a Python package that started as a vehicle for writing software manuals. But a manual is just a book, and one can fill a book with whatever one wants.

Sphinx is instantly downloadable, beautifully documented (the documentation is written in Sphinx, of course!), and open source (licensed under BSD). reStructuredText is its powerful markup language. (You can learn all you need in ten minutes, since math is the only complicated thing, and math stays in LaTeX, rendered either by JavaScript via MathJax or as png images, your choice.) In addition to publishing to HTML, you can publish to LaTeX or pdf.
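For the curious, here is roughly what the math-rendering choice looks like in practice. This is only a minimal sketch of a Sphinx conf.py, not anyone's actual configuration: the project and author names are hypothetical placeholders, and it assumes a current Sphinx release, where the image-based renderer is called sphinx.ext.imgmath (older releases called it sphinx.ext.pngmath).

```python
# conf.py -- a minimal Sphinx configuration sketch.
# The project metadata below is a hypothetical placeholder.
project = "My e-Book"
author = "A. Author"

# Pick ONE math renderer. Math in the source stays in LaTeX either way.
extensions = [
    "sphinx.ext.mathjax",    # render math in the browser with MathJax
    # "sphinx.ext.imgmath",  # alternative: render math as images
]
```

With a file like this in place, running sphinx-build with the html builder produces the web version, and the latex builder produces LaTeX that can be compiled to pdf.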

Want to see how Sphinx performs with math even more dense than Stachurski and Sargent's? Just check, for example, the Sphinx book Theoretical Physics Reference. Want to see how it performs with graphics even more slick than Stachurski and Sargent's? Just check the Matplotlib Documentation. It's all done in Sphinx.

Sphinx is a total class act. In my humble opinion, nothing else in its genre comes close.

Monday, November 25, 2013

Collaboration Distance and the Math Genealogy Project

The American Mathematical Society has a fun site on "collaboration distance" between various mathematicians. The idea is simple: If, for example, I wrote with X, and X wrote with Z, then my collaboration distance to Z is two. There's a good description here, and the actual calculator is here.
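Collaboration distance is just a shortest-path length in the co-authorship graph, so a breadth-first search computes it. The sketch below is only an illustration of the idea, not the AMS calculator's actual implementation (which I haven't seen); the tiny graph and the names "Me", "X" and "Z" are hypothetical and simply mirror the example above.

```python
from collections import deque

def collaboration_distance(coauthors, start, target):
    """Return the fewest co-authorship links from start to target, or None."""
    if start == target:
        return 0
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        person, dist = queue.popleft()
        for other in coauthors.get(person, ()):
            if other == target:
                return dist + 1
            if other not in seen:
                seen.add(other)
                queue.append((other, dist + 1))
    return None  # no chain of co-authorships connects the two

# Hypothetical toy network: I wrote with X, and X wrote with Z.
coauthors = {"Me": {"X"}, "X": {"Me", "Z"}, "Z": {"X"}}
print(collaboration_distance(coauthors, "Me", "Z"))  # prints 2
```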

You can track your collaboration distance not only to Erdos (of course), but also to all-time giants like Gauss or Laplace. The calculator reveals, for example, that my collaboration distance to Gauss is just eight:

I co-authored with Marc Nerlove
Marc Nerlove co-authored with Kenneth J. Arrow
Kenneth J. Arrow co-authored with Theodore E. Harris
Theodore E. Harris co-authored with Richard E. Bellman
Richard E. Bellman co-authored with Ernst G. Straus
Ernst G. Straus co-authored with Albert Einstein
Albert Einstein co-authored with Hermann Minkowski
Hermann Minkowski co-authored with Carl Friedrich Gauss.

Wow -- and some great company along the way, quite apart from the origin at old Carl Friedrich!

Of course I understand the "small-world" network phenomenon, but it's nevertheless hard not to be astounded at first.

So how truly astounding is my eight-step connection to Gauss? Let's do a back-of-the-envelope calculation. For a benchmark Erdos-Renyi network we have:

$$\text{max} \approx \frac{\ln N}{\ln \mu},$$
where $$\text{max}$$ is the maximum collaboration distance, $$N$$ is the number of authors in the network, and $$\mu$$ is the mean number of co-authors. Suppose there are 1,000,000 authors ($$N=1{,}000{,}000$$), each with 5 co-authors (so, trivially, $$\mu=5$$). Then we have $$\text{max} \approx 9$$.
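If you'd like to check the arithmetic, or try other values, here is a minimal sketch. The function name is my own invention for illustration, and the formula is just the Erdos-Renyi benchmark approximation quoted above, nothing more refined.

```python
import math

def approx_max_distance(n_authors, mean_coauthors):
    """Benchmark approximation: ln(N) / ln(mu)."""
    return math.log(n_authors) / math.log(mean_coauthors)

# The numbers from the text: N = 1,000,000 authors, mu = 5 co-authors each.
estimate = approx_max_distance(1_000_000, 5)
print(round(estimate, 2))  # prints 8.58, i.e. roughly 9
```

Because the approximation is logarithmic in the number of authors, even a tenfold increase in network size adds only about 1.4 to the benchmark distance when each author has 5 co-authors.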

Hmmm...I'm no longer feeling so special.

Monday, November 18, 2013

The e-Writing Jungle Part 2: The MathML Impasse and the MathJax Solution

Back to LaTeX and MathJax and MathML and Python and Sphinx and IPython and R and knitr and Firefox and Chrome and ...

In Part 1, I praised e-books done as LaTeX to pdf to the web, perhaps surprisingly. Now let's go the other way, to an e-book done natively on the web as HTML. Each approach is worth considering, depending on the application, as each has different costs and benefits.

Part 2: The MathML Impasse and the MathJax Solution

All we want is an HTML version with native support and beautiful rendering of mathematics. That's what HTML5 does, except for a small detail: many browsers (IE, Chrome, ...) won't display HTML5 math. The real problem is MathML, which is embedded in HTML5, and which is the key to math fonts in HTML5 or anywhere else. It's not just a question of browser suppliers finally waking up and flipping on the MathML switch; rather, successful MathML integration turns out to be really hard (seriously, although I don't really know why), and there are also security issues (again seriously, and again I don't really know why). For those reasons, the good folks at Microsoft and Google, for example, have now basically decided that they'll never support MathML. There's a lot of noise about all this swirling around right now -- some of it quite bitter -- but a single recent informative and entertaining piece will catapult you to the cutting edge, "Google Subtracts MathML from Chrome, and Anger Multiplies," by Steven Shankland.

The bottom line: Math has now been officially sentenced to an eternity of second-class web citizenship, in the sense that native and broad math browser support is not going to happen. But that brings us to MathJax, a JavaScript app that works with HTML. You simply type in LaTeX and MathJax finds any math expressions and renders them beautifully. (For an example see my recent post On the Wastefulness of (Pseudo-) Out-of-Sample Predictive Model Comparisons, which was done in LaTeX and rendered using MathJax.) Note well that MathJax is not just pasting graphics images; hence its output scales nicely and works well on mobile devices too. For all you need to know, check out "MathML Forges On," by Peter Krautzberger.

So what's the big problem? Doesn't HTML plus MathJax basically equal HTML5, with the major additional benefit that it actually works? Of course it's somewhat insulting to us math folk, and certainly it's aesthetically unappealing, to have to overlay something on HTML just to get it to display math. (I'm reminded of the old days of PC hardware, with separate "math co-processors.") And there are other issues. For example, MathJax loads from the cloud (unless it's on your machine(s), which requires installations and updates, and which can't be done for mobile devices), and the MathJax math rendering may take a few seconds or more, depending on the speed of your connection and the complexity/length of your math.

But are any of the above "problems" truly serious? I don't think so. On the contrary, MathJax strikes me as a versatile and long-overdue solution for web-based math. And its future looks very bright, with official supporters now ranging from the American Mathematical Society to Springer to Matlab. (Not that I'm a fan of Matlab any longer -- please join the resistance, purge Matlab from your life, and replace it with Python and R -- but that's a topic for another day.)
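To make the HTML-plus-MathJax recipe concrete, here is a minimal sketch that builds a standalone web page containing one displayed formula. It rests on some assumptions of mine: that you generate the page from Python, that you load MathJax version 3 from the jsDelivr CDN (a locally hosted copy would work too), and that display math is marked with double dollar signs. The function name and page title are hypothetical.

```python
# MathJax 3 loaded from a public CDN (an assumption; a local copy also works).
MATHJAX_URL = "https://cdn.jsdelivr.net/npm/mathjax@3/es5/tex-mml-chtml.js"

def build_page(title, latex):
    """Return a standalone HTML page with one displayed LaTeX formula."""
    return "\n".join([
        "<!DOCTYPE html>",
        "<html>",
        "<head>",
        '  <meta charset="utf-8">',
        f"  <title>{title}</title>",
        f'  <script async src="{MATHJAX_URL}"></script>',
        "</head>",
        "<body>",
        # MathJax scans the page and typesets anything between $$ ... $$.
        f"  <p>$${latex}$$</p>",
        "</body>",
        "</html>",
    ])

page = build_page("MathJax demo", r"\frac{\ln N}{\ln \mu}")
print(page)
```

Save the printed text as an .html file and open it in a browser: the script tag is the entire "overlay," and the formula itself stays in plain LaTeX.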

[Next: Python, Sphinx, ...]

Monday, November 11, 2013

A New Center to Watch for Predictive Macroeconomic and Financial Modeling

Check out USC's fine new Center for Applied Financial Economics, led by the indefatigable Hashem Pesaran. The first event is a fascinating conference, "Recent Developments on Forecasting Techniques for Macro and Finance."  Lots of information here, and program below.

Monday, October 21, 2013

Lawrence R. Klein, 1920-2013

I am sad to report that Lawrence R. Klein has passed away. He was in many respects the father of modern econometrics and empirical macroeconomics; indeed his 1980 Nobel Prize citation was "for the creation of econometric models and their application to the analysis of economic fluctuations and economic policies." He was also a dear friend and mentor to legions of Penn faculty and students, including me. I am grateful to him for many things, including his serving on my Penn Ph.D. dissertation committee nearly thirty years ago.

You can find a beautiful and fascinating autobiographical essay written in 1980, and updated in 2005, here.

Check back during the coming days as I update this post with additional links and materials.

Update 1: KLEIN LAWRENCE, October 20, 2013, of Gladwyne, Pa. Husband of Sonia (nee Adelson). Father of Hannah Klein, Rebecca (James) Kennedy, Rachel (Lyle) Klein and Jonathan (Blandina) Klein. Also survived by 7 grandchildren and 4 great-grandchildren. Services and Interment are private. Relatives and friends are invited to the residence of Mrs. Sonia Klein Wednesday, October 23, 2-4 P.M. AND Saturday, October 26, 2-4 P.M. (only). Contributions in his memory may be made to the University of Pennsylvania Department of Economics.

Update 2: Extensive New York Times obituary here.

Update 3: Penn Economics memorial statement here.

Update 4: Saturday 26 October Financial Times Weekend will contain an extensive obituary.

Wednesday, October 16, 2013

Network Estimation for Time Series

Matteo Barigozzi and Christian Brownlees have a fascinating new paper, "Network Estimation for Time Series," that connects the econometric time series literature and the statistical graphical modeling (network) literature. It's not only useful, but also elegant: they get a beautiful decomposition into contemporaneous and dynamic aspects of network connectedness. Granger causality and "long-run covariance matrices" (spectra at frequency zero), centerpieces of modern time-series econometrics, feature prominently. It also incorporates sparsity, allowing analysis of very high-dimensional networks.

If I could figure out how to get LaTeX/MathJax running inside Blogger, I could show you some details, but no luck after five minutes of fiddling last week, and I haven't yet gotten a chance to return to it. (Anyone know? Maybe Daughter 1 is right and I should switch to WordPress?) For now you'll just have to click on the Barigozzi-Brownlees paper above, and see for yourself.

It's interesting to see that Granger causality is alive and well after all these years, still contributing to new research advances. And Barigozzi-Brownlees is hardly alone in that regard, as the recent biomedical imaging literature illustrates. Some of Vic Solo's recent work is a great example.

Finally, it's also interesting to note that both the Barigozzi-Brownlees and Diebold-Yilmaz approaches to network connectedness work in vector-autoregressive frameworks, yet they proceed in very different, complementary, ways.

Monday, October 14, 2013

A Nobel for Financial Econometrics

First it was Engle and Granger (2003); now it's Fama, Hansen and Shiller.

A central issue in the economics of financial markets is whether and how those markets process information efficiently, to arrive at fair prices. Inextricably linked to that central issue is a central tension: certain lines of argument suggest that financial markets should be highly efficient, yet other lines of argument suggest limits to market efficiency. Gene Fama, Lars Hansen and Bob Shiller have individually and collectively made landmark contributions that now shape both academic and practitioner thinking as regards that tension. In so doing they've built much of the foundations of modern financial economics and financial econometrics. Fama empirically championed the efficient markets hypothesis, which in many respects represents the pinnacle of neoclassical financial economics. Shiller countered with additional empirical evidence that seemingly indicated the failure of market efficiency, setting the stage for several decades of subsequent work. Throughout, Hansen supplied both powerful economic theory that brought asset pricing in closer touch with macroeconomics, and powerful econometric theory (GMM) that proved invaluable for empirical asset pricing, where moment conditions are often available but likelihoods are not.

If today we celebrate, then tomorrow we return to work -- obviously there's more to be done. But for today, a resounding bravo to the three deserving winners!