I wrote this package because I wanted a simple way to create tables in my LaTeX documents. At the same time, I wanted to be able to output HTML or Markdown for use in RStudio. And, I wanted to be able to edit tables intuitively using standard R features. My typical use case is creating tables of regression outputs, but I also wanted to be able to represent arbitrary data, like a table of descriptive statistics or of plain text.
The idea behind huxtable is to store data in a normal data frame, along with properties that describe how to display the data, at cell, column, row or table level. Operations on the data frame work as normal, and they also affect the display properties. Then, the data can be output in an appropriate format. At the moment, those formats are LaTeX, HTML, Markdown, Word/Excel/PowerPoint, RTF and on-screen pretty printing. More could be added.
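As a minimal sketch of this idea, assuming the huxtable package is installed: a property such as bold is stored per cell, ordinary subsetting carries it along, and the same object can be rendered in several formats. The data and the variable names (ht, top, latex_src and so on) are invented for illustration.

```r
library(huxtable)

# A small huxtable; the data is made up for illustration.
# add_colnames = TRUE puts the column names into the first row.
ht <- hux(
  Term     = c("Intercept", "Slope"),
  Estimate = c(1.25, 0.40),
  add_colnames = TRUE
)

# Display properties are stored per cell: make the header row bold.
bold(ht)[1, ] <- TRUE

# Normal data frame operations still work, and properties follow the data.
top <- ht[1:2, ]
bold(top)  # logical matrix: header row TRUE, data row FALSE

# Render the same object in different output formats.
latex_src <- to_latex(ht)
html_src  <- to_html(ht)
md_src    <- to_md(ht)
```

Each of the to_* calls returns the generated source as a string, so the result can be inspected or written to a file.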
Another design choice was to have separate functions per feature. Many existing packages use a single function with a large number of options. For instance, print.xtable in the xtable package has 34 options, and texreg in the texreg package has 41. Having one function per feature should make life easier for the end user. It should also lead to clearer code: each function starts with a valid huxtable, changes one thing, and returns a valid huxtable.
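The one-function-per-feature style can be sketched as a sequence of setters, assuming the huxtable package is installed. The table contents reuse the option counts quoted above; the variable name ht and the chosen colour are illustrative only.

```r
library(huxtable)

ht <- hux(
  Package = c("xtable", "texreg"),
  Options = c(34, 41),
  add_colnames = TRUE
)

# Each call takes a huxtable, changes one property, and returns a huxtable,
# so any step can be dropped or reordered without breaking the others.
ht <- set_bold(ht, 1, 1:2, TRUE)                    # header row in bold
ht <- set_bottom_border(ht, 1, 1:2, 1)              # rule under the header
ht <- set_background_color(ht, 2:3, 1:2, "grey95")  # shade the data rows
```

Because every intermediate value is itself a valid huxtable, it can be printed or exported at any point in the sequence.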
The output formats are very different, and decisions have to be made as to what any package will support. My background is more in HTML. This is reflected in some of the huxtable properties, like per-cell borders and padding. The package tries to keep output reasonably similar between LaTeX and HTML, but there are inevitably some differences and limitations (see below). For Markdown and on-screen output, obviously, only a few basic properties are supported.
The package makes no attempt to output beautiful HTML or LaTeX source code. In fact, in the case of LaTeX, it’s pretty ugly. The approach is “do what it takes to get the job done”.
R has many different packages to create LaTeX and HTML tables. The table below lists those I know of, along with the features they have. It is produced with huxtable, of course ;-)
[Table: feature comparison of R table packages. The package column headers are missing; surviving feature rows: Width and height, Autocreate summary statistics, Dynamic HTML widgets, Create tables from model objects.]
A (Y) means that there is limited support for the feature. For example, multirow cells may only be supported in headers, or only horizontal border lines may work.
This comparison doesn’t necessarily tell you the important stuff: how easy is the interface? Is the code currently maintained? I have not used all these packages, but my personal (and subjective) recommendations are:
texreg is very good for producing regression tables. It can cope with a huge variety of inputs. Hopefully, with the advent of broom, it will get easier for many packages to do this.
ztable seems to support a lot of functionality, though I haven’t used it.
xtable is old, but reliable and widely available.
tables has an interesting interface for producing summary statistics. It looks complex but powerful.
pixiedust is quite close to huxtable. It has many features and is well-written. The interface is slightly different: you use sprinkle() to add features to a tidy data frame.
formattable is a new kid on the block. It has well-written code and some interesting ideas. It’s HTML-only at present.
Some people love LaTeX. Other people think they have to use it to be “scientific”. (Sadly, this is all too common in my field.) Personally, I can tolerate it at a distance. It’s certainly not easy to produce LaTeX code combining a wide variety of table features. Current limitations of huxtable include:
Some of these may be fixed… at some point!
There are also limitations in HTML:
Lastly, as mentioned above, HTML and LaTeX output is likely to differ. For example, in LaTeX, height is set by putting the table inside a \resizebox object. That can obviously mess up other sizes like table width. Your Mileage May Vary.