- Time Machine is the backup mechanism of macOS, the desktop operating system developed by Apple.The software is designed to work with both local storage devices and network-attached disks, and is most commonly used with external disk drives connected using either USB or Thunderbolt.
- MacOS Big Sur (version 11) is the 17th and current major release of macOS, Apple Inc.' S operating system for Macintosh computers, and is the successor to macOS Catalina (version 10.15). It was announced at Apple's Worldwide Developers Conference (WWDC) on June 22, 2020, 7 and was released to the public on November 12, 2020.
- XQuartz 2.8.0 - 2021-03-21 - First release with native Apple Silicon support XQuartz 2.7.11 - 2016-10-29 - Last release to support Snow Leopard, Lion, and Mountain Lion XQuartz 2.7.10 - 2016-10-22.
The XQuartz project is an open-source effort to develop a version of the X.Org X Window System that runs on OS X. Together with supporting libraries and applications, it forms the X11.app that Apple shipped with OS X versions 10.5 through 10.7.
Quick Download
MacOS If you already have R and RStudio installed. Open RStudio, and click on “Help” “Check for updates”. If a new version is available, quit RStudio, and download the latest version for RStudio. To check the version of R you are using, start RStudio and the first thing that appears on the terminal indicates the version of R you are.

Download | Version | Released | Info |
---|---|---|---|
XQuartz-2.8.0.dmg | 2.8.0 | 2021-03-21 | For macOS 10.9 or later |
XQuartz-2.7.11.dmg | 2.7.11 | 2016-10-29 | For OS X 10.6.3 or later |
X11-2.6.3.dmg | 2.6.3 | 2011-07-20 | For OS X 10.5.x (final Leopard release) |
Important Info for 10.5 Users
OS X Software Updates have included some of the work done by the XQuartz project, but for various reasons, Apple cannot ship the latest and greatest version offered by the XQuartz site. Since the XQuartz X11 package clobbers Apple's X11.app, their software update will clobber the XQuartz X11 package. Because of this, you may experience conflicts after doing a Software Update from Apple. Please re-install the latest XQuartz X11 release for Leopard after installing a system software update to OS X 10.5.x Leopard.
License Info
An XQuartz installation consists of many individual pieces of software which have various licenses. The X.Org software components’ licenses are discussed on theX.Org Foundation Licenses page.The quartz-wm window manager included with the XQuartz distribution uses the Apple Public Source License Version 2.
One of the main goals of textract is to make it as easy as possible tostart using textract (meaning that installation should be as quick andpainless as possible). This package is built on top of several pythonpackages and other source libraries. Assuming you are using pip
oreasy_install
to install textract, the python packagesare all installed by default with textract. The source libraries are aseparate matter though and largely depend on your operating system.
Ubuntu / Debian¶
There are two steps required to run this package onUbuntu/Debian. First you must install some system packages using theapt-getpackage manager before installing textract from pypi.
Note
It may also be necessary to install zlib1g-dev
on Dockerinstances of Ubuntu. See issue #19 for details
OSX¶


These steps rely on you having homebrew installedas well as the cask plugin (brewinstallcaskroom/cask/brew-cask
). The basic idea is to first installXQuartz beforeinstalling a bunch of system packages before installing textract frompypi.
Note
pstotext isnot currently a part of homebrew so .ps
extraction must beenabled by manually installing from source.
Note
Depending on how you have python configured on your system withhomebrew, you may also need to install the pythondevelopment header files for textract to properly install.
Don’t see your operating system installation instructions here?¶
Xquartz Mac Os
My apologies! Installing system packages is a bit of a drag and itshard to anticipate all of the different environments that need to beaccomodated (wouldn’t it be awesome if there were a system-agnosticpackage manager or, better yet, if python could install these systemdependencies for you?!?!). If you’re operating system doesn’t havedocumenation about how to install the textract dependencies, pleasecontribute a pull request with:
Macos Boot Xquartz
A new section in here with the appropriate details about how toinstall things. In particular, please give instructions for how toinstall the following libraries before running
pipinstalltextract
:- libxml2 2.6.21 or lateris required by the
.docx
parser which uses lxml viapython-docx. - libxslt 1.1.15 or later is required by the
.docx
parser which users lxml viapython-docx. - python header files are required for building lxml.
- antiword is required by the
.doc
parser. - pdftotext is optionallyrequired by the
.pdf
parser (there is a pure python fallbackthat works if pdftotext isn’t installed). - pstotextis required by the
.ps
parser. - tesseract-ocris required by the
.jpg
,.png
and.gif
parser. - soxis required by the
.mp3
and.ogg
parser.You need to install ffmpeg, lame, libmad0 and libsox-fmt-mp3,before building sox, for these filetypes to work.
- libxml2 2.6.21 or lateris required by the
Add a requirements file to the requirements directoryof the project with the lower-cased name of your operating system(e.g.
requirements/windows
) so we can try to keep these thingsup to date in the future.
