Hadoop JupyterLab Extension
This extension is composed of a Python package named hdfsbrowser
, which installs the server+nbextension and a NPM package named @swan-cern/hdfsbrowser
for the JupyterLab extension.
- JupyterLab >= 2.1
Note: You will need NodeJS to install the extension.
pip install hdfsbrowser
jupyter nbextension install hdfsbrowser --py
jupyter nbextension enable hdfsbrowser --py
jupyter lab build
Configure notebook jupyter_notebook_config.py
:
c.HDFSBrowserConfig.hdfs_site_path = "/cvmfs/sft.cern.ch/lcg/etc/hadoop-confext/conf/etc/analytix/hadoop.analytix/hdfs-site.xml"
c.HDFSBrowserConfig.hdfs_site_namenodes_property = "dfs.ha.namenodes.analytix"
c.HDFSBrowserConfig.hdfs_site_namenodes_port = "50070"
c.HDFSBrowserConfig.webhdfs_token = "dummy"
If you are not seeing the frontend, check if it's installed:
jupyter labextension list
If it is installed, try:
jupyter lab clean
jupyter lab build
The jlpm
command is JupyterLab's pinned version of
yarn that is installed with JupyterLab. You may use
yarn
or npm
in lieu of jlpm
below.
# Clone the repo to your local environment
# Move to hdfsbrowser directory
# Install server extension
# This will also build the js code
pip install -e .
# Install and enable the nbextension
jupyter nbextension install hdfsbrowser --py --sys-prefix
jupyter nbextension enable hdfsbrowser --py --sys-prefix
# Link your development version of the extension with JupyterLab
jupyter labextension link .
# Rebuild JupyterLab after making any changes
jupyter lab build
# Rebuild Typescript source after making changes
jlpm build
# Rebuild JupyterLab after making any changes
jupyter lab build
You can watch the source directory and run JupyterLab in watch mode to watch for changes in the extension's source and automatically rebuild the extension and application.
# Watch the source directory in another terminal tab
jlpm watch
# Run jupyterlab in watch mode in one terminal tab
jupyter lab --watch
pip uninstall hdfsbrowser
jupyter labextension uninstall @swan-cern/hdfsbrowser