.. Licensed to the Apache Software Foundation (ASF) under one
   or more contributor license agreements.  See the NOTICE file
   distributed with this work for additional information
   regarding copyright ownership.  The ASF licenses this file
   to you under the Apache License, Version 2.0 (the
   "License"); you may not use this file except in compliance
   with the License.  You may obtain a copy of the License at

..   http://www.apache.org/licenses/LICENSE-2.0

.. Unless required by applicable law or agreed to in writing,
   software distributed under the License is distributed on an
   "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
   KIND, either express or implied.  See the License for the
   specific language governing permissions and limitations
   under the License.
Installation & Configuration
============================
Getting Started
---------------
Superset has deprecated support for Python ``2.*`` and supports
only ``~=3.6`` to take advantage of the newer Python features and reduce
the burden of supporting previous versions. We run our test suite
against ``3.6``, but ``3.7`` is fully supported as well.
Cloud-native!
-------------
Superset is designed to be highly available. It is
"cloud-native" as it has been designed to scale out in large,
distributed environments, and works well inside containers.
While you can easily
test drive Superset on a modest setup or simply on your laptop,
there's virtually no limit around scaling out the platform.
Superset is also cloud-native in the sense that it is
flexible and lets you choose your web server (Gunicorn, Nginx, Apache),
your metadata database engine (MySQL, Postgres, MariaDB, ...),
your message queue (Redis, RabbitMQ, SQS, ...),
your results backend (S3, Redis, Memcached, ...), your caching layer
(Memcached, Redis, ...), works well with services like NewRelic, StatsD and
DataDog, and has the ability to run analytic workloads against
most popular database technologies.
Superset is battle tested in large environments with hundreds
of concurrent users. Airbnb's production environment runs inside
Kubernetes and serves 600+ daily active users viewing over 100K charts a
day.
The Superset web server and the Superset Celery workers (optional)
are stateless, so you can scale out by running on as many servers
as needed.
Start with Docker
-----------------
.. note ::

    The Docker-related files and documentation have been community-contributed
    and are not actively maintained or managed by the core committers working on
    the project. Some issues have been reported as of 2019-01.
    Help and contributions around Docker are welcome!
If you know Docker, then you're in luck: we have a shortcut for you to
initialize a development environment: ::
git clone https://github.com/apache/incubator-superset/
cd incubator-superset/contrib/docker
# prefix with SUPERSET_LOAD_EXAMPLES=yes to load examples:
docker-compose run --rm superset ./docker-init.sh
# you can run this command every time you need to start Superset now:
docker-compose up
After several minutes for Superset initialization to finish, you can open
a browser and view `http://localhost:8088` to start your journey.
From there, the container server will reload on modification of the Superset
Python and JavaScript source code.
Don't forget to reload the page to take the new frontend into account though.
See also `CONTRIBUTING.md#building <https://github.com/apache/incubator-superset/blob/master/CONTRIBUTING.md#building>`_,
for an alternative way of serving the frontend.
It is also possible to run Superset in non-development mode: in the `docker-compose.yml` file remove
the volumes needed for development and change the variable `SUPERSET_ENV` to `production`.
If you are attempting to build on a Mac and it exits with 137, you need to increase your Docker resources (memory).
macOS instructions: https://docs.docker.com/docker-for-mac/#advanced (search for "memory")
Or, if you're curious and want to install Superset from the bottom up, then go ahead.
See also `contrib/docker/README.md <https://github.com/apache/incubator-superset/blob/master/contrib/docker/README.md>`_
OS dependencies
---------------
Superset stores database connection information in its metadata database.
For that purpose, we use the ``cryptography`` Python library to encrypt
connection passwords. Unfortunately, this library has OS level dependencies.
You may want to attempt the next step
("Superset installation and initialization") and come back to this step if
you encounter an error.
Here's how to install them:
For **Debian** and **Ubuntu**, the following command will ensure that
the required dependencies are installed: ::
sudo apt-get install build-essential libssl-dev libffi-dev python-dev python-pip libsasl2-dev libldap2-dev
**Ubuntu 18.04** If you have python3.6 installed alongside python2.7, as is
the default on **Ubuntu 18.04 LTS**, run this command also: ::

sudo apt-get install build-essential libssl-dev libffi-dev python3.6-dev python-pip libsasl2-dev libldap2-dev

otherwise the build for ``cryptography`` fails.
For **Fedora** and **RHEL-derivatives**, the following command will ensure
that the required dependencies are installed: ::
sudo yum upgrade python-setuptools
sudo yum install gcc gcc-c++ libffi-devel python-devel python-pip python-wheel openssl-devel cyrus-sasl-devel openldap-devel
**Mac OS X** If possible, you should upgrade to the latest version of OS X as issues are more likely to be resolved for that version.
You *will likely need* the latest version of Xcode available for your installed version of OS X. You should also install
the Xcode command line tools: ::
xcode-select --install
System python is not recommended. Homebrew's python also ships with pip: ::
brew install pkg-config libffi openssl python
env LDFLAGS="-L$(brew --prefix openssl)/lib" CFLAGS="-I$(brew --prefix openssl)/include" pip install cryptography==2.4.2
**Windows** isn't officially supported at this point, but if you want to
attempt it, download `get-pip.py <https://bootstrap.pypa.io/get-pip.py>`_, and run ``python get-pip.py`` which may need admin access. Then run the following: ::
C:\> pip install cryptography
# You may also have to create C:\Temp
C:\> md C:\Temp
Python virtualenv
-----------------
It is recommended to install Superset inside a virtualenv. Python 3 already
ships virtualenv (as the built-in ``venv`` module). If it's not installed in your
environment for some reason, you can install it via the package manager for your
operating system, or from pip: ::
pip install virtualenv
You can create and activate a virtualenv by: ::
# virtualenv is shipped in Python 3.6+ as venv instead of pyvenv.
# See https://docs.python.org/3.6/library/venv.html
python3 -m venv venv
. venv/bin/activate
On Windows the syntax for activating it is a bit different: ::
venv\Scripts\activate
Once you have activated your virtualenv, everything you do is confined inside it.
To exit the virtualenv just type ``deactivate``.
Python's setup tools and pip
----------------------------
Put all the chances on your side by getting the very latest ``pip``
and ``setuptools`` libraries: ::
pip install --upgrade setuptools pip
Superset installation and initialization
----------------------------------------
Follow these few simple steps to install Superset.::
# Install superset
pip install apache-superset
# Initialize the database
superset db upgrade
# Create an admin user (you will be prompted to set a username, first and last name before setting a password)
export FLASK_APP=superset
flask fab create-admin
# Load some data to play with
superset load_examples
# Create default roles and permissions
superset init
# To start a development web server on port 8088, use -p to bind to another port
superset run -p 8088 --with-threads --reload --debugger
After installation, you should be able to point your browser to the right
hostname:port `http://localhost:8088 <http://localhost:8088>`_, login using
the credentials you entered while creating the admin account, and navigate to
`Menu -> Admin -> Refresh Metadata`. This action should bring in all of
your datasources for Superset to be aware of, and they should show up in
`Menu -> Datasources`, from where you can start playing with your data!
A proper WSGI HTTP Server
-------------------------
While you can set up Superset to run on Nginx or Apache, many use
Gunicorn, preferably in **async mode**, which allows for impressive
concurrency and is fairly easy to install and configure. Please
refer to the
documentation of your preferred technology to set up this Flask WSGI
application in a way that works well in your environment. Here's an **async**
setup known to work well in production: ::
 gunicorn \
-w 10 \
-k gevent \
--timeout 120 \
-b 0.0.0.0:6666 \
--limit-request-line 0 \
--limit-request-field_size 0 \
--statsd-host localhost:8125 \
superset:app
Refer to the
`Gunicorn documentation <https://docs.gunicorn.org/en/stable/design.html>`_
for more information.
Note that the development web
server (`superset run` or `flask run`) is not intended for production use.
If not using gunicorn, you may want to disable the use of flask-compress
by setting `ENABLE_FLASK_COMPRESS = False` in your `superset_config.py`.
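
For example, a minimal sketch of that override in ``superset_config.py``:

.. code-block:: python

    # Let the front-end server (e.g. Nginx) handle response compression instead
    ENABLE_FLASK_COMPRESS = False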
Flask-AppBuilder Permissions
----------------------------
By default, every time the Flask-AppBuilder (FAB) app is initialized the
permissions and views are added automatically to the backend and associated with
the Admin role. The issue, however, is that when you are running multiple concurrent
workers, this creates a lot of contention and race conditions when defining
permissions and views.
To alleviate this issue, the automatic updating of permissions can be disabled
by setting `FAB_UPDATE_PERMS = False` (defaults to True).
In a production environment initialization could take on the following form: ::

    superset init
    gunicorn -w 10 ... superset:app
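
A minimal sketch of the corresponding ``superset_config.py`` entry:

.. code-block:: python

    # Let a one-off `superset init` create/update permissions and views instead
    # of every Gunicorn worker doing it at startup
    FAB_UPDATE_PERMS = False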
Configuration behind a load balancer
------------------------------------
If you are running superset behind a load balancer or reverse proxy (e.g. NGINX
or ELB on AWS), you may need to utilise a healthcheck endpoint so that your
load balancer knows if your superset instance is running. This is provided
at ``/health`` which will return a 200 response containing "OK" if the
webserver is running.
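
A quick way to verify the endpoint from the Superset host (a sketch assuming the
default port 8088 and that the ``requests`` package is installed):

.. code-block:: python

    import requests

    # The endpoint returns HTTP 200 with the body "OK" while the web server is up
    response = requests.get('http://localhost:8088/health')
    print(response.status_code, response.text)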
If the load balancer is inserting X-Forwarded-For/X-Forwarded-Proto headers, you
should set `ENABLE_PROXY_FIX = True` in the superset config file to extract and use
the headers.
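
In ``superset_config.py`` this is a single setting:

.. code-block:: python

    # Trust the X-Forwarded-For / X-Forwarded-Proto headers set by the load balancer
    ENABLE_PROXY_FIX = True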
If the reverse proxy is used to provide SSL encryption,
an explicit definition of the `X-Forwarded-Proto` header may be required.
For the Apache webserver this can be set as follows: ::
RequestHeader set X-Forwarded-Proto "https"
Configuration
-------------
To configure your application, you need to create a file (module)
``superset_config.py`` and make sure it is in your PYTHONPATH. Here are some
of the parameters you can copy / paste in that configuration module: ::
#---------------------------------------------------------
# Superset specific config
#---------------------------------------------------------
ROW_LIMIT = 5000
SUPERSET_WEBSERVER_PORT = 8088
#---------------------------------------------------------
#---------------------------------------------------------
# Flask App Builder configuration
#---------------------------------------------------------
# Your App secret key
SECRET_KEY = '\2\1thisismyscretkey\1\2\e\y\y\h'
# The SQLAlchemy connection string to your database backend
# This connection defines the path to the database that stores your
# superset metadata (slices, connections, tables, dashboards, ...).
# Note that the connection information to connect to the datasources
# you want to explore are managed directly in the web UI
SQLALCHEMY_DATABASE_URI = 'sqlite:////path/to/superset.db'
# Flask-WTF flag for CSRF
WTF_CSRF_ENABLED = True
# Add endpoints that need to be exempt from CSRF protection
WTF_CSRF_EXEMPT_LIST = []
# A CSRF token that expires in 1 year
WTF_CSRF_TIME_LIMIT = 60 * 60 * 24 * 365
# Set this API key to enable Mapbox visualizations
MAPBOX_API_KEY = ''
All the parameters and default values defined in
https://github.com/apache/incubator-superset/blob/master/superset/config.py
can be altered in your local ``superset_config.py``.
Administrators will want to
read through the file to understand what can be configured locally
as well as the default values in place.
Since ``superset_config.py`` acts as a Flask configuration module, it
can be used to alter the settings of Flask itself,
as well as Flask extensions like ``flask-wtf``, ``flask-cache``,
``flask-migrate``, and ``flask-appbuilder``. Flask App Builder, the web
framework used by Superset, offers many configuration settings. Please consult
the `Flask App Builder Documentation
<https://flask-appbuilder.readthedocs.org/en/latest/config.html>`_
for more information on how to configure it.
Make sure to change:
* *SQLALCHEMY_DATABASE_URI*, by default it is stored at *~/.superset/superset.db*
* *SECRET_KEY*, to a long random string
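
As a sketch, a production-style override of these two settings could look like
this (the connection string and key below are placeholders, not working values):

.. code-block:: python

    # Point the metadata database at a dedicated PostgreSQL instance
    SQLALCHEMY_DATABASE_URI = 'postgresql+psycopg2://superset:superset@dbhost:5432/superset_meta'

    # Generate your own long random string, e.g. with `openssl rand -base64 42`
    SECRET_KEY = 'YOUR_OWN_LONG_RANDOM_STRING'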
In case you need to exempt endpoints from CSRF, e.g. you are running a custom
auth postback endpoint, you can add them to *WTF_CSRF_EXEMPT_LIST*: ::

    WTF_CSRF_EXEMPT_LIST = ['']
.. _ref_database_deps:
Database dependencies
---------------------
Superset does not ship bundled with connectivity to databases, except
for Sqlite, which is part of the Python standard library.
You'll need to install the required packages for the database you
want to use as your metadata database as well as the packages needed to
connect to the databases you want to access through Superset.
Here's a list of some of the recommended packages.
+------------------+---------------------------------------+-------------------------------------------------+
| database | pypi package | SQLAlchemy URI prefix |
+==================+=======================================+=================================================+
| Amazon Athena | ``pip install "PyAthenaJDBC>1.0.9"`` | ``awsathena+jdbc://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Amazon Athena | ``pip install "PyAthena>1.2.0"`` | ``awsathena+rest://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Amazon Redshift | ``pip install sqlalchemy-redshift`` | ``redshift+psycopg2://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Apache Drill | ``pip install sqlalchemy-drill`` | For the REST API: |
| | | ``drill+sadrill://`` |
| | | For JDBC: |
| | | ``drill+jdbc://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Apache Druid | ``pip install pydruid`` | ``druid://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Apache Hive | ``pip install pyhive`` | ``hive://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Apache Impala | ``pip install impyla`` | ``impala://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Apache Kylin | ``pip install kylinpy`` | ``kylin://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Apache Pinot | ``pip install pinotdb`` | ``pinot+http://CONTROLLER:5436/`` |
| | | ``query?server=http://CONTROLLER:5983/`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Apache Spark SQL | ``pip install pyhive`` | ``jdbc+hive://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| BigQuery | ``pip install pybigquery`` | ``bigquery://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| ClickHouse | ``pip install sqlalchemy-clickhouse`` | |
+------------------+---------------------------------------+-------------------------------------------------+
| Elasticsearch | ``pip install elasticsearch-dbapi`` | ``elasticsearch+http://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Exasol | ``pip install sqlalchemy-exasol`` | ``exa+pyodbc://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Google Sheets | ``pip install gsheetsdb`` | ``gsheets://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| IBM Db2 | ``pip install ibm_db_sa`` | ``db2+ibm_db://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| MySQL | ``pip install mysqlclient`` | ``mysql://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Oracle | ``pip install cx_Oracle`` | ``oracle://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| PostgreSQL | ``pip install psycopg2`` | ``postgresql+psycopg2://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Presto | ``pip install pyhive`` | ``presto://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Snowflake | ``pip install snowflake-sqlalchemy`` | ``snowflake://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| SQLite | | ``sqlite://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| SQL Server | ``pip install pymssql`` | ``mssql://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Teradata | ``pip install sqlalchemy-teradata`` | ``teradata://`` |
+------------------+---------------------------------------+-------------------------------------------------+
| Vertica | ``pip install | ``vertica+vertica_python://`` |
| | sqlalchemy-vertica-python`` | |
+------------------+---------------------------------------+-------------------------------------------------+
| Hana | ``pip install hdbcli sqlalchemy-hana``| ``hana://`` |
| | or ``pip install superset[hana]`` | |
+------------------+---------------------------------------+-------------------------------------------------+
Note that many other databases are supported, the main criteria being the
existence of a functional SqlAlchemy dialect and Python driver. Googling
the keyword ``sqlalchemy`` in addition to a keyword that describes the
database you want to connect to should get you to the right place.
Hana
------------
The connection string for Hana looks like this ::
hana://{username}:{password}@{host}:{port}
(AWS) Athena
------------
The connection string for Athena looks like this ::
awsathena+jdbc://{aws_access_key_id}:{aws_secret_access_key}@athena.{region_name}.amazonaws.com/{schema_name}?s3_staging_dir={s3_staging_dir}&...
Where you need to escape/encode at least the s3_staging_dir, i.e., ::
s3://... -> s3%3A//...
You can also use the `PyAthena` library (no Java required) like this ::
awsathena+rest://{aws_access_key_id}:{aws_secret_access_key}@athena.{region_name}.amazonaws.com/{schema_name}?s3_staging_dir={s3_staging_dir}&...
See `PyAthena <https://github.com/laughingman7743/PyAthena#sqlalchemy>`_.
(Google) BigQuery
-----------------
The connection string for BigQuery looks like this ::
bigquery://{project_id}
Additionally, you will need to configure authentication via a
Service Account. Create your Service Account via the Google
Cloud Platform control panel, provide it access to the appropriate
BigQuery datasets, and download the JSON configuration file
for the service account. In Superset, add a JSON blob to
the "Secure Extra" field in the database configuration page
with the following format ::
{
"credentials_info": <contents of credentials JSON file>
}
The resulting file should have this structure ::
{
"credentials_info": {
"type": "service_account",
"project_id": "...",
"private_key_id": "...",
"private_key": "...",
"client_email": "...",
"client_id": "...",
"auth_uri": "...",
"token_uri": "...",
"auth_provider_x509_cert_url": "...",
"client_x509_cert_url": "...",
}
}
You should then be able to connect to your BigQuery datasets.
To be able to upload data, e.g. sample data, the python library `pandas_gbq` is required.
Elasticsearch
-------------
The connection string for Elasticsearch looks like this ::
elasticsearch+http://{user}:{password}@{host}:9200/
Using HTTPS ::
elasticsearch+https://{user}:{password}@{host}:9200/
Elasticsearch has a default limit of 10000 rows, so you can increase this limit on your cluster
or set Superset's row limit in the config ::
ROW_LIMIT = 10000
You can query multiple indices in SQL Lab, for example ::
select timestamp, agent from "logstash-*"
But, to use visualizations for multiple indices you need to create an alias index on your cluster ::
POST /_aliases
{
"actions" : [
{ "add" : { "index" : "logstash-**", "alias" : "logstash_all" } }
]
}
Then register your table with the ``alias`` name ``logstash_all``.
Snowflake
---------
The connection string for Snowflake looks like this ::
snowflake://{user}:{password}@{account}.{region}/{database}?role={role}&warehouse={warehouse}
The schema is not necessary in the connection string, as it is defined per table/query.
The role and warehouse can be omitted if defaults are defined for the user, i.e. ::

    snowflake://{user}:{password}@{account}.{region}/{database}

Make sure the user has privileges to access and use all required
databases/schemas/tables/views/warehouses, as the Snowflake SQLAlchemy engine does
not test for user rights during engine creation.
See `Snowflake SQLAlchemy <https://github.com/snowflakedb/snowflake-sqlalchemy>`_.
Teradata
---------
The connection string for Teradata looks like this ::
teradata://{user}:{password}@{host}
*Note*: It's required to have Teradata ODBC drivers installed and environment variables configured for the sqlalchemy dialect to work properly. Teradata ODBC drivers are available here: https://downloads.teradata.com/download/connectivity/odbc-driver/linux
Required environment variables: ::
export ODBCINI=/.../teradata/client/ODBC_64/odbc.ini
export ODBCINST=/.../teradata/client/ODBC_64/odbcinst.ini
See `Teradata SQLAlchemy <https://github.com/Teradata/sqlalchemy-teradata>`_.
Apache Drill
------------
At the time of writing, the SQLAlchemy Dialect is not available on pypi and must be downloaded here:
`SQLAlchemy Drill <https://github.com/JohnOmernik/sqlalchemy-drill>`_
Alternatively, you can install it completely from the command line as follows: ::
git clone https://github.com/JohnOmernik/sqlalchemy-drill
cd sqlalchemy-drill
python3 setup.py install
Once that is done, you can connect to Drill in two ways, either via the REST interface or by JDBC. If you are connecting via JDBC, you must have the
Drill JDBC Driver installed.
The basic connection string for Drill looks like this ::
drill+sadrill://{username}:{password}@{host}:{port}/{storage_plugin}?use_ssl=True
If you are using JDBC to connect to Drill, the connection string looks like this: ::
drill+jdbc://{username}:{password}@{host}:{port}/{storage_plugin}
For a complete tutorial about how to use Apache Drill with Superset, see this tutorial:
`Visualize Anything with Superset and Drill <http://thedataist.com/visualize-anything-with-superset-and-drill/>`_
Caching
-------
Superset uses `Flask-Cache <https://pythonhosted.org/Flask-Cache/>`_ for
caching purposes. Configuring your caching backend is as easy as providing
a ``CACHE_CONFIG`` constant in your ``superset_config.py`` that
complies with the Flask-Cache specifications.
Flask-Cache supports multiple caching backends (Redis, Memcached,
SimpleCache (in-memory), or the local filesystem). If you are going to use
Memcached please use the `pylibmc` client library as `python-memcached` does
not handle storing binary data correctly. If you use Redis, please install
the `redis <https://pypi.python.org/pypi/redis>`_ Python package: ::
pip install redis
For setting your timeouts, this is done in the Superset metadata and goes
up the "timeout searchpath", from your slice configuration, to your
data source's configuration, to your database's and ultimately falls back
into your global default defined in ``CACHE_CONFIG``.
.. code-block:: python
CACHE_CONFIG = {
'CACHE_TYPE': 'redis',
'CACHE_DEFAULT_TIMEOUT': 60 * 60 * 24, # 1 day default (in secs)
'CACHE_KEY_PREFIX': 'superset_results',
'CACHE_REDIS_URL': 'redis://localhost:6379/0',
}
It is also possible to pass a custom cache initialization function in the
config to handle additional caching use cases. The function must return an
object that is compatible with the `Flask-Cache <https://pythonhosted.org/Flask-Cache/>`_ API.
.. code-block:: python
from custom_caching import CustomCache
def init_cache(app):
"""Takes an app instance and returns a custom cache backend"""
config = {
'CACHE_DEFAULT_TIMEOUT': 60 * 60 * 24, # 1 day default (in secs)
'CACHE_KEY_PREFIX': 'superset_results',
}
return CustomCache(app, config)
CACHE_CONFIG = init_cache
Superset has a Celery task that will periodically warm up the cache based on
different strategies. To use it, add the following to the `CELERYBEAT_SCHEDULE`
section in `config.py`:
.. code-block:: python
CELERYBEAT_SCHEDULE = {
'cache-warmup-hourly': {
'task': 'cache-warmup',
'schedule': crontab(minute=0, hour='*'), # hourly
'kwargs': {
'strategy_name': 'top_n_dashboards',
'top_n': 5,
'since': '7 days ago',
},
},
}
This will cache all the charts in the top 5 most popular dashboards every hour.
For other strategies, check the `superset/tasks/cache.py` file.
Deeper SQLAlchemy integration
-----------------------------
It is possible to tweak the database connection information using the
parameters exposed by SQLAlchemy. In the ``Database`` edit view, you will
find an ``extra`` field as a ``JSON`` blob.
.. image:: images/tutorial/add_db.png
:scale: 30 %
This JSON string contains extra configuration elements. The ``engine_params``
object gets unpacked into the
`sqlalchemy.create_engine <https://docs.sqlalchemy.org/en/latest/core/engines.html#sqlalchemy.create_engine>`_ call,
while the ``metadata_params`` get unpacked into the
`sqlalchemy.MetaData <https://docs.sqlalchemy.org/en/rel_1_2/core/metadata.html#sqlalchemy.schema.MetaData>`_ call. Refer to the SQLAlchemy docs for more information.
.. note:: If you're using CTAS on SQL Lab and PostgreSQL,
   take a look at :ref:`ref_ctas_engine_config` for specific ``engine_params``.
Schemas (Postgres & Redshift)
-----------------------------
Postgres and Redshift, as well as other databases,
use the concept of **schema** as a logical entity
on top of the **database**. For Superset to connect to a specific schema,
there's a **schema** parameter you can set in the table form.
External Password store for SQLAlchemy connections
--------------------------------------------------
It is possible to use an external store for your database passwords. This is
useful if you are running a custom secret distribution framework and do not wish
to store secrets in Superset's meta database.
Example:
Write a function that takes a single argument of type ``sqla.engine.url`` and returns
the password for the given connection string. Then set ``SQLALCHEMY_CUSTOM_PASSWORD_STORE``
in your config file to point to that function. ::
    def example_lookup_password(url):
        secret = <<get password from external framework>>
        return secret

    SQLALCHEMY_CUSTOM_PASSWORD_STORE = example_lookup_password
A common pattern is to use environment variables to make secrets available.
``SQLALCHEMY_CUSTOM_PASSWORD_STORE`` can also be used for that purpose. ::
    import os

    def example_password_as_env_var(url):
        # assuming the uri looks like
        # mysql://localhost?superset_user:{SUPERSET_PASSWORD}
        return url.password.format(**os.environ)

    SQLALCHEMY_CUSTOM_PASSWORD_STORE = example_password_as_env_var
SSL Access to databases
-----------------------
This example worked with a MySQL database that requires SSL. The configuration
may differ with other backends. This is what was put in the ``extra``
parameter ::
{
"metadata_params": {},
"engine_params": {
"connect_args":{
"sslmode":"require",
"sslrootcert": "/path/to/my/pem"
}
}
}
Druid
-----
* From the UI, enter the information about your clusters in the
`Sources -> Druid Clusters` menu by hitting the + sign.
* Once the Druid cluster connection information is entered, hit the
`Sources -> Refresh Druid Metadata` menu item to populate the metadata
* Navigate to your datasources
Note that you can run the ``superset refresh_druid`` command to refresh the
metadata from your Druid cluster(s).
Presto
------
By default Superset assumes the most recent version of Presto is being used when
querying the datasource. If you're using an older version of Presto, you can configure
it in the ``extra`` parameter::
{
"version": "0.123"
}
Exasol
---------
The connection string for Exasol looks like this ::
exa+pyodbc://{user}:{password}@{host}
*Note*: It's required to have Exasol ODBC drivers installed for the sqlalchemy dialect to work properly. Exasol ODBC drivers are available here: https://www.exasol.com/portal/display/DOWNLOAD/Exasol+Download+Section
Example config (odbcinst.ini can be left empty) ::
$ cat $/.../path/to/odbc.ini
[EXAODBC]
DRIVER = /.../path/to/driver/EXASOL_driver.so
EXAHOST = host:8563
EXASCHEMA = main
See `SQLAlchemy for Exasol <https://github.com/blue-yonder/sqlalchemy_exasol>`_.
CORS
----
The extra CORS dependency must be installed: ::

    superset[cors]

The following keys in `superset_config.py` can be specified to configure CORS:
* ``ENABLE_CORS``: Must be set to True in order to enable CORS
* ``CORS_OPTIONS``: options passed to Flask-CORS (`documentation <https://flask-cors.corydolphin.com/en/latest/api.html#extension>`)
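
A sketch combining the two keys (the option names shown are standard Flask-CORS
options; the origin listed is a placeholder for your own domain):

.. code-block:: python

    ENABLE_CORS = True
    CORS_OPTIONS = {
        'supports_credentials': True,
        'allow_headers': ['*'],
        'resources': ['*'],
        'origins': ['http://myappdomain.example.com'],
    }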
Domain Sharding
---------------
Chrome allows up to 6 open connections per domain at a time. When there are more
than 6 slices in a dashboard, a lot of fetch requests are queued up and wait for
the next available socket. `PR 5039 <https://github.com/apache/incubator-superset/pull/5039>`_ adds domain sharding to Superset,
and this feature will be enabled by configuration only (by default Superset
doesn't allow cross-domain requests).
* ``SUPERSET_WEBSERVER_DOMAINS``: list of allowed hostnames for domain sharding feature. default `None`
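
For example (hypothetical hostnames that must all resolve to the same Superset
deployment):

.. code-block:: python

    SUPERSET_WEBSERVER_DOMAINS = [
        'superset-1.example.com',
        'superset-2.example.com',
        'superset-3.example.com',
    ]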
Middleware
----------
Superset allows you to add your own middleware. To add your own middleware, update the ``ADDITIONAL_MIDDLEWARE`` key in
your `superset_config.py`. ``ADDITIONAL_MIDDLEWARE`` should be a list of your additional middleware classes.
For example, to use AUTH_REMOTE_USER from behind a proxy server like nginx, you have to add a simple middleware class to
add the value of ``HTTP_X_PROXY_REMOTE_USER`` (or any other custom header from the proxy) to Gunicorn's ``REMOTE_USER``
environment variable: ::
class RemoteUserMiddleware(object):
def __init__(self, app):
self.app = app
def __call__(self, environ, start_response):
user = environ.pop('HTTP_X_PROXY_REMOTE_USER', None)
environ['REMOTE_USER'] = user
return self.app(environ, start_response)
ADDITIONAL_MIDDLEWARE = [RemoteUserMiddleware, ]
*Adapted from http://flask.pocoo.org/snippets/69/*
Event Logging
-------------
Superset by default logs special action events in its database. These logs can be accessed in the UI by navigating to
"Security" -> "Action Log". You can freely customize these logs by implementing your own event log class.
Example of a simple JSON to Stdout class::

    import json

    # AbstractEventLogger is Superset's base event logger class
    class JSONStdOutEventLogger(AbstractEventLogger):

        def log(self, user_id, action, *args, **kwargs):
            records = kwargs.get('records', list())
            dashboard_id = kwargs.get('dashboard_id')
            slice_id = kwargs.get('slice_id')
            duration_ms = kwargs.get('duration_ms')
            referrer = kwargs.get('referrer')

            for record in records:
                log = dict(
                    action=action,
                    json=record,
                    dashboard_id=dashboard_id,
                    slice_id=slice_id,
                    duration_ms=duration_ms,
                    referrer=referrer,
                    user_id=user_id,
                )
                print(json.dumps(log))
Then on Superset's config pass an instance of the logger type you want to use. ::

    EVENT_LOGGER = JSONStdOutEventLogger()

Upgrading
---------
Upgrading should be as straightforward as running::
pip install apache-superset --upgrade
superset db upgrade
superset init
We recommend following standard best practices when upgrading Superset, such
as taking a database backup prior to the upgrade, upgrading a staging
environment prior to upgrading production, and upgrading production while fewer
users are active on the platform.
.. note ::
Some upgrades may contain backward-incompatible changes, or require
scheduling downtime, when that is the case, contributors attach notes in
``UPDATING.md`` in the repository. It's recommended to review this
file prior to running an upgrade.
Celery Tasks
------------
On large analytic databases, it's common to run queries that
execute for minutes or hours.
To enable support for long running queries that
execute beyond the typical web request's timeout (30-60 seconds), it is
necessary to configure an asynchronous backend for Superset which consists of:
* one or many Superset workers (which are implemented as Celery workers), which
  can be started with the ``celery worker`` command; run
  ``celery worker --help`` to view the related options.
* a celery broker (message queue) for which we recommend using Redis
or RabbitMQ
* a results backend that defines where the worker will persist the query
results
Configuring Celery requires defining a ``CELERY_CONFIG`` in your
``superset_config.py``. Both the worker and web server processes should
have the same configuration.
.. code-block:: python
class CeleryConfig(object):
BROKER_URL = 'redis://localhost:6379/0'
CELERY_IMPORTS = (
'superset.sql_lab',
'superset.tasks',
)
CELERY_RESULT_BACKEND = 'redis://localhost:6379/0'
CELERYD_LOG_LEVEL = 'DEBUG'
CELERYD_PREFETCH_MULTIPLIER = 10
CELERY_ACKS_LATE = True
CELERY_ANNOTATIONS = {
'sql_lab.get_sql_results': {
'rate_limit': '100/s',
},
'email_reports.send': {
'rate_limit': '1/s',
'time_limit': 120,
'soft_time_limit': 150,
'ignore_result': True,
},
}
CELERYBEAT_SCHEDULE = {
'email_reports.schedule_hourly': {
'task': 'email_reports.schedule_hourly',
'schedule': crontab(minute=1, hour='*'),
},
}
CELERY_CONFIG = CeleryConfig
* To start a Celery worker to leverage the configuration run: ::
celery worker --app=superset.tasks.celery_app:app --pool=prefork -O fair -c 4
* To start a job which schedules periodic background jobs, run ::
celery beat --app=superset.tasks.celery_app:app
To set up a results backend, you need to pass an instance of a derivative
of ``werkzeug.contrib.cache.BaseCache`` to the ``RESULTS_BACKEND``
configuration key in your ``superset_config.py``. It's possible to use
Memcached, Redis, S3 (https://pypi.python.org/pypi/s3werkzeugcache),
memory or the file system (in a single server-type setup or for testing),
or to write your own caching interface. Your ``superset_config.py`` may
look something like:
.. code-block:: python
# On S3
from s3cache.s3cache import S3Cache
S3_CACHE_BUCKET = 'foobar-superset'
S3_CACHE_KEY_PREFIX = 'sql_lab_result'
RESULTS_BACKEND = S3Cache(S3_CACHE_BUCKET, S3_CACHE_KEY_PREFIX)
# On Redis
from werkzeug.contrib.cache import RedisCache
RESULTS_BACKEND = RedisCache(
host='localhost', port=6379, key_prefix='superset_results')
For performance gains, `MessagePack <https://github.com/msgpack/msgpack-python>`_
and `PyArrow <https://arrow.apache.org/docs/python/>`_ are now used for results
serialization. This can be disabled by setting ``RESULTS_BACKEND_USE_MSGPACK = False``
in your configuration, should any issues arise. Please clear your existing results
cache store when upgrading an existing environment.
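
Should you need to opt out, this is the single setting involved:

.. code-block:: python

    # Disable MessagePack / PyArrow serialization of SQL Lab results
    RESULTS_BACKEND_USE_MSGPACK = False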
**Important notes**
* It is important that all the worker nodes and web servers in
the Superset cluster share a common metadata database.
This means that SQLite will not work in this context since it has
limited support for concurrency and
typically lives on the local file system.
* There should only be one instance of ``celery beat`` running in your
entire setup. If not, background jobs can get scheduled multiple times
resulting in weird behaviors like duplicate delivery of reports,
higher than expected load / traffic etc.
* SQL Lab will only run your queries asynchronously if you enable
"Asynchronous Query Execution" in your database settings.
Email Reports
-------------
Email reports allow users to schedule email reports for
* chart and dashboard visualizations (attachment or inline)
* chart data (CSV attachment or inline table)
**Setup**
Make sure you enable email reports in your configuration file
.. code-block:: python
ENABLE_SCHEDULED_EMAIL_REPORTS = True
Now you will find two new items in the navigation bar that allow you to schedule email
reports
* Manage -> Dashboard Emails
* Manage -> Chart Email Schedules
Schedules are defined in crontab format and each schedule
can have a list of recipients (all of them can receive a single mail,
or separate mails). For audit purposes, all outgoing mails can have a
mandatory bcc.
In order for them to get picked up you need to configure a celery worker and a celery beat
(see section above "Celery Tasks"). Your celery configuration also
needs an entry ``email_reports.schedule_hourly`` for ``CELERYBEAT_SCHEDULE``.
To send emails you need to configure SMTP settings in your configuration file. e.g.
.. code-block:: python
EMAIL_NOTIFICATIONS = True
SMTP_HOST = "email-smtp.eu-west-1.amazonaws.com"
SMTP_STARTTLS = True
SMTP_SSL = False
SMTP_USER = "smtp_username"
SMTP_PORT = 25
SMTP_PASSWORD = os.environ.get("SMTP_PASSWORD")
SMTP_MAIL_FROM = "insights@komoot.com"
To render dashboards you need to install a local browser on your superset instance
* `geckodriver <https://github.com/mozilla/geckodriver>`_ and Firefox is preferred
* `chromedriver <http://chromedriver.chromium.org/>`_ is a good option too
You need to adjust the ``EMAIL_REPORTS_WEBDRIVER`` accordingly in your configuration.
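
For example, a sketch assuming geckodriver and Firefox are installed on the
worker host (the value names the webdriver to use):

.. code-block:: python

    EMAIL_REPORTS_WEBDRIVER = 'firefox'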
You also need to specify on behalf of which username to render the dashboards. In general
dashboards and charts are not accessible to unauthorized requests, that is why the
worker needs to take over credentials of an existing user to take a snapshot. ::
EMAIL_REPORTS_USER = 'username_with_permission_to_access_dashboards'
**Important notes**
* Be mindful of the concurrency setting for celery (using ``-c 4``).
Selenium/webdriver instances can consume a lot of CPU / memory on your servers.
* In some cases, if you notice a lot of leaked ``geckodriver`` processes, try running
your celery processes with ::
celery worker --pool=prefork --max-tasks-per-child=128 ...
* It is recommended to run separate workers for ``sql_lab`` and
  ``email_reports`` tasks. This can be done by using the ``queue`` field in ``CELERY_ANNOTATIONS``.
* Adjust ``WEBDRIVER_BASEURL`` in your config if celery workers can't access superset via its
default value ``http://0.0.0.0:8080/`` (notice the port number 8080, many other setups use
port 8088).
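
A sketch of that override, assuming the web server listens on the common port 8088:

.. code-block:: python

    WEBDRIVER_BASEURL = 'http://localhost:8088/'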
SQL Lab
-------
SQL Lab is a powerful SQL IDE that works with all SQLAlchemy compatible
databases. By default, queries are executed in the scope of a web
request so they may eventually timeout as queries exceed the maximum duration of a web
request in your environment, whether it is a reverse proxy or the Superset
server itself. In such cases, it is preferred to use ``celery`` to run the queries
in the background. Please follow the examples/notes mentioned above to get your
celery setup working.
Also note that SQL Lab supports Jinja templating in queries and that it's
possible to overload
the default Jinja context in your environment by defining
``JINJA_CONTEXT_ADDONS`` in your Superset configuration. Objects referenced
in this dictionary are made available for users to use in their SQL.
.. code-block:: python
JINJA_CONTEXT_ADDONS = {
'my_crazy_macro': lambda x: x*2,
}
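With the example macro above, a user could then write something like ``SELECT {{ my_crazy_macro(21) }}`` in SQL Lab, and the Jinja template would be rendered before the query is sent to the database.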
SQL Lab also includes a live query validation feature with pluggable backends.
You can configure which validation implementation is used with which database
engine by adding a block like the following to your ``config.py``:
.. code-block:: python
FEATURE_FLAGS = {
'SQL_VALIDATORS_BY_ENGINE': {
'presto': 'PrestoDBSQLValidator',
}
}
The available validators and names can be found in ``sql_validators/``.
**Scheduling queries**
You can optionally allow your users to schedule queries directly in SQL Lab.
This is done by adding extra metadata to saved queries, which are then picked
up by an external scheduler (like `Apache Airflow <https://airflow.apache.org/>`_).
To allow scheduled queries, add the following to your ``config.py``:
.. code-block:: python
FEATURE_FLAGS = {
# Configuration for scheduling queries from SQL Lab. This information is
# collected when the user clicks "Schedule query", and saved into the `extra`
# field of saved queries.
# See: https://github.com/mozilla-services/react-jsonschema-form
'SCHEDULED_QUERIES': {
'JSONSCHEMA': {
'title': 'Schedule',
'description': (
'In order to schedule a query, you need to specify when it '
'should start running, when it should stop running, and how '
'often it should run. You can also optionally specify '
'dependencies that should be met before the query is '
'executed. Please read the documentation for best practices '
'and more information on how to specify dependencies.'
),
'type': 'object',
'properties': {
'output_table': {
'type': 'string',
'title': 'Output table name',
},
'start_date': {
'type': 'string',
'title': 'Start date',
# date-time is parsed using the chrono library, see
# https://www.npmjs.com/package/chrono-node#usage
'format': 'date-time',
'default': 'tomorrow at 9am',
},
'end_date': {
'type': 'string',
'title': 'End date',
# date-time is parsed using the chrono library, see
# https://www.npmjs.com/package/chrono-node#usage
'format': 'date-time',
'default': '9am in 30 days',
},
'schedule_interval': {
'type': 'string',
'title': 'Schedule interval',
},
'dependencies': {
'type': 'array',
'title': 'Dependencies',
'items': {
'type': 'string',
},
},
},
},
'UISCHEMA': {
'schedule_interval': {
'ui:placeholder': '@daily, @weekly, etc.',
},
'dependencies': {
'ui:help': (
'Check the documentation for the correct format when '
'defining dependencies.'
),
},
},
'VALIDATION': [
# ensure that start_date <= end_date
{
'name': 'less_equal',
'arguments': ['start_date', 'end_date'],
'message': 'End date cannot be before start date',
# this is where the error message is shown
'container': 'end_date',
},
],
# link to the scheduler; this example links to an Airflow pipeline
# that uses the query id and the output table as its name
'linkback': (
'https://airflow.example.com/admin/airflow/tree?'
'dag_id=query_${id}_${extra_json.schedule_info.output_table}'
),
},
}
This feature flag is based on `react-jsonschema-form <https://github.com/mozilla-services/react-jsonschema-form>`_,
and will add a button called "Schedule Query" to SQL Lab. When the button is
clicked, a modal will show up where the user can add the metadata required for
scheduling the query.
This information can then be retrieved from the ``/savedqueryviewapi/api/read`` endpoint
and used to schedule the queries that have ``scheduled_queries`` in their JSON
metadata. For schedulers other than Airflow, additional fields can easily be
added to the configuration file above.
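As a rough illustration (not an official client), an external scheduler could poll this endpoint and hand the scheduled queries off to its own pipeline. The response structure (``result``, ``extra_json``, ``schedule_info``) and the cookie-based authentication used below are assumptions that should be verified against your deployment:

.. code-block:: python

    import json

    import requests

    def fetch_scheduled_queries(base_url, session_cookie):
        """Yield (query_id, sql, schedule_info) for saved queries with a schedule."""
        resp = requests.get(
            base_url + "/savedqueryviewapi/api/read",
            cookies={"session": session_cookie},
        )
        resp.raise_for_status()
        for saved_query in resp.json().get("result", []):
            extra = json.loads(saved_query.get("extra_json") or "{}")
            schedule_info = extra.get("schedule_info")
            if schedule_info:
                # e.g. generate an Airflow DAG named
                # query_<id>_<output table from schedule_info>
                yield saved_query["id"], saved_query["sql"], schedule_info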
Celery Flower
-------------
Flower is a web-based tool for monitoring the Celery cluster, which you can
install from pip: ::
pip install flower
and run via: ::
celery flower --app=superset.tasks.celery_app:app
Building from source
---------------------
More advanced users may want to build Superset from source. That
would be the case if you fork the project to add features specific to
your environment. See `CONTRIBUTING.md#setup-local-environment-for-development <https://github.com/apache/incubator-superset/blob/master/CONTRIBUTING.md#setup-local-environment-for-development>`_.
Blueprints
----------
`Blueprints are Flask's reusable apps <https://flask.palletsprojects.com/en/1.0.x/tutorial/views/>`_.
Superset allows you to specify an array of Blueprints
in your ``superset_config`` module. Here's
an example of how this can work with a simple Blueprint. By doing
so, you can expect Superset to serve a page that says "OK"
at the ``/simple_page`` URL. This can allow you to run other things, such
as custom data visualization applications alongside Superset, on the
same server.
.. code-block:: python
from flask import Blueprint
simple_page = Blueprint('simple_page', __name__,
template_folder='templates')
@simple_page.route('/', defaults={'page': 'index'})
@simple_page.route('/<page>')
def show(page):
return "Ok"
BLUEPRINTS = [simple_page]
StatsD logging
--------------
Superset is instrumented to log events to StatsD if desired. Most endpoints hit
are logged as well as key events like query start and end in SQL Lab.
To set up StatsD logging, it's a matter of configuring the logger in your
``superset_config.py``.
.. code-block:: python
from superset.stats_logger import StatsdStatsLogger
STATS_LOGGER = StatsdStatsLogger(host='localhost', port=8125, prefix='superset')
Note that it's also possible to implement your own logger by deriving
``superset.stats_logger.BaseStatsLogger``.
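For instance, here is a minimal sketch of a custom logger (the class name is illustrative) that writes stats to the standard Python logger; verify the exact hook names and signatures against ``BaseStatsLogger`` in your Superset version:

.. code-block:: python

    import logging

    from superset.stats_logger import BaseStatsLogger

    class VerboseStatsLogger(BaseStatsLogger):
        """Emit Superset stats through the standard logging module."""

        def incr(self, key):
            logging.info("[stats] incr %s", key)

        def decr(self, key):
            logging.info("[stats] decr %s", key)

        def timing(self, key, value):
            logging.info("[stats] timing %s %s", key, value)

    STATS_LOGGER = VerboseStatsLogger()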
Install Superset with helm in Kubernetes
----------------------------------------
You can install Superset into Kubernetes with `Helm <https://helm.sh/>`_. The chart is
located in ``install/helm``.
To install Superset into your Kubernetes cluster:
.. code-block:: bash
helm upgrade --install superset ./install/helm/superset
Note that the above command will install Superset into the ``default`` namespace of your Kubernetes cluster.
Custom OAuth2 configuration
---------------------------
Beyond the FAB-supported providers (GitHub, Twitter, LinkedIn, Google, Azure), it's easy to connect Superset with other OAuth2 Authorization Server implementations that support "code" authorization.
The first step is to configure authorization in Superset's ``superset_config.py``.
.. code-block:: python
AUTH_TYPE = AUTH_OAUTH
OAUTH_PROVIDERS = [
{ 'name':'egaSSO',
'token_key':'access_token', # Name of the token in the response of access_token_url
'icon':'fa-address-card', # Icon for the provider
'remote_app': {
'consumer_key':'myClientId', # Client Id (Identify Superset application)
'consumer_secret':'MySecret', # Secret for this Client Id (Identify Superset application)
'request_token_params':{
'scope': 'read' # Scope for the Authorization
},
'access_token_method':'POST', # HTTP Method to call access_token_url
'access_token_params':{ # Additional parameters for calls to access_token_url
'client_id':'myClientId'
},
'access_token_headers':{ # Additional headers for calls to access_token_url
'Authorization': 'Basic Base64EncodedClientIdAndSecret'
},
'base_url':'https://myAuthorizationServer/oauth2AuthorizationServer/',
'access_token_url':'https://myAuthorizationServer/oauth2AuthorizationServer/token',
'authorize_url':'https://myAuthorizationServer/oauth2AuthorizationServer/authorize'
}
}
]
    # Will allow user self-registration, creating Flask users from the authorized user
AUTH_USER_REGISTRATION = True
# The default user self registration role
AUTH_USER_REGISTRATION_ROLE = "Public"
The second step is to create a ``CustomSsoSecurityManager`` that extends ``SupersetSecurityManager`` and overrides ``oauth_user_info``:
.. code-block:: python
from superset.security import SupersetSecurityManager
class CustomSsoSecurityManager(SupersetSecurityManager):
def oauth_user_info(self, provider, response=None):
logging.debug("Oauth2 provider: {0}.".format(provider))
if provider == 'egaSSO':
                # As an example, this line requests a GET to base_url + '/' + userDetails with Bearer authentication,
                # and expects the authorization server to check the token and respond with the user details
me = self.appbuilder.sm.oauth_remotes[provider].get('userDetails').data
logging.debug("user_data: {0}".format(me))
return { 'name' : me['name'], 'email' : me['email'], 'id' : me['user_name'], 'username' : me['user_name'], 'first_name':'', 'last_name':''}
...
This file must be located in the same directory as ``superset_config.py``, with the name ``custom_sso_security_manager.py``.
Then we can add these two lines to ``superset_config.py``:
.. code-block:: python
from custom_sso_security_manager import CustomSsoSecurityManager
CUSTOM_SECURITY_MANAGER = CustomSsoSecurityManager
Feature Flags
-------------
Because Superset serves a wide variety of users, some features are not enabled by default; for example, some deployments have stronger security restrictions than others. Superset therefore allows features to be enabled or disabled through configuration. For feature owners, this means you can add optional functionality to Superset that will only affect the subset of users who opt in.
You can enable or disable features with flags in ``superset_config.py``:
.. code-block:: python
DEFAULT_FEATURE_FLAGS = {
'CLIENT_CACHE': False,
'ENABLE_EXPLORE_JSON_CSRF_PROTECTION': False,
'PRESTO_EXPAND_DATA': False,
}
Here is a list of flags and descriptions:
* ENABLE_EXPLORE_JSON_CSRF_PROTECTION
  * For some security concerns, you may need to enforce CSRF protection on all query requests to the ``explore_json`` endpoint. Superset uses `flask-csrf <https://sjl.bitbucket.io/flask-csrf/>`_ to add CSRF protection to all POST requests, but this protection doesn't apply to the GET method.
  * When ENABLE_EXPLORE_JSON_CSRF_PROTECTION is set to true, your users cannot make GET requests to ``explore_json``. The default value for this feature is False (the current behavior): ``explore_json`` accepts both GET and POST requests. See `PR 7935 <https://github.com/apache/incubator-superset/pull/7935>`_ for more details.
* PRESTO_EXPAND_DATA
* When this feature is enabled, nested types in Presto will be expanded into extra columns and/or arrays. This is experimental, and doesn't work with all nested types.
SIP-15
------
`SIP-15 <https://github.com/apache/incubator-superset/issues/6360>`_ aims to ensure that time intervals are handled in a consistent and transparent manner for both the Druid and SQLAlchemy connectors.
Prior to SIP-15, SQLAlchemy used inclusive endpoints; however, these may behave like exclusive endpoints for string columns (due to lexicographical ordering) if no formatting was defined and the column format did not conform to an ISO 8601 date-time (refer to the SIP for details).
To remedy this, rather than having to define the date/time format for every non-ISO 8601 date-time column, one can define a default column mapping on a per-database level via the ``extra`` parameter ::
{
"python_date_format_by_column_name": {
"ds": "%Y-%m-%d"
}
}
**New deployments**
All new Superset deployments should enable SIP-15 via:
.. code-block:: python
SIP_15_ENABLED = True
**Existing deployments**
Given that it is not apparent whether the chart creator was aware of the time range inconsistencies (and adjusted the endpoints accordingly), changing the behavior of all charts is overly aggressive. Instead, SIP-15 provides a soft transition, allowing producers (chart owners) to see the impact of the proposed change and adjust their charts accordingly.
Prior to enabling SIP-15, existing deployments should communicate to their users the impact of the change and define a grace period end date (exclusive, of course) after which all charts will conform to the [start, end) interval, i.e.,
.. code-block:: python
    from datetime import date
SIP_15_ENABLED = True
SIP_15_GRACE_PERIOD_END = date(<YYYY>, <MM>, <DD>)
To aid with transparency, the current endpoint behavior is explicitly called out in the chart time range (post SIP-15 this will be [start, end) for all connectors and databases). One can override the defaults on a per-database level via the ``extra``
parameter ::
{
"time_range_endpoints": ["inclusive", "inclusive"]
}
Note that in a future release the interim SIP-15 logic will be removed (including the ``time_range_endpoints`` form-data field) via a code change and an Alembic migration.