The data architect has mandated that all tables in the Lakehouse should be configured as external (also known as "unmanaged") Delta Lake tables.
Which approach will ensure that this requirement is met?
The data architect has mandated that all tables in the Lakehouse should be configured as external (also known as "unmanaged") Delta Lake tables.
Which approach will ensure that this requirement is met?
To ensure that tables in the Lakehouse are configured as external (unmanaged) Delta Lake tables, it is necessary to specify the full file path when saving data to a table. This step ensures that the data is stored in the location specified by the user, rather than the default directory managed by Delta Lake. By doing so, the table will be considered external since the data resides outside the managed storage, fulfilling the requirement set by the data architect.
Non of the provided. It should be: When a table is created, make sure LOCATION is provided
In Delta Lake, an external (or unmanaged) table is a table created outside of the data lake but is still accessible from the data lake. The data for external tables is stored in a location specified by the user, not in the default directory of the data lake. When you save data to an external table, you need to specify the full file path where the data will be stored. This makes the table “external” because the data itself is not managed by Delta Lake, only the metadata is. This is why specifying a full file path alongside the Delta format when saving data to a table will ensure that the table is configured as an external Delta Lake table.
Here's why the other options may not ensure the requirement is met: D. Delta Lake does not use the EXTERNAL keyword in the same way as some other SQL-based systems. In Delta Lake, whether a table is external is determined by where the data files are stored, not by a keyword in the CREATE TABLE statement. %sql CREATE TABLE f1_demo.results_external USING DELTA LOCATION '/mnt/formula1dl/demo/results_external'
C is correct.
C is correct
possible answer is C
I think it should be A because when a database is created using a location all tables within this database are automatically assign as unmanaged tables.
Not quite. Test & see. The tables are 'managed' though database creation has 'LOCATION' keyword. C is best.
provide path (LOCATION)
C is plausible answer, as in this case we are writing the data to an external location