This topic describes the configuration options of PyODPS, the MaxCompute SDK for Python, and how they apply in typical scenarios.
Configuration
PyODPS provides a set of configuration options that you can access through odps.options. The following tables describe the configurable MaxCompute options.
General configurations
| Option | Description | Default value |
| --- | --- | --- |
| end_point | The MaxCompute endpoint. | None |
| default_project | The default project. | None |
| log_view_host | The hostname in the Logview URL. | None |
| log_view_hours | The validity period of the Logview URL. Unit: hours. | 24 |
| local_timezone | The time zone to use. True indicates local time, and False indicates UTC. A pytz time zone can also be specified. | None |
| lifecycle | The lifecycle of all tables. | None |
| temp_lifecycle | The lifecycle of temporary tables. | 1 |
| biz_id | The user ID. | None |
| verbose | Specifies whether to display logs. | False |
| verbose_log | The log receiver. | None |
| chunk_size | The size of the write buffer. | 1496 |
| retry_times | The number of request retries. | 4 |
| pool_connections | The number of cached connections in the connection pool. | 10 |
| pool_maxsize | The maximum capacity of the connection pool. | 10 |
| connect_timeout | The connection timeout. | 5 |
| read_timeout | The time to wait before a read operation times out. | 120 |
| completion_size | The maximum number of items listed for object completion. | 10 |
| notebook_repr_widget | Specifies whether to use interactive graphs. | True |
| sql.settings | The global hints that are applied when MaxCompute SQL statements are run. | None |
| sql.use_odps2_extension | Specifies whether to enable the MaxCompute 2.0 language extension. | False |
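As a minimal sketch of how these options are set, the snippet below assigns values by attribute access, which is the pattern that odps.options uses. The helper name apply_options and the stand-in object are hypothetical and are not part of PyODPS. The stand-in only mirrors a few option names and defaults from the general configurations so that the sketch runs without PyODPS installed, and the hint key odps.sql.mapper.split.size is used purely as an example value.

```python
from types import SimpleNamespace


def apply_options(options, settings):
    """Assign option values by dotted name, for example 'sql.settings'.

    `options` can be any object with the attribute layout of odps.options.
    This helper is hypothetical and is not part of PyODPS.
    """
    for dotted_name, value in settings.items():
        *parents, leaf = dotted_name.split(".")
        target = options
        for part in parents:
            target = getattr(target, part)  # walk to the namespace, e.g. options.sql
        setattr(target, leaf, value)


# Stand-in that mirrors a few options and their documented defaults.
options = SimpleNamespace(
    verbose=False,
    lifecycle=None,
    retry_times=4,
    sql=SimpleNamespace(settings=None),
)

apply_options(options, {
    "verbose": True,   # display logs
    "lifecycle": 7,    # lifecycle applied to all tables (example value)
    "sql.settings": {"odps.sql.mapper.split.size": 16},  # example global hint
})

print(options.verbose, options.lifecycle, options.sql.settings)

# With PyODPS installed, the equivalent direct assignments would be:
#   from odps import options
#   options.verbose = True
#   options.lifecycle = 7
#   options.sql.settings = {"odps.sql.mapper.split.size": 16}
```

Options that are not assigned, such as retry_times here, keep their default values.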
Data upload or download configurations
| Option | Description | Default value |
| --- | --- | --- |
| tunnel.endpoint | The Tunnel endpoint. | None |
| tunnel.use_instance_tunnel | Specifies whether to use Instance Tunnel to obtain the upload or download results. | True |
| tunnel.limited_instance_tunnel | Specifies whether to limit the number of data records obtained by using Instance Tunnel. | True |
| tunnel.string_as_binary | Specifies whether to use bytes instead of unicode for data of the STRING type. | False |
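To illustrate what tunnel.string_as_binary changes, the sketch below uses only the Python standard library and does not call PyODPS. It assumes, as a simplification, that a STRING value arrives as raw bytes encoded in UTF-8, and it shows the difference between keeping the value as bytes and decoding it to unicode text. The function name read_string_value is hypothetical.

```python
def read_string_value(raw, string_as_binary=False):
    """Return a STRING value as bytes or as decoded text.

    Hypothetical stand-in for reading a STRING column; UTF-8 decoding is
    an assumption made for this sketch, not a statement about PyODPS internals.
    """
    if string_as_binary:
        return raw              # keep the bytes untouched
    return raw.decode("utf-8")  # decode to unicode text


text_value = "café".encode("utf-8")  # valid UTF-8 text
binary_value = b"\xff\xfe\x00\x01"   # arbitrary binary payload, not valid UTF-8

print(read_string_value(text_value))                           # decoded str
print(read_string_value(binary_value, string_as_binary=True))  # raw bytes

decode_failed = False
try:
    read_string_value(binary_value)  # decoding arbitrary bytes raises an error
except UnicodeDecodeError as exc:
    decode_failed = True
    print("cannot decode:", exc.reason)

# With PyODPS installed, the option itself would be set as:
#   from odps import options
#   options.tunnel.string_as_binary = True
```

Under these assumptions, keeping bytes is the safer choice when STRING columns hold binary payloads that are not valid text.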
DataFrame configurations
| Option | Description | Default value |
| --- | --- | --- |
| interactive | Specifies whether DataFrame is used in an interactive environment. | Determined by environment detection |
| df.analyze | Specifies whether to enable functions that are not built into MaxCompute. | True |
| df.optimize | Specifies whether to enable full DataFrame optimization. | True |
| df.optimizes.pp | Specifies whether to enable DataFrame predicate pushdown optimization. | True |
| df.optimizes.cp | Specifies whether to enable DataFrame column pruning optimization. | True |
| df.optimizes.tunnel | Specifies whether to enable DataFrame tunnel optimization. | True |
| df.quote | Specifies whether to enclose field names and table names in backticks (``) in the MaxCompute SQL backend. | True |
| df.libraries | The resource name of the third-party library that is used in DataFrame operations. | None |
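The effect of df.quote can be pictured with a small sketch. The functions quote_name and build_select below are hypothetical and only imitate the idea of wrapping field and table names in backticks. They are not how PyODPS generates SQL internally, and the table and field names are illustrative.

```python
def quote_name(name, quote=True):
    """Wrap an identifier in backticks when quoting is enabled."""
    return "`{}`".format(name) if quote else name


def build_select(table, fields, quote=True):
    """Build a simple SELECT statement, optionally quoting identifiers."""
    columns = ", ".join(quote_name(field, quote) for field in fields)
    return "SELECT {} FROM {}".format(columns, quote_name(table, quote))


# 'order' is used as a field name to show why quoting can matter:
# unquoted, it can be read as a SQL keyword.
quoted_sql = build_select("orders", ["id", "order"])
unquoted_sql = build_select("orders", ["id", "order"], quote=False)

print(quoted_sql)
print(unquoted_sql)

# With PyODPS installed, the option itself would be set as:
#   from odps import options
#   options.df.quote = False
```

Quoting is mainly useful when a field or table name could be mistaken for a keyword, which is a reason to leave the option at its default of True unless you have a specific need to change it.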