models
formatting.models
¤
Classes¤
Classification
¤
Bases: PermissionsBase
Contains instructions on how to classify the data into a specific variable.
In particular, it links a format to a variable, and provides the column indices for the value, maximum, and minimum columns, as well as the validator columns. It also contains information on whether the data is accumulated, incremental, and the resolution of the data.
Attributes:
Name | Type | Description |
---|---|---|
cls_id |
AutoField
|
Primary key. |
format |
ForeignKey
|
The format of the data file. |
variable |
ForeignKey
|
The variable to which the data belongs. |
value |
PositiveSmallIntegerField
|
Index of the value column, starting in 0. |
maximum |
PositiveSmallIntegerField
|
Index of the maximum value column, starting in 0. |
minimum |
PositiveSmallIntegerField
|
Index of the minimum value column, starting in 0. |
value_validator_column |
PositiveSmallIntegerField
|
Index of the value validator column, starting in 0. |
value_validator_text |
CharField
|
Value validator text. |
maximum_validator_column |
PositiveSmallIntegerField
|
Index of the maximum value validator column, starting in 0. |
maximum_validator_text |
CharField
|
Maximum value validator text. |
minimum_validator_column |
PositiveSmallIntegerField
|
Index of the minimum value validator column, starting in 0. |
minimum_validator_text |
CharField
|
Minimum value validator text. |
accumulate |
PositiveSmallIntegerField
|
If set to a number of minutes, the data will be accumulated over that period. |
resolution |
DecimalField
|
Resolution of the data. Only used if it is to be accumulated. |
incremental |
BooleanField
|
Whether the data is an incremental counter. If it is, any value below the previous one will be removed. |
decimal_comma |
BooleanField
|
Whether the data uses a comma as a decimal separator. |
Functions¤
__str__()
¤
Return the string representation of the object.
Source code in formatting/models.py
419 420 421 |
|
clean()
¤
Validate the model instance.
It checks that the column indices are different, and that the accumulation period is greater than zero if it is set. It also checks that the resolution is set if the data is accumulated.
Source code in formatting/models.py
427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 |
|
get_absolute_url()
¤
Get the absolute URL of the object.
Source code in formatting/models.py
423 424 425 |
|
Date
¤
Bases: PermissionsBase
Date format.
Format string for the date column. It is used to parse the date column in the data
file. The format string must be compatible with the datetime
module in Python. See
the datetime documentation
for more information on valid format codes.
Attributes:
Name | Type | Description |
---|---|---|
date_id |
AutoField
|
Primary key. |
date_format |
CharField
|
The format string for the date column in human readable
form, eg. |
code |
CharField
|
The code used to parse the date column, eg. |
Functions¤
__str__()
¤
Return the string representation of the object.
Source code in formatting/models.py
115 116 117 |
|
get_absolute_url()
¤
Get the absolute URL of the object.
Source code in formatting/models.py
119 120 121 |
|
Delimiter
¤
Bases: PermissionsBase
Delimiter between columns in the data file.
One or more characters that separate columns in a text file. The most common values
are ,
, ;
, and \t
(tab).
Attributes:
Name | Type | Description |
---|---|---|
delimiter_id |
AutoField
|
Primary key. |
name |
CharField
|
The name of the delimiter. eg. |
character |
CharField
|
The character used as a delimiter. eg. |
Functions¤
__str__()
¤
Return the string representation of the object.
Source code in formatting/models.py
78 79 80 |
|
get_absolute_url()
¤
Get the absolute URL of the object.
Source code in formatting/models.py
82 83 84 |
|
Extension
¤
Bases: PermissionsBase
Extension of the data file.
It is mostly used to choose the tool to be employed to ingest the data. While it can
take any value, there is currently explicit support only for xlsx
and xlx
.
Anything else will be interpreted as a text file and loaded using pandas.read_csv
.
Attributes:
Name | Type | Description |
---|---|---|
extension_id |
AutoField
|
Primary key. |
value |
CharField
|
The extension value. eg. |
Functions¤
__str__()
¤
Return the string representation of the object.
Source code in formatting/models.py
44 45 46 |
|
get_absolute_url()
¤
Get the absolute URL of the object.
Source code in formatting/models.py
48 49 50 |
|
Format
¤
Bases: PermissionsBase
Details of the data file format, describing how to read the file.
It combines several properties, such as the file extension, the delimiter, the date and time formats, and the column indices for the date and time columns, instructing how to read the data file and parse the dates. It is mostly used to ingest data from text files, like CSV.
Attributes:
Name | Type | Description |
---|---|---|
format_id |
AutoField
|
Primary key. |
name |
CharField
|
Short name of the format entry. |
description |
TextField
|
Description of the format. |
extension |
ForeignKey
|
The extension of the data file. |
delimiter |
ForeignKey
|
The delimiter between columns in the data file. Only required for text files. |
first_row |
PositiveSmallIntegerField
|
Index of the first row with data, starting in 0. |
footer_rows |
PositiveSmallIntegerField
|
Number of footer rows to be ignored at the end. |
date |
ForeignKey
|
Format for the date column. Only required for text files. |
date_column |
PositiveSmallIntegerField
|
Index of the date column, starting in 0. |
time |
ForeignKey
|
Format for the time column. Only required for text files. |
time_column |
PositiveSmallIntegerField
|
Index of the time column, starting in 0. |
Attributes¤
datetime_format: str
property
¤
Obtain the datetime format string.
Functions¤
__str__()
¤
Return the string representation of the object.
Source code in formatting/models.py
253 254 255 |
|
datetime_columns(delimiter)
¤
Column indices that correspond to the date and time columns in the dataset.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
delimiter
|
str
|
The delimiter used to split the date and time codes. |
required |
Returns:
Type | Description |
---|---|
list[int]
|
list[int]: A list of column indices. |
Source code in formatting/models.py
266 267 268 269 270 271 272 273 274 275 276 277 278 279 |
|
get_absolute_url()
¤
Get the absolute URL of the object.
Source code in formatting/models.py
257 258 259 |
|
Time
¤
Bases: PermissionsBase
Time format.
Format string for the time column. It is used to parse the time column in the data
file. The format string must be compatible with the datetime
module in Python. See
the datetime documentation
for more information on valid format codes.
Attributes:
Name | Type | Description |
---|---|---|
date_id |
AutoField
|
Primary key. |
date_format |
CharField
|
The format string for the date column in human readable
form, eg. |
code |
CharField
|
The code used to parse the date column, eg. |
Functions¤
__str__()
¤
Return the string representation of the object.
Source code in formatting/models.py
155 156 157 |
|
get_absolute_url()
¤
Get the absolute URL of the object.
Source code in formatting/models.py
159 160 161 |
|
Variable
¤
Bases: PermissionsBase
A variable with a physical meaning.
Such as precipitation, wind speed, wind direction, soil moisture, including the associated unit. It also includes metadata to help identify what is a reasonable value for the data, to flag outliers and to help with the validation process.
The nature of the variable can be one of the following:
- sum: Cumulative value over a period of time.
- average: Average value over a period of time.
- value: One-off value.
Attributes:
Name | Type | Description |
---|---|---|
variable_id |
AutoField
|
Primary key. |
variable_code |
CharField
|
Code of the variable, eg. airtemperature. |
name |
CharField
|
Human-readable name of the variable, eg. Air temperature. |
unit |
ForeignKey
|
Unit of the variable. |
maximum |
DecimalField
|
Maximum value allowed for the variable. |
minimum |
DecimalField
|
Minimum value allowed for the variable. |
diff_error |
DecimalField
|
If two sequential values in the time-series data of this variable differ by more than this value, the validation process can mark this with an error flag. |
outlier_limit |
DecimalField
|
The statistical deviation for defining outliers, in times the standard deviation (sigma). |
null_limit |
DecimalField
|
The max % of null values (missing, caused by e.g. equipment malfunction) allowed for hourly, daily, monthly data. Cumulative values are not deemed trustworthy if the number of missing values in a given period is greater than the null_limit. |
nature |
CharField
|
Nature of the variable, eg. if it represents a one-off value, the average over a period of time or the cumulative value over a period |
Attributes¤
is_cumulative: bool
property
¤
Return True if the nature of the variable is sum.
Functions¤
__str__()
¤
Return the string representation of the object.
Source code in variable/models.py
165 166 167 |
|
clean()
¤
Validate the model fields.
Source code in variable/models.py
173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 |
|
get_absolute_url()
¤
Get the absolute URL of the object.
Source code in variable/models.py
169 170 171 |
|