@kylebrandt
Last active October 6, 2025 13:16
SQL Expressions and Dataplane DOC drafts

Dataplane FAQ

Note: much of this is moved to grafana/dataplane#56

How does dataplane relate to Dataframes?

Dataplane adds a data typing layer on top of data frames. By analogy, Dataplane types are to dataframes as TypeScript is to JavaScript.

Specifically, a property is added to the dataframe to indicate its type, which is made up of a kind and a format, for example "Timeseries Wide".

There are two properties attached to each data frame for this: frame.meta.type and frame.meta.typeVersion.
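As a sketch of how this looks on the wire (a hypothetical, trimmed serialized frame; only the typing-related properties are shown), the type string combines the kind and format:

```python
import json

# Hypothetical trimmed frame JSON: only the typing-related properties.
frame_json = """
{
  "schema": {
    "meta": {
      "type": "timeseries-wide",
      "typeVersion": [0, 1]
    }
  }
}
"""

frame = json.loads(frame_json)
meta = frame["schema"]["meta"]

# The type string encodes both parts: "<kind>-<format>".
kind, fmt = meta["type"].split("-", 1)
print(kind, fmt)            # timeseries wide
print(meta["typeVersion"])  # [0, 1]
```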

What is the Dataplane "Contract"?

The contract is a written set of rules for each dataplane data type. It says how frames should be formed by producers of data (datasources, transformations), and how consumers like dashboards, alerting, and apps can expect that data to be shaped.

In short it describes the rules for valid and invalid schemas for each type.

What is the point?

The main point is self-interoperability within Grafana between datasources and features like Dashboards and alerting.

For devs: data source authors know what type of frames to output, and authors of features know what to expect as their input. This makes the platform scalable, and development more efficient and less frustrating, by avoiding incompatibilities.

For users: They should see a side effect of things being more reliable and working as expected.

So compatibility becomes about supporting data types rather than specific features and data sources. For example, if a datasource produces type "A", and alerting and certain visualizations accept type "A", that source should work with alerting and those visualizations.

Why the Frack (or silly swear word of your choice) is it so complicated?

Two main reasons, open tent and history.

Open Tent

The open tent nature of our software means that there are different formats of data. In order to support those more natively, there is a set of formats where generally one of them should be close to what a datasource produces, so the data source can relay the information with minimal distortion.

Historical Retrofitting

Dataframes existed before the dataplane types did. So the dataplane formats were designed to be close to the conventions for data that had evolved, while trying to remove places where things were inconsistent and making a decision one way or the other.

How do things work when there are frames with no dataplane type?

Consumers of data have to infer the type from looking at the data returned. This has a few problems:

  • Users get confused about how to write queries to work with different things
  • Error messages can become seemingly unrelated to what users are doing
  • Different features guess differently (e.g. alerting vs visualizations), hard for users and devs to know what to send
  • "Guessing code" (on the consumer side) gets more convoluted over time as more exceptions are added for various data sources.

What if there is no type close to what my data source returns?

You can still return dataframes and work with consumers to support them. The point of dataplane types is for well established types, or for proposing new types to be established. You can set frame.meta.type to whatever string you want, and the version to whatever x.x.

So dataplane is designed to allow data types to grow into maturity in a way that does not limit new innovation.

What opportunities does dataplane provide?

Besides interoperability:

  • We can suggest what actions can be taken with the data if the type is reliably known. For example, we can suggest creating alert rules, or certain visualizations in dashboards that work well with that type.
  • We can suggest transformations that get you from the current type to another type that supports additional actions.

Is dataplane defined in something like plugin.json?

No, but this would be a very helpful evolution.

Right now everything is at runtime, or in code. If we had a schema of query types and the data types they return, we could generate compatibility matrices of data sources and features.

What if my data source is schemaless and doesn't have kinds or types?

The normal pattern is to have a dropdown in the DS query UI to assert the query type.

This often appears as "format as" in the UI. This data source query information can then be used by the data source to produce a dataplane compatible type for the response.

That is extra work for the user, isn't guessing better?

Perhaps, but not at the feature/consumer level. The data source knows more about the data that comes from the system behind it, so guessing at the consumer level ends up confusing the user more.

Rather, if we want to help the user, auto-determining the type should be done within the DS, in a way that fits the constraints of the datasource.

How do I know which data type (kind and format) to use for my Datasource queries?

What data types work with what today?

  • Many visualizations work natively with the labeled types (multi and wide). (TODO: more discovery around what works with which viz.) The long types will naturally work in the table visualization, but presently long will not work with the timeseries viz until a user adds the "Prepare time series" transform (from Grafana Transformations).
  • Grafana Managed Alerts and Recording rules via Server Side Expressions (SSE) support the multi, wide, and long formats for both the numeric and timeseries kinds. Server Side Expressions converts the responses to the corresponding kind. numeric can be directly alerted on; timeseries needs an SSE reduce operation (which creates a numeric result) to be alerted on.

What are some datasources that already send dataplane data today for some of their responses?

  • Prometheus (and Amazon / Azure variants)
  • Loki
  • Azure Monitor and Azure Data Explorer
  • BigQuery
  • ClickHouse
  • Cloudflare
  • Databricks
  • New Relic
  • Oracle, Postgres, MySQL
  • Influx
  • Snowflake
  • VictoriaMetrics

Are there examples of frames for dataplane types?

Yes, see https://github.com/grafana/dataplane/tree/main/examples/data for numeric and time series examples. Each json file is an array of frame(s) as they are encoded in JSON.

What does a data source need to do to support SQL Expressions?

It must be a backend data source, since SQL expressions are evaluated server side.

SQL Expressions supports two general categories of responses:

  • Tabular data: A single dataframe, with no labels on any of the fields (columns). In short, a dataframe that can be directly mapped to a SQL table.
  • Labeled Metric Time Data (Timeseries or Numeric): Data that meets the Dataplane spec and has the type Frame.Meta.Type property set.

Therefore, data source support is really per query type or response type within a data source, so support is often not all or nothing.

Tabular Data

For tabular data responses SQL expressions should work out of the box, so the only thing to do is test out a basic select query.
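As a rough illustration (a hypothetical frame, modeled here as a plain dict rather than SDK types), a single label-free frame maps column-for-column onto a SQL table:

```python
# Hypothetical tabular response: one frame, no labels on any field,
# so it maps directly to a SQL table.
frame = {
    "name": "requests",
    "fields": [
        {"name": "time",   "values": ["2025-10-02T15:55:41Z", "2025-10-02T15:55:42Z"]},
        {"name": "status", "values": ["ok", "error"]},
        {"name": "count",  "values": [12, 3]},
    ],
}

# Column-oriented frame -> rows, as a SQL engine would see the table.
columns = [f["name"] for f in frame["fields"]]
rows = list(zip(*(f["values"] for f in frame["fields"])))
print(columns)   # ['time', 'status', 'count']
print(rows[0])   # ('2025-10-02T15:55:41Z', 'ok', 12)
```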

Labeled Metric Time Data (Timeseries or Numeric)

For labeled metric data, SQL Expressions detects it based on the Frame.Meta.Type property of the data frame. This property holds the dataplane type.

The supported types are:

  • timeseries-multi
  • timeseries-wide
  • numeric-multi
  • numeric-wide

(Note: the timeseries-long and numeric-long formats fall into the tabular category.)

So if your DS has metric data that matches one of those data types, it should work as long as Frame.Meta.Type (serialized into JSON as schema.meta.type) is set to that data type, for example:

[
  {
    "schema": {
      "meta": {
        "type": "numeric-wide", // SQL expressions need this property set for labeled metric data
        "typeVersion": [0, 1], // optional for SQL Expressions (so can be default 0.0)
        // TypeVersion > 0.0 should make other SSE operations more deterministic,
        // but if not a new DS, safest path is to do that as a separate task.
        // ...
      }
    },
    "fields": [
        // ...
    ]
  }
]

In go code:

import (
    "github.com/grafana/grafana-plugin-sdk-go/data"
)

func main() {
    frame := data.NewFrame("")
    // ... add fields and data to create the "NumericWide" type.
    frame.Meta = &data.FrameMeta{Type: data.FrameTypeNumericWide}
}

When SQL expressions receives labeled metric data, it will convert (flatten) the data into the full-long format for the corresponding kind (timeseries or numeric). This happens once the data source query is selected by RefID from a SQL expression. This is because SQL has no notion of labels.

Manual Testing

In a dashboard, for each response type your query offers:

  • Add your data source query
  • Add Expression -> Type SQL
  • The default query should be SELECT * FROM A LIMIT 10 (assuming your query is RefID A). If this works on a few variations of that query type, it should be compatible with SQL expressions.
  • If it doesn't work, see the section above. If you are sending the metadata and data types mentioned above (or tabular data) and it isn't working, there may be an issue with SQL expressions.

FAQ / Other

What happens if my DS sends labeled data without a frame type and I select from it in a SQL expression?

You should get an error like:

[sse.sql.input_conversion] failed to convert the results of query [A] (Datasource Type: [grafana-mock-datasource]) into a SQL/Tabular format for sql expression [B]: can not convert because the response is missing the data type (frame.meta.type) and has labels in the response that can not be mapped to a table

What happens if the DS returns multiple frames without a frame type?

You should get an error like:

[sse.sql.input_conversion] failed to convert the results of query [A] (Datasource Type: [grafana-mock-datasource]) into a SQL/Tabular format for sql expression [B]: can not convert because the response is missing the data type (frame.meta.type) and has more than one dataframe that can not be automatically mapped to a single table

What happens if the DS returns multiple frames with different frame types?

This is not currently supported. SQL expressions will treat the first response as the type of all the frames, and you will likely get an odd error that the additional type doesn't have what the first type needs. Or it will work and the results will be odd.

TODO: Should add an error in this case, or discard anything past the first type.

Are any datasource-specific metrics instrumented in relation to SQL expression support?

Yes:

sum(rate(grafana_sse_sql_command_input_count[$__rate_interval])) by (status,attempted_conversion,datasource_type,input_frame_type)

What does the conversion look like?

(Screenshot: a dashboard visualizing the conversion metric above.)

See https://gist.github.com/kylebrandt/ff5fc4061d85fe08fd24f5cee6333414 for dashboard.json of above (uses mock datasource).

Full Long Formats

Two new data types have been introduced with SQL expressions: timeseries-full-long and numeric-full-long. These are row-oriented tabular formats similar to long (labels are not used), but structured in a way that allows more lossless conversion of information to and from the labeled (wide and multi) formats.

So the use case for this format is when you need to convert between labeled and tabular formats with more preservation of the information, whereas converting back and forth with long can create artifacts.

Properties shared by Full Long Formats

They are single frame formats.

There are three reserved fields (columns):

  • __metric_name__: The name of the metric
  • __value__: A nullable *float64 numeric field, holding the value that represents the combination of the metric name and dimensions for the row.
  • __display_name__: (Optional) When converting from other formats, if the DisplayNameFromDS property is set, it is flattened into this field.
  • Note: Additional reserved names may be added in the same naming style. This would be to flatten other column metadata such as data links or units.

Dimension Columns

Like long, any dimensions (that would be labels in wide or multi) become their own field. The dimension field's name corresponds to the key of the dimension, and the dimension's values are in the rows of that field.

In the case of full-long, they are nullable string fields. When the value on a row is null, the label is considered absent. This helps with more correct conversion to and from the wide and multi formats when not all of the items have the same set of label keys.
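To make the null-means-absent rule concrete, here is a minimal sketch (hypothetical rows, with each full-long row modeled as a dict) of recovering a label set from a row, dropping null dimensions rather than emitting empty labels:

```python
# Hypothetical full-long rows: dimension columns are nullable strings.
rows = [
    {"__metric_name__": "cpu_load", "__value__": 0.82, "host": "a", "region": "us-east-1"},
    {"__metric_name__": "cpu_load", "__value__": 0.61, "host": "b", "region": None},
]

# Reserved full-long column names that are never dimensions.
RESERVED = {"__metric_name__", "__value__", "__display_name__"}

def labels_for(row):
    # A null dimension value means the label is absent for this row,
    # so it is dropped instead of becoming an empty-string label.
    return {k: v for k, v in row.items() if k not in RESERVED and v is not None}

print(labels_for(rows[0]))  # {'host': 'a', 'region': 'us-east-1'}
print(labels_for(rows[1]))  # {'host': 'b'}
```

This is why a round trip from wide/multi does not invent label keys that a series never had.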

Numeric Full Long

The numeric kind is for when each item in the response has a value. The numeric kind in the full long format is identified by the data frame type numeric-full-long.

Numeric Full Long Examples

Single Metric Example

__metric_name__  __value__  host  region
cpu_load         0.82       a     us-east-1
cpu_load         0.61       b     us-east-1

Multi-Metric Example

__metric_name__  __value__  host  region
cpu_load         0.82       a     us-east-1
cpu_load         0.61       b     null
disk_free_bytes  1.23e+12   a     us-east-1
disk_free_bytes  9.80e+11   b     us-east-1

Timeseries Full Long

The timeseries kind shares the same properties as the numeric kind for the full long format in terms of reserved columns and the behavior of dimensions. The timeseries kind in the full long format is identified by the data frame type timeseries-full-long.

The main difference is that there must be a time column that is sorted by time in ascending order (old to new). When this type is produced by converting from the multi or wide formats, the name of the time field (column) is kept.

Timeseries Full Long Examples

Example 1

time                     __value__  __metric_name__  host  iface
2025-10-02 15:55:41.000  1          cpu              a     x
2025-10-02 15:55:41.000  6          cpu              b     y
2025-10-02 15:55:42.000  4          cpu              a     x
2025-10-02 15:55:42.000  8          cpu              b     y
2025-10-02 15:55:43.000  2          cpu              a     x

How Full Long differs from the Long format

The long format is partially flattened, whereas the full-long format is fully flattened.

In the full-long formats, the metric name is treated as another dimension. This creates two reserved fields for this purpose in full long: __value__ and __metric_name__.

In long, each unique metric name gets a field (column); the field's Name property is the metric name. As in full-long, the dimensions that would be labels in the wide and multi formats get flattened. But in long the metric name is not treated as a dimension and does not get flattened, whereas it does in full-long.

Full-long also uses nullable strings for dimensions. This means that if the value is null for a row, then when converting to labels, that label is considered not present. This allows full-long to avoid creating label sets that didn't exist during a round trip conversion from wide or multi.

In the case of the timeseries kind, with round trip conversion (multi or wide to long, and then back), long can create false datapoints. Since long does not flatten the metric name portion, there is no way to represent timestamped data points that existed for one set of series sharing a name but not the other set of series. Since full-long is fully flattened into __value__ and __metric_name__, those missing points don't get falsely created when doing round trip conversion with full-long.

In long, the numeric type for the value is preserved. For example, a metric that is a uint64 will stay that way. Because full-long flattens to a single column (__value__), all numeric values become nullable float64 values. In most cases, upstream we effectively convert to something like a float64 (or JS float) anyway, so this likely doesn't matter much in current practice.

Conversion between the wide/multi and full-long formats

From wide/multi to full-long

  • Fundamentally we are flattening a column-oriented structure with labels into a row-oriented structure without labels.
  • Each label key present becomes a field (column) with the name of the label key.
  • All the numeric value columns are combined and flattened into one __value__ field. Whatever the numeric type is, it will be converted to a nullable *float64 field.
  • The field.Name from wide/multi gets flattened into the __metric_name__ field.
  • For the timeseries kind, the first field of type time gets flattened and keeps its name.
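The steps above can be sketched as follows (a conceptual model only, with simplified field/label structures; not the actual Grafana SSE implementation):

```python
# Conceptual sketch of the wide -> full-long flattening. Each wide value
# field is modeled as a name, a label dict, and values aligned with a
# shared time vector.
times = ["2025-10-02T15:55:41Z", "2025-10-02T15:55:42Z"]
fields = [
    {"name": "cpu", "labels": {"host": "a", "iface": "x"}, "values": [1, 4]},
    {"name": "cpu", "labels": {"host": "b"}, "values": [6, 8]},
]

# Every label key present across fields becomes a dimension column.
label_keys = sorted({k for f in fields for k in f["labels"]})

rows = []
for f in fields:
    for t, v in zip(times, f["values"]):
        row = {
            "time": t,  # the time field keeps its name
            "__value__": None if v is None else float(v),  # nullable float64
            "__metric_name__": f["name"],  # field.Name becomes a dimension
        }
        for k in label_keys:
            row[k] = f["labels"].get(k)  # None marks an absent label
        rows.append(row)

# The timeseries full-long format requires ascending time order.
rows.sort(key=lambda r: r["time"])

print(rows[0])
```

Note how the second field has no iface label, so its rows carry iface = None rather than an empty string.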

Use in SQL expressions

When SQL expressions receives data of the timeseries or numeric kinds in the wide or multi formats, it converts it to the corresponding full long format. This is because there is no notion of labels in SQL, so the data must be flattened into a tabular format.
