Dada Language Specification

This document specifies the Dada programming language. It serves as the authoritative reference for language behavior and implementation requirements.

Purpose

The specification defines the syntax, semantics, and behavior of Dada programs. It is intended for:

Language implementers
Tool developers
Advanced users seeking precise language details

Relationship to RFCs

This specification incorporates accepted RFCs. Non-normative references to RFCs provide historical context and rationale for design decisions.

Conventions

This chapter describes the conventions used throughout this specification.

Paragraph References

Specification paragraphs use MyST directive syntax with the {spec} directive:

:::{spec} local-name rfc123
Paragraph content.
:::

ID Resolution

Paragraph IDs are resolved automatically from context:

File path: syntax/string-literals.md contributes prefix syntax.string-literals
Section headings: ## Escape Sequences contributes segment escape-sequences
Local name: The name in the :::{spec} directive (e.g., invalid)

These combine to form the full ID: syntax.string-literals.escape-sequences.invalid
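The resolution steps above can be sketched in a few lines of Python. This is only an illustration: the function names `slugify` and `resolve_id` are hypothetical, and the heading-to-segment rule (lowercase, words joined by hyphens) is an assumption inferred from the IDs that appear in this specification, not a description of the actual tooling.

```python
import re
from pathlib import PurePosixPath


def slugify(heading):
    """Lowercase a heading and join its words with hyphens (assumed rule)."""
    words = re.findall(r"[A-Za-z0-9]+", heading.lower())
    return "-".join(words)


def resolve_id(file_path, headings, local_name=None):
    """Combine the file prefix, heading segments, and optional local name."""
    path = PurePosixPath(file_path).with_suffix("")  # drop the ".md" suffix
    segments = list(path.parts)                      # ["syntax", "string-literals"]
    segments += [slugify(h) for h in headings]
    if local_name:
        segments.append(local_name)
    return ".".join(segments)


print(resolve_id("syntax/string-literals.md", ["Escape Sequences"], "invalid"))
```

Passing `local_name=None` models a directive that carries only tags, in which case the heading context alone supplies the ID.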

The local name is optional. A directive with only tags uses the heading context as its ID:

## Type

:::{spec} rfc0001 unimpl
String literals have type `my String`.
:::

In syntax/string-literals.md, this paragraph’s ID is syntax.string-literals.type (file prefix plus heading segment).

Inline Sub-paragraphs

List items within a :::{spec} block can be marked as individually referenceable sub-paragraphs using the {spec}`name` syntax:

:::{spec} rfc0001 unimpl
There are multiple forms of string literals:

* {spec}`quoted` Single-quoted string literals begin with `"` and end with `"`.
* {spec}`triple-quoted` Triple-quoted string literals begin with `"""` and end with `"""`.
:::

Under ## Delimiters in syntax/string-literals.md, this creates:

syntax.string-literals.delimiters (parent paragraph)
syntax.string-literals.delimiters.quoted (sub-paragraph)
syntax.string-literals.delimiters.triple-quoted (sub-paragraph)

Each sub-paragraph gets its own linkable anchor in the rendered output.

RFC and Status Annotations

Paragraphs include tags after the optional local name:

:::{spec} local-name rfc123 unimpl
Content added by RFC 123, not yet implemented.
:::

Available tags:

rfcN — content added or modified by RFC N
!rfcN — content deleted by RFC N
unimpl — specified but not yet implemented

Multiple tags can be combined: :::{spec} local-name rfc123 rfc456 unimpl
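As a rough sketch of how the directive arguments might be split into a local name and tags, consider the Python below. The function name `parse_spec_args` is hypothetical, and two assumptions are made that the text does not state: the local name, when present, comes first, and any token that is not a recognized tag is rejected.

```python
import re

# Recognized tags: rfcN, !rfcN, and unimpl.
TAG = re.compile(r"!?rfc\d+|unimpl")


def parse_spec_args(args):
    """Split directive arguments into (local_name, tags)."""
    tokens = args.split()
    local_name = None
    if tokens and not TAG.fullmatch(tokens[0]):
        local_name = tokens.pop(0)       # first non-tag token is the local name
    unknown = [t for t in tokens if not TAG.fullmatch(t)]
    if unknown:
        raise ValueError(f"unrecognized tags: {unknown}")
    return local_name, tokens


print(parse_spec_args("local-name rfc123 rfc456 unimpl"))
print(parse_spec_args("rfc0001 unimpl"))
```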

Test Annotations

Tests reference spec paragraphs using #:spec comments with the fully-qualified ID:

#:spec syntax.string-literals.delimiters.quoted

These labels serve multiple purposes:

Cross-referencing within the specification
Linking from RFC documents
Test validation via #:spec annotations in .dada test files

Identifiers use semantic names rather than numbers to remain stable as the specification evolves.
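A tool that validates test annotations could start by collecting the IDs named in #:spec comments. The sketch below is illustrative only: `spec_ids` is a hypothetical helper, and it assumes each annotation sits on its own line with nothing after the ID.

```python
import re

# A "#:spec" comment followed by one fully-qualified paragraph ID.
SPEC_COMMENT = re.compile(r"^\s*#:spec\s+(\S+)\s*$")


def spec_ids(test_source):
    """Return the paragraph IDs referenced by #:spec comments, in order."""
    ids = []
    for line in test_source.splitlines():
        match = SPEC_COMMENT.match(line)
        if match:
            ids.append(match.group(1))
    return ids


example = "#:spec syntax.string-literals.delimiters.quoted\nlet s = \"hi\"\n"
print(spec_ids(example))
```

The collected IDs could then be compared against the set of IDs produced by the specification build to detect stale references.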

EBNF Notation

This specification uses Extended Backus-Naur Form (EBNF) to describe syntax. Standard EBNF operators apply:

A* — zero or more repetitions of A
A+ — one or more repetitions of A
A? — optional A
A | B — A or B
`keyword` — a literal terminal
ε — the empty production

In addition, this specification uses the following shorthand for comma-separated lists with optional trailing commas:

A,* — zero or more comma-separated occurrences of A
A,+ — one or more comma-separated occurrences of A
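To make the comma-list shorthand concrete, here is a small Python sketch of a helper that accepts both forms, including the optional trailing comma. The names `parse_comma_list` and `parse_name` are hypothetical, and the token representation (a list of strings) is chosen only for illustration.

```python
def parse_comma_list(tokens, parse_item, allow_empty=True):
    """Parse `A,*` (allow_empty=True) or `A,+` (allow_empty=False).

    `parse_item(tokens, i)` consumes one A starting at index i and returns
    (value, next_index), or None if no A starts there.
    """
    items, i = [], 0
    while True:
        result = parse_item(tokens, i)
        if result is None:
            break
        value, i = result
        items.append(value)
        if i < len(tokens) and tokens[i] == ",":
            i += 1        # a separator, or the optional trailing comma
        else:
            break
    if not items and not allow_empty:
        raise SyntaxError("expected at least one item")
    return items, i


def parse_name(tokens, i):
    """Toy item parser: any token other than a comma counts as one A."""
    if i < len(tokens) and tokens[i] != ",":
        return tokens[i], i + 1
    return None


print(parse_comma_list(["a", ",", "b", ","], parse_name))
```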

Normative Language

This specification uses the following terms to indicate requirements:

must: An absolute requirement
must not: An absolute prohibition
should: A strong recommendation
should not: A strong recommendation against
may: An optional feature or behavior

Syntax

This chapter describes the lexical structure and grammar of Dada programs.

Lexical Structure

This chapter specifies the lexical structure of Dada programs. A Dada source file is a sequence of Unicode characters, which the lexer converts into a sequence of tokens.

Source Encoding

syntax.lexical-structure.source-encoding

Dada source files are encoded as UTF-8.

Tokens

syntax.lexical-structure.tokens

The lexer produces a sequence of tokens. A token Token is one of the following kinds:


Token ::= Identifier
          | Keyword
          | Literal
          | Operator
          | Delimiter

syntax.lexical-structure.tokens.preceding-whitespace

Each token records whether it was preceded by whitespace, a newline, or a comment. The parser uses this information; whitespace, newlines, and comments do not produce tokens of their own.

Whitespace and Comments

Whitespace

syntax.lexical-structure.whitespace-and-comments.whitespace

Whitespace characters (spaces, tabs, and other Unicode whitespace excluding newlines) separate tokens but are otherwise not significant.

syntax.lexical-structure.whitespace-and-comments.whitespace.newlines

Newline characters (\n) are tracked by the lexer. Whether a token is preceded by a newline may affect how the parser interprets certain constructs.

Comments

syntax.lexical-structure.whitespace-and-comments.comments

A comment begins with # and extends to the end of the line.

syntax.lexical-structure.whitespace-and-comments.comments.content

The content of a comment, including the leading #, is ignored by the lexer. A comment implies a newline for the purpose of preceding-whitespace tracking.
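The whitespace, newline, and comment rules above can be illustrated with a toy lexer in Python. This is a sketch, not the reference implementation: the `Token` class and `lex` function are hypothetical, the two flags are kept separate as an assumption (the text does not say whether a newline also counts as whitespace), and string literals and delimiters are not handled.

```python
from dataclasses import dataclass


@dataclass
class Token:
    text: str
    preceded_by_whitespace: bool = False
    preceded_by_newline: bool = False


def lex(source):
    """Split source into word and single-character tokens with flags."""
    tokens, i = [], 0
    ws = nl = False
    while i < len(source):
        ch = source[i]
        if ch == "\n":
            nl = True
            i += 1
        elif ch.isspace():
            ws = True
            i += 1
        elif ch == "#":
            # Skip to the end of the line; a comment implies a newline.
            while i < len(source) and source[i] != "\n":
                i += 1
            nl = True
        else:
            start = i
            if ch.isalnum() or ch == "_":
                while i < len(source) and (source[i].isalnum() or source[i] == "_"):
                    i += 1
            else:
                i += 1
            tokens.append(Token(source[start:i], ws, nl))
            ws = nl = False      # flags apply only to the next token
    return tokens


for token in lex("a # note\nb +c"):
    print(token)
```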

`Identifier` definition

syntax.lexical-structure.identifier-definition

An identifier Identifier begins with a Unicode alphabetic character or underscore (_), followed by zero or more Unicode alphanumeric characters or underscores, provided it is not a keyword Keyword:


Identifier ::= (Alphabetic | _) (Alphanumeric | _)*    (not a Keyword)

syntax.lexical-structure.identifier-definition.case-sensitivity

Identifiers are case-sensitive.

`Keyword` definition

syntax.lexical-structure.keyword-definition

The following words are reserved as keywords:


Keyword ::= as
            | async
            | await
            | class
            | else
            | enum
            | export
            | false
            | fn
            | give
            | given
            | if
            | is
            | let
            | match
            | mod
            | mut
            | my
            | our
            | perm
            | pub
            | ref
            | return
            | self
            | share
            | shared
            | struct
            | true
            | type
            | unsafe
            | use
            | where
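The identifier and keyword rules can be checked with a short Python sketch. The helper names are hypothetical. Python's `str.isalpha` and `str.isalnum` are used as approximations of "Unicode alphabetic" and "Unicode alphanumeric"; they may not agree with the Unicode properties a Dada lexer consults in every case.

```python
KEYWORDS = frozenset("""
    as async await class else enum export false fn give given if is let
    match mod mut my our perm pub ref return self share shared struct
    true type unsafe use where
""".split())


def is_identifier_shape(word):
    """(Alphabetic | _) (Alphanumeric | _)*, approximated with str methods."""
    if not word:
        return False
    first, rest = word[0], word[1:]
    if not (first.isalpha() or first == "_"):
        return False
    return all(c.isalnum() or c == "_" for c in rest)


def classify(word):
    """Return "keyword", "identifier", or "invalid" for a candidate word."""
    if not is_identifier_shape(word):
        return "invalid"
    return "keyword" if word in KEYWORDS else "identifier"


for word in ["fn", "Fn", "_x1", "1x"]:
    print(word, classify(word))
```

Because identifiers are case-sensitive, `Fn` is an identifier even though `fn` is reserved.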

`Operator` definition

syntax.lexical-structure.operator-definition

The following single characters are recognized as operator tokens:


Operator ::= + | - | * | / | % | = | !
           | < | > | & | | | : | , | . | ; | ?

.plus +
.minus -
.star *
.slash /
.percent %
.equals =
.bang !
.less-than <
.greater-than >
.ampersand &
.pipe |
.colon :
.comma ,
.dot .
.semicolon ;
.question ?

syntax.lexical-structure.operator-definition.multi-character

Multi-character operators such as &&, ||, ==, <=, >=, and -> are formed by the parser from adjacent operator tokens.
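One way a parser might form multi-character operators is to fuse two single-character operator tokens when the second has no preceding whitespace. The Python sketch below illustrates that idea only; `fuse_operators` is a hypothetical helper, the operator set is limited to the examples named above, and treating "adjacent" as "no preceding whitespace" is an assumption.

```python
MULTI_CHAR = {"&&", "||", "==", "<=", ">=", "->"}


def fuse_operators(tokens):
    """Fuse adjacent operator tokens; tokens are (text, preceded_by_ws) pairs."""
    out, i = [], 0
    while i < len(tokens):
        text, ws = tokens[i]
        if i + 1 < len(tokens):
            nxt, nxt_ws = tokens[i + 1]
            if not nxt_ws and text + nxt in MULTI_CHAR:
                out.append((text + nxt, ws))   # keep the first token's flag
                i += 2
                continue
        out.append((text, ws))
        i += 1
    return out


# "a <= b" fuses; "a < = b" does not.
print(fuse_operators([("a", False), ("<", True), ("=", False), ("b", True)]))
print(fuse_operators([("a", False), ("<", True), ("=", True), ("b", True)]))
```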

`Delimiter` definition

syntax.lexical-structure.delimiter-definition

A delimited token contains a matched pair of brackets and their contents:


Delimiter ::= ( Token* ) | [ Token* ] | { Token* }

.parentheses Parentheses: ( and ).
.square-brackets Square brackets: [ and ].
.curly-braces Curly braces: { and }.

syntax.lexical-structure.delimiter-definition.balanced

Delimiters must be balanced. An opening delimiter without a matching closing delimiter is an error.

syntax.lexical-structure.delimiter-definition.nesting

The lexer tracks delimiter nesting. Content between matching delimiters is treated as a unit, which enables deferred parsing of function bodies and other nested structures.
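The balancing and nesting rules can be sketched as a stack-based grouping pass. In the Python below, `build_tree` is a hypothetical function that turns a flat list of token strings into nested lists, one per delimiter pair; the error messages and the nested-list representation are illustrative choices, not part of this specification.

```python
CLOSERS = {"(": ")", "[": "]", "{": "}"}


def build_tree(tokens):
    """Group a flat token list into nested lists, one per delimiter pair."""
    root = []
    stack = [(None, root)]            # (expected closer, children so far)
    for tok in tokens:
        if tok in CLOSERS:
            children = [tok]
            stack[-1][1].append(children)
            stack.append((CLOSERS[tok], children))
        elif tok in CLOSERS.values():
            expected, children = stack[-1]
            if tok != expected:
                raise SyntaxError(f"unexpected {tok!r}")
            children.append(tok)
            stack.pop()
        else:
            stack[-1][1].append(tok)
    if len(stack) > 1:
        raise SyntaxError(f"missing {stack[-1][0]!r}")
    return root


print(build_tree(["f", "(", "a", "[", "b", "]", ")"]))
```

Because each delimited group is a single unit in the result, a parser can skip over a function body and return to it later.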

`Literal` definition

syntax.lexical-structure.literal-definition

A literal Literal is one of the following:


Literal ::= IntegerLiteral
            | BooleanLiteral
            | StringLiteral

`IntegerLiteral` definition

syntax.lexical-structure.literal-definition.integerliteral-definition

An integer literal IntegerLiteral is a sequence of one or more ASCII decimal digits (0–9), optionally separated by underscores (_) that do not affect the value:


IntegerLiteral ::= Digit (_? Digit)*
Digit ::= 0 | 1 | ... | 9
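The IntegerLiteral production translates directly into a regular expression. The Python sketch below uses a hypothetical helper, `integer_value`, to recognize a literal and compute its value. Under this production, a leading, trailing, or doubled underscore does not match.

```python
import re

# Digit (_? Digit)* with ASCII digits only.
INTEGER_LITERAL = re.compile(r"[0-9](?:_?[0-9])*")


def integer_value(text):
    """Return the value of an integer literal, ignoring underscores."""
    if not INTEGER_LITERAL.fullmatch(text):
        raise ValueError(f"not an integer literal: {text!r}")
    return int(text.replace("_", ""))


print(integer_value("1_000_000"))
```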

`BooleanLiteral` definition

syntax.lexical-structure.literal-definition.booleanliteral-definition

The keywords true and false are boolean literals:


BooleanLiteral ::= true | false

`StringLiteral` definition

syntax.lexical-structure.literal-definition.stringliteral-definition

String literal syntax is specified in String Literals.

Lexical Errors

syntax.lexical-structure.lexical-errors

Characters that do not begin a valid token are accumulated and reported as a single error spanning the invalid sequence.

Items

This chapter specifies the top-level items that can appear in a Dada source file.

Source Files

syntax.items.source-files

A Dada source file defines a module. The module name is derived from the file name. A source file contains zero or more items, optionally followed by zero or more statements:


SourceFile ::= Item* Statement*

syntax.items.source-files.implicit-main

If a source file contains top-level statements, they are wrapped in an implicit async fn main() function.

syntax.items.source-files.kinds

An item Item is one of the following:


Item ::= Function
         | Class
         | Struct
         | UseDeclaration

`Visibility` definition

syntax.items.visibility-definition

Items and fields may have a visibility modifier. Without a modifier, the item is private to the enclosing module.


Visibility ::= pub
               | export
               | ε

`Function` definition

syntax.items.function-definition

A function Function is declared with the fn keyword, optionally preceded by a visibility modifier and effect keywords, and followed by a name, optional generic parameters, parameters, an optional return type, an optional where clause, and an optional body:


Function ::= Visibility Effect* fn Identifier GenericParameters?
             ( Parameters ) ReturnType? WhereClause? FunctionBody

`Effect` definition

syntax.items.function-definition.effect-definition

Effect keywords may appear in any order before fn:


Effect ::= async
           | unsafe

`Parameters` definition

syntax.items.function-definition.parameters-definition

Function parameters are enclosed in parentheses and separated by commas:


Parameters ::= FunctionInput,*
FunctionInput ::= SelfParameter | Parameter

syntax.items.function-definition.parameters-definition.self

A function may have a self parameter as its first parameter, optionally preceded by a permission keyword, which makes it a method:


SelfParameter ::= PermissionKeyword? self

syntax.items.function-definition.parameters-definition.parameter-syntax

Each non-self parameter has the form name: Type. A parameter may be preceded by mut to declare a mutable binding:


Parameter ::= mut? Identifier : Type

`FunctionBody` definition

syntax.items.function-definition.functionbody-definition

A function may have a body, which is a block enclosed in curly braces. If no body is present, the function has no definition.


FunctionBody ::= Block | ε

`ReturnType` definition

syntax.items.function-definition.returntype-definition

A function may declare a return type with -> followed by a Type after the parameters.

ReturnType ::= -> Type

`GenericParameters` definition

syntax.items.function-definition.genericparameters-definition

A function may declare generic parameters in square brackets after the name:


GenericParameters ::= [ GenericParameter,* ]
GenericParameter ::= type Identifier | perm Identifier

.type-parameters A type parameter is written as type followed by a name: type T.
.permission-parameters A permission parameter is written as perm followed by a name: perm P.

`WhereClause` definition

syntax.items.function-definition.whereclause-definition

A function may have a where clause after the return type that constrains its generic parameters:


WhereClause ::= where WhereConstraint,+
WhereConstraint ::= Type is WhereKind (+ WhereKind)*
WhereKind ::= ref
              | mut
              | shared
              | unique
              | owned
              | lent

`Class` definition

syntax.items.class-definition

A class Class is declared with the class keyword. Classes have reference semantics.


Class ::= Visibility class Identifier GenericParameters?
          ConstructorFields? WhereClause? ClassBody?

`ConstructorFields` definition

syntax.items.class-definition.constructorfields-definition

A class may declare constructor fields in parentheses after the name:


ConstructorFields ::= ( Field,* )

`ClassBody` definition

syntax.items.class-definition.classbody-definition

A class body enclosed in curly braces may contain field declarations and method definitions:


ClassBody ::= { ClassMember* }
ClassMember ::= Field
                | Method

`Method` definition

syntax.items.class-definition.method-definition

A method Method is a function declared inside a class or struct body:


Method ::= Function

`Field` definition

syntax.items.class-definition.field-definition.field-syntax

A field declaration Field has the form:


Field ::= Visibility mut? Identifier : Type

Generics and Where Clauses

syntax.items.class-definition.generics-and-where-clauses

Classes support generic parameters and where clauses with the same syntax as functions.

`Struct` definition

syntax.items.struct-definition

A struct Struct is declared with the struct keyword. Its syntax is identical to that of Class, but structs have value semantics.


Struct ::= Visibility struct Identifier GenericParameters?
           ConstructorFields? WhereClause? ClassBody?

`UseDeclaration` definition

syntax.items.usedeclaration-definition

A use declaration UseDeclaration imports a name from another crate, optionally renaming it with as:


UseDeclaration ::= use Path (as Identifier)?
Path ::= Identifier (. Identifier)*

Statements

This chapter specifies the statement syntax of Dada.

`Block` definition

syntax.statements.block-definition

A block Block is a sequence of zero or more statements enclosed in curly braces:


Block ::= { Statement* }

syntax.statements.block-definition.value

A block evaluates to the value of its last expression, if the last statement is an expression statement.

`Statement` definition

syntax.statements.statement-definition

A statement Statement is one of the following:


Statement ::= LetStatement
              | ExprStatement

`LetStatement` definition

syntax.statements.letstatement-definition

A let statement LetStatement introduces a new variable binding:


LetStatement ::= let mut? Identifier (: Type)? (= Expr)?

syntax.statements.letstatement-definition.type-annotation

A let statement may include a type annotation: let name: Type = value.

syntax.statements.letstatement-definition.mutable

A let statement may use mut to declare a mutable binding: let mut name = value.

syntax.statements.letstatement-definition.initializer-optional

The initializer (= value) is optional. A variable may be declared without an initial value.

`ExprStatement` definition

syntax.statements.exprstatement-definition

An expression statement ExprStatement is an expression followed by a newline or end of block:


ExprStatement ::= Expr

Expressions

This chapter specifies the expression syntax of Dada.

`Expr` definition

syntax.expressions.expr-definition

An expression Expr is parsed using precedence climbing. The productions in this chapter are listed from lowest to highest precedence:


Expr ::= AssignExpr

`AssignExpr` definition

syntax.expressions.assignexpr-definition

The assignment operator = assigns a value to a place expression. It has the lowest precedence among binary operators:


AssignExpr ::= OrExpr

`OrExpr` definition

syntax.expressions.orexpr-definition

The logical OR operator || performs short-circuit boolean logic:


OrExpr ::= AndExpr

`AndExpr` definition

syntax.expressions.andexpr-definition

The logical AND operator && performs short-circuit boolean logic:


AndExpr ::= CompareExpr

`CompareExpr` definition

syntax.expressions.compareexpr-definition

The comparison operators compare two values and produce a boolean result:


CompareExpr ::= AddExpr


CompareOp ::= == | < | > | <= | >=

`AddExpr` definition

syntax.expressions.addexpr-definition

The additive operators perform addition and subtraction:


AddExpr ::= MulExpr

`MulExpr` definition

syntax.expressions.mulexpr-definition

The multiplicative operators perform multiplication and division:


MulExpr ::= UnaryExpr

`UnaryExpr` definition

syntax.expressions.unaryexpr-definition

A unary expression applies a prefix operator to a postfix expression:


UnaryExpr ::= UnaryOp* PostfixExpr
UnaryOp ::= ! | -

.not ! performs logical negation.
.negate - performs arithmetic negation.
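The precedence levels described above can be illustrated with a compact precedence-climbing parser in Python. This is a sketch under stated assumptions: the function names are hypothetical, operands are single tokens, multi-character operators are assumed to be already formed, and associativity (right for =, left for everything else) is an assumption, since the productions above do not spell it out.

```python
# Binary operators grouped by level, lowest precedence first.
LEVELS = [
    ({"="}, "right"),
    ({"||"}, "left"),
    ({"&&"}, "left"),
    ({"==", "<", ">", "<=", ">="}, "left"),
    ({"+", "-"}, "left"),
    ({"*", "/"}, "left"),
]
PRECEDENCE = {
    op: (level, assoc)
    for level, (ops, assoc) in enumerate(LEVELS)
    for op in ops
}


def parse_unary(tokens, pos):
    """UnaryOp* followed by an operand (a single token in this sketch)."""
    if pos >= len(tokens):
        raise SyntaxError("expected an operand")
    tok = tokens[pos]
    if tok in ("!", "-"):
        operand, pos = parse_unary(tokens, pos + 1)
        return (tok, operand), pos
    return tok, pos + 1


def parse_expr(tokens, pos=0, min_level=0):
    """Parse tokens[pos:] and return (tree, next_pos)."""
    left, pos = parse_unary(tokens, pos)
    while pos < len(tokens) and tokens[pos] in PRECEDENCE:
        op = tokens[pos]
        level, assoc = PRECEDENCE[op]
        if level < min_level:
            break
        # A left-associative operator requires a strictly tighter right side.
        next_min = level + 1 if assoc == "left" else level
        right, pos = parse_expr(tokens, pos + 1, next_min)
        left = (op, left, right)
    return left, pos


tree, _ = parse_expr(["a", "=", "b", "+", "c", "*", "-", "d"])
print(tree)
```

The result nests the multiplication inside the addition and the addition inside the assignment, matching the lowest-to-highest ordering of the sections above.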

Newline Sensitivity

syntax.expressions.newline-sensitivity

A binary operator must appear on the same line as its left operand. An operator on a new line begins a new expression or is interpreted as a prefix operator.

`PostfixExpr` definition

syntax.expressions.postfixexpr-definition

A postfix expression applies zero or more postfix operators to a primary expression:


PostfixExpr ::= PrimaryExpr PostfixOp*

`PostfixOp` definition

syntax.expressions.postfixexpr-definition.postfixop-definition

A postfix operator PostfixOp is one of the following:


PostfixOp ::= FieldAccess
              | Call
              | Await
              | PermissionOp

`FieldAccess` definition

syntax.expressions.postfixexpr-definition.fieldaccess-definition

A field access FieldAccess uses dot notation to access a field or name a method:


FieldAccess ::= . Identifier

`Call` definition

syntax.expressions.postfixexpr-definition.call-definition

A function or method call Call follows an expression with parenthesized arguments separated by commas. The opening parenthesis must appear on the same line as the callee:


Call ::= ( Expr,* )

`Await` definition

syntax.expressions.postfixexpr-definition.await-definition

The .await postfix operator awaits the result of a future:


Await ::= . await

`PermissionOp` definition

syntax.expressions.postfixexpr-definition.permissionop-definition

A permission operation PermissionOp requests specific permissions on a value:


PermissionOp ::= give
                 | share
                 | lease
                 | ref

`PrimaryExpr` definition

syntax.expressions.primaryexpr-definition

A primary expression PrimaryExpr is one of the following:


PrimaryExpr ::= Literal
                | Identifier
                | self
                | IfExpr
                | ReturnExpr
                | ConstructorExpr
                | ( Expr )
                | Block

`IfExpr` definition

syntax.expressions.primaryexpr-definition.ifexpr-definition

An if expression IfExpr evaluates a condition and executes a block:


IfExpr ::= if Expr Block (else if Expr Block)* (else Block)?

syntax.expressions.primaryexpr-definition.ifexpr-definition.else

An if expression may have an else clause.

syntax.expressions.primaryexpr-definition.ifexpr-definition.else-if

Multiple conditions may be chained with else if.

`ReturnExpr` definition

syntax.expressions.primaryexpr-definition.returnexpr-definition

A return expression ReturnExpr exits the enclosing function, optionally with a value. The value, if present, must appear on the same line as return:


ReturnExpr ::= return Expr?

`ConstructorExpr` definition

syntax.expressions.primaryexpr-definition.constructorexpr-definition

A constructor expression ConstructorExpr creates a new instance of a class or struct. The opening brace must appear on the same line as the type name:


ConstructorExpr ::= Identifier { ConstructorField,* }
ConstructorField ::= Identifier : Expr

Types and Permissions

This chapter specifies the syntax for types and permissions in Dada.

Types

Named Types

syntax.types-and-permissions.types.named-types

A type may be a simple name: String, i32, bool.

syntax.types-and-permissions.types.named-types.paths

A type may be a dotted path: module.Type.

Generic Application

syntax.types-and-permissions.types.generic-application

A type may be applied to generic arguments in square brackets: Vec[String], Pair[i32, bool].

Permission-Qualified Types

syntax.types-and-permissions.types.permission-qualified-types

A type may be preceded by a permission to form a permission-qualified type: my String, ref Point, mut Vec[i32].

Permissions

syntax.types-and-permissions.permissions

The following permission keywords are available:

.my my — exclusive ownership.
.our our — shared ownership.
.ref ref — immutable reference.
.mut mut — mutable reference.
.given given — a permission supplied by the caller.

Place Lists

syntax.types-and-permissions.permissions.place-lists

The permissions ref, mut, and given may include a place list in square brackets specifying which places they refer to: ref[x, y], mut[self], given[p].

syntax.types-and-permissions.permissions.place-lists.place-list-optional

The place list is optional. When omitted, the permission applies without place restrictions.

Generic Declarations

In Type Position

syntax.types-and-permissions.generic-declarations.in-type-position

A generic type parameter is declared as type T.

syntax.types-and-permissions.generic-declarations.in-type-position.permission-declaration

A generic permission parameter is declared as perm P.

Ambiguity

syntax.types-and-permissions.generic-declarations.ambiguity

A single identifier in a generic position is ambiguous between a type and a permission. The ambiguity is resolved during type checking, not parsing.

Literals

This chapter describes literal expressions in Dada.

Numeric Literals

See Integer Literals for lexical syntax.

Numeric type inference and overflow behavior are yet to be specified.

Boolean Literals

See Boolean Literals for lexical syntax.

String Literals

See String Literals for detailed specification.

String Literals

This chapter specifies string literal syntax in Dada.

Delimiters

syntax.string-literals.delimiters rfc0001

There are multiple forms of string literals:

.quoted Single-quoted string literals begin with a " and end with a ".
.triple-quoted Triple-quoted string literals begin with a """ and end with a """.

syntax.string-literals.delimiters.disambiguation rfc0001

The syntax """ is interpreted as the start of a triple-quoted string literal and not a single-quoted string literal followed by the start of another single-quoted string literal.

syntax.string-literals.delimiters.triple-quote-termination rfc0001

A triple-quoted string literal cannot contain three consecutive unescaped double-quote characters.

Type

syntax.string-literals.type rfc0001

String literals have type my String.

Escape Sequences

syntax.string-literals.escape-sequences rfc0001

String literals support the following escape sequences:

.backslash \\ produces a literal backslash.
.double-quote \" produces a literal double quote.
.newline \n produces a newline.
.carriage-return \r produces a carriage return.
.tab \t produces a tab.
.open-brace \{ produces a literal {.
.close-brace \} produces a literal }.

syntax.string-literals.escape-sequences.triple-quoted rfc0001

The \" escape sequence is not needed in triple-quoted strings, since embedded double quotes do not terminate the string.

syntax.string-literals.escape-sequences.invalid rfc0001

A \ followed by a character not listed above is an error.
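The escape sequences listed above can be decoded with a simple table lookup. The Python sketch below uses a hypothetical function, `unescape`, which operates on the text between the quotes and rejects any backslash not followed by a listed character. It ignores interpolation and multiline handling.

```python
# Character after the backslash -> character produced.
ESCAPES = {"\\": "\\", '"': '"', "n": "\n", "r": "\r", "t": "\t", "{": "{", "}": "}"}


def unescape(body):
    """Decode the escape sequences in the body of a string literal."""
    out, i = [], 0
    while i < len(body):
        ch = body[i]
        if ch != "\\":
            out.append(ch)
            i += 1
            continue
        if i + 1 >= len(body) or body[i + 1] not in ESCAPES:
            raise ValueError(f"invalid escape at offset {i}")
        out.append(ESCAPES[body[i + 1]])
        i += 2
    return "".join(out)


print(repr(unescape(r"a\tb\\c\{x\}")))
```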

Interpolation

syntax.string-literals.interpolation rfc0001 unimpl

String literals may contain interpolation expressions delimited by curly braces ({ and }). Any valid Dada expression may appear inside the braces.

syntax.string-literals.interpolation.brace-escaping rfc0001

Literal brace characters are produced by the \{ and \} escape sequences.

syntax.string-literals.interpolation.nesting rfc0001 unimpl

The lexer tracks brace nesting depth, so that braces within interpolated expressions (e.g., block expressions, struct literals) do not prematurely terminate the interpolation.

syntax.string-literals.interpolation.nested-quotes rfc0001 unimpl

Quotes inside interpolated expressions do not terminate the enclosing string literal.
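The nesting and nested-quote rules can be illustrated with a small scanner that splits a string body into literal text and interpolated expressions. This Python sketch is not the lexer's actual algorithm: both function names are hypothetical, nested string literals are treated as opaque (interpolation inside them is not tracked), and escapes in the literal text are left undecoded for a later pass.

```python
def find_closing_brace(body, open_index):
    """Return the index of the } that matches the { at open_index."""
    depth, i, in_string = 0, open_index, False
    while i < len(body):
        ch = body[i]
        if in_string:
            if ch == "\\":
                i += 1                 # skip the escaped character
            elif ch == '"':
                in_string = False
        elif ch == '"':
            in_string = True           # braces inside quotes are ignored
        elif ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth == 0:
                return i
        i += 1
    raise ValueError("unterminated interpolation")


def split_interpolations(body):
    """Split a string body into ("text", s) and ("expr", s) parts."""
    parts, text, i = [], [], 0
    while i < len(body):
        ch = body[i]
        if ch == "\\" and i + 1 < len(body):
            text.append(body[i:i + 2])  # \{ and \} stay literal text
            i += 2
        elif ch == "{":
            end = find_closing_brace(body, i)
            if text:
                parts.append(("text", "".join(text)))
                text = []
            parts.append(("expr", body[i + 1:end]))
            i = end + 1
        else:
            text.append(ch)
            i += 1
    if text:
        parts.append(("text", "".join(text)))
    return parts


print(split_interpolations("x = {Point { x: 1 }.x}"))
```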

syntax.string-literals.interpolation.scope rfc0001 unimpl

Interpolated expressions are evaluated at runtime in the enclosing scope.

syntax.string-literals.interpolation.order rfc0001 unimpl

Interpolated expressions are evaluated left-to-right.

syntax.string-literals.interpolation.type-check rfc0001 unimpl

Each interpolated expression must produce a value that can be converted to a string. This is checked at compile time.

syntax.string-literals.interpolation.permissions rfc0001 unimpl

The permission system applies normally to interpolated expressions.

Multiline Strings

syntax.string-literals.multiline-strings rfc0001

A string literal that begins with a newline immediately after the opening quote (either " or """) is a multiline string literal with automatic indentation handling.

syntax.string-literals.multiline-strings.leading-newline rfc0001

The leading newline immediately after the opening quote is removed.

syntax.string-literals.multiline-strings.trailing-whitespace rfc0001

The trailing newline immediately before the closing quote is removed, along with any whitespace on the final line.

syntax.string-literals.multiline-strings.dedenting rfc0001

The common whitespace prefix across all non-empty lines is removed from the start of each line.

syntax.string-literals.multiline-strings.escape-sequences-are-content rfc0001

Escape sequences are part of the string content, not whitespace. They are not affected by leading/trailing stripping or dedenting.

syntax.string-literals.multiline-strings.raw rfc0001

A string literal beginning with "\ followed by a newline disables automatic dedenting. The string preserves its content exactly as written, including the leading newline and all indentation.
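The stripping and dedenting rules for multiline strings can be sketched as follows. This Python is illustrative only: `dedent_multiline` is a hypothetical helper that takes the raw text between the quotes, before escape sequences are decoded, so escapes are never mistaken for whitespace. Treating whitespace-only lines as empty when computing the common prefix is an assumption, and the raw form that disables dedenting is not handled.

```python
import os


def dedent_multiline(body):
    """Apply the multiline rules to the raw text between the quotes."""
    if not body.startswith("\n"):
        return body                       # not a multiline literal
    lines = body[1:].split("\n")          # drop the leading newline
    if len(lines) > 1 and lines[-1].strip() == "":
        lines.pop()                       # trailing newline and final-line whitespace
    # Leading whitespace of every line that has visible content.
    indents = [line[: len(line) - len(line.lstrip())] for line in lines if line.strip()]
    prefix = os.path.commonprefix(indents) if indents else ""
    return "\n".join(line[len(prefix):] for line in lines)


print(repr(dedent_multiline("\n    hello\n      world\n  ")))
```

In the example, the four-space prefix shared by both content lines is removed, while the extra two spaces before `world` are preserved as content.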

String Conversion

syntax.string-literals.string-conversion rfc0001 unimpl

Interpolated expressions must produce values that can be converted to strings. The exact conversion mechanism is not yet defined and depends on Dada’s trait/interface system.

Implementation Notes

A string literal with no interpolation expressions can be compiled as a simple string constant with no runtime overhead.
