Infernu’s Type System

Quick Summary

Infernu’s type system is polymorphic and structural. It is built to allow the compiler to infer all types without any annotations by the user.

The type system features:

Full type inference: the most generic type is always inferred without help from the user.
Parametric polymorphism (aka “generics”), based on Hindley-Milner type inference.
Row-type polymorphism, otherwise known as “static duck typing” or structural typing.
Simple type classes (which allow for example correct support of JS + and [] operators), allowing for ad-hoc polymorphism.
Recursive types for true representation of object-oriented methods.
Correct handling of JS’s this dynamic scoping rules.
Polymorphic “methods” are supported, so rows may have rank-2 types.
Mutability is not represented in the types, but affects type inference (polymorphism is restricted for mutable variables).

Note: Currently, all types are inferred. Support for type annotations for specifically constraining or for documentation is planned.

Overview
Primitive Types
Generics, or Parametric Polymorphism
Type Variables
Built-in Parameterized Types
Functions
Row types
Type Classes

Overview

Infernu is a type checker for JavaScript. Since JavaScript is dynamically and weakly typed, it makes no sense to talk about “type errors” in arbitrary JavaScript code. Consequently Infernu makes assumptions about the code and expects it to follow certain rules that are not required by plain JavaScript (for example, implicit coercions such as 3 + 'a' are not allowed.)

Infernu’s type system is designed for writing dynamic-looking code in a safe statically type-checked environment. Type annotations are not required (though they would be nice to support, for various reasons). Instead, Infernu infers the types of expressions by examining the code. If the inferred types contradict each other, Infernu reports the contradiction as an error.

Infernu places restrictions on JS programs that are otherwise valid. In other words, Infernu is a subset of JavaScript. Infernu tries to strike a balance between type system complexity and dynamic-style coding flexibility.

Infernu can deduce the types without any type annotations at all. Such complete type inference is more powerful than what languages such as C#’s “var” and C++’s “auto” keywords represent. Infernu tracks types throughout the code so you never need to specify them. The type inference algorithm guarantees that the type being inferred is always the most generic type possible.

Notation: Concrete types are capitalized; type variables are lowercase letters (a, b, c, etc.)

For type system geeks

Infernu’s type system is based on Damas-Hindley-Milner (and Algorithm W) with added mutable variables (monomorphically constrained using value restriction), and with row type polymorphism, equi-recursive types (constrained to go through row types), rank-2 row types (for polymorphic methods), and simple type classes. Functions are uncurried.

Primitive Types

Number
Boolean
String
Regex
Undefined
Null (may be removed in the future)

(Date should probably be added)

Generics, or Parametric Polymorphism

Parameterized types take one or more type parameters. Infernu can infer about code that it requires a certain parameterized type without necessarily fixing the parameter. For example, a[0] = x may be interpreted as “assignment to an array of elements which have the same type as x” (actually Infernu places a weaker assumption on such code - see type classes below).

Type Variables

Type variables represent a type that is not fully constrained to any particular type. A type variable can represent a partial constraint:

The same type variable can be referred to multiple times, constraining all those types to be the same. For example, a -> a - a function returns the same type as its argument.
Sometimes a type variable is constrained by a type class predicate, such as Plus a => (a, a) -> a (the type of the + operator). See type classes below.

Polymorphic vs. Free Type Variables

There are two cases where type variables are needed:

A polymorphic type variable is allowed to vary. For example, a function that can take any type for its argument and returns a String, can be represented as a -> String but the type variable a must be allowed to vary. Being “allowed to vary” means: if we use this function in a way that forces a to be, say, the type Number, doesn’t mean that now a will always be Number: other pieces of code can use our function in a different way, having a be whatever other type. A polymorphic type variable such as this a is known as universally quantified (or “foralled”). The type of that function is really forall a. a -> String, or “given any type, call it a, this function will have the type a -> String” (Currently the pretty-printing mechanism of Infernu doesn’t print out the “forall” parts).
A free (or non-polymorphic) type variable is used when a certain type is not known. If we add more code to the program we may end up determining what the type variable should represent, for example some type variable b that appears throughout the code before the change, may turn out to be a String due to a new line of code that uses it as such.

Free type variables are not very useful, but they represent code that can be made generic (for example by wrapping it in a function).

Polymorphism and Mutable Variables

In short: variables that are assigned to beyond the declaration (mutable variables) are restricted to a non-polymorphic type. Also, some immutable variables are also restricted to non-polymorphic types, for example if they are declared to be an array (because an array is itself mutable).

(You may skip this if you don’t care about type inference details.)

Value Restriction

There is a well known difficulty in combining inference of polymorphic types with mutability (usually reference cells). In Standard ML and family (including OCaml) the issue is handled by the value restriction, which prevents some bindings (term variables) from being generalized to a polymorphic type. The criteria for preventing polymorphism of a binding is a simple syntactic rule (in OCaml this was elaborated, if I understand correctly).

Immutability Assumption

In Infernu, the value restriction isn’t enough: unlike ML-family languages, Infernu’s AST includes mutable bindings. In other languages bindings are actually not variables - they are immutable names of values which themselves could be mutable (such as a reference cell). In Infernu, bindings and reference cells are one and the same. Every name binding can be assigned to, hence the name “variable” is accurate. Treating all JS variables as mutable reference cells would render polymorphism degenerate: all variables would be value-restricted to monomorphic types. Instead, Infernu’s type inference algorithm assumes that a variable is immutable unless it’s assigned to (beyond the variable declaration). If a variable is assigned to, so that it “turns out to be mutable”, we then restrict it to monomorphic types.

Built-in Parameterized Types

[a] - Arrays of items of type a
Map a - Maps from String to a. Maps correspond to plain JS objects that are used via dynamic keys (e.g. obj[key] = value). See the section “row types” for another view of JS objects.

In plain JS, arrays are heterogeneous and can contain a mix of values: numbers, booleans, objects, functions, etc. For example, ['a', 3, { b: [] } ] is a valid JS array. In Infernu, arrays are restricted to be homogeneous: all items must be of the same type.

Arrays, string maps, and plain strings can all be accessed using the bracket notation in JS. Infernu defines instances of all three types for the Indexable type class. More on that later.

Functions

Functions are similar to other parameterized types, but are represented slightly differently to support this and because they can be constructors (when using the new keyword).

Notation:

a.(b,c,...) -> r represents functions that take this of type a, arguments of types b, c, … and return a result of type r.

For constructors, there are also properties one can tack onto the .prototype of a constructor function (support for .prototype is still in the works).

Row types

JavaScript objects can be viewed as either maps from strings (property names) to values, or as records (or “rows”) containing a set of fields, each field having a certain type. Infernu supports both of these two views, maps and rows, but represents them as separate (incompatible) types. This section is about the second view of objects, as rows.

The basic notation for row types is: { prop1: type1, ... , propN: typeN } where prop are strings, names of properties, with their corresponding types. The ellipsis are meant to say that there can be any number of properties.

Row types can be either closed or open:

A closed row type has a set number of properties with specific names: it is not compatible with any other closed row.
An open row type is a row that is known to have some properties, but may have (and is compatible with) more properties, as long as their names don’t collide with the existing properties. An open row type has a row type variable representing the “tail” or remaining properties.

The notation for an open row type is:

{ prop1: type1, ..., propN: typeN, ..r } where r is the row type variable.

Open row types are most useful as function arguments or return types. For example consider the following function:

function getLength(x) {
    return x.length;
}

What is the type of getLength? It expects a row type that a property length, but doesn’t care if x has other properties, nor does it care about the type of length. So the type is: { length: a, ..b } -> a or, in words: a function that takes a row that has at least a property named “length” of type a and returns the type a. The type parameter b isn’t used, but it represents that the input argument is an open row type, and it can have additional properties.

Type Classes

Why do we need type classes?

Some operators in JS can be used in multiple contexts. For example, + can be used to add numbers or to concatenate strings (Infernu doesn’t allow implicit conversions). Consider the following function:

function add(x,y) { return x + y; }

What is the type of add? It isn’t (Number, Number) -> Number) because it could work on strings. We are tempted to say the type is: (a, a) -> a: given two arguments of type a, whatever it is, add returns the same type. However such a permissive type is wrong: we wouldn’t want the user to call add(false, true), for example.

The solution adopted by Infernu is inspired by Haskell’s type classes (which have nothing to do with object-oriented “classes”). A type class in Infernu is like an interface in Java/C#, but without any associated functions (in the future, support for type class functions may be added). A type can be an instance of a type class.

For example, Infernu defines the built-in type class Plus. The types Number and String are instances of the type class Plus. The notation Plus a => a means “given any type a which must be an instance of the type class Plus, the type of this thing is a”.

Back to our add function, the type would be: Plus a => (a, a) -> a or “given any type a that is an instance of the Plus type class, the type of this function is (a,a) -> a.

Type Classes are Closed

Currently all type classes in Infernu are closed. Every type class has a pre-determined set of instances, and no other types can be made an instance. Closed type classes are useful for inference, because they allow determining the full set of types that inhabit a given set of constraints. The Indexable multi-parameter type class below shows how closed type classes are useful.

Indexable

Aside from Plus, Infernu also defines another type class: Indexable. Indexable takes three parameters: the container type, an index type, and an element type. Instances of indexable are:

Indexable ([a], Number, a) - arrays, where the container is an array [a] of elements with type a, and the index type is Number.
Indexable (Map a, String, a) - string maps, where the container is a map from string to a, the index type is String, and the element type is a.
Indexable (String, Number, String) - strings are indexable too: the “container” is a string, the index is a number and the elements at each position are (single-character) strings (since JS has no character type: it may be interesting to add them to Infernu).

Example of using the bracket syntax to read:

//Indexable (c, d, b) => a.((c, d) -> b)
function getAt(thing, i) {
    return thing[i];
}

The bracket get expression thing[i] requires the three types involved (container, index, element) together to be an instance of Indexable.

The Indexable type class simplifies inference because it’s a closed type class. A somewhat contrived example:

//a.((Map b, String) -> b)
function getByKey(thing, key) {
    if (key === 'timestamp') {
        // timestamp deprecated in favor of betterTimestamp
        return thing['betterTimestamp'];
    }
    return thing[key];
}

Because there are only three possible instances of Indexable and only one of them has strings as index types (middle type parameter), Infernu can deduce the exact types involved: thing must be a Map, it cannot be an array or a string. If the Indexable type class wasn’t closed, we would have to allow arbitrary unknown instances that may have strings as index types.

Polymorphic methods / rank-2 row types

TODO

Equi-recursive types

Equi-recursive types are constrained to at least include a row type in the recursion to prevent inference of evil recursive types.