jlouis / graphql-erlang

GraphQL implementation in Erlang.

License: Other

graphql-erlang's Introduction

A GraphQL Server library - in Erlang

This project contains the necessary support code to implement GraphQL servers in Erlang. Its major use is on top of some other existing transport library, for instance the cowboy web server. When a request arrives, it can be processed by the GraphQL support library and a GraphQL answer can be given. In a way, this replaces all of your REST endpoints with a single endpoint: one for Graph Queries.

This README provides the system overview and its mode of operation.

Changelog

See the file CHANGELOG.md in the root of the project.

Status

Currently, the code implements all of the October 2016 GraphQL specification, except for a few areas:

Some validators are missing and pending implementation. The important validators are present, however. All missing pieces are tracked as issues in this repository.
Parametrization inside fragments is not yet fully implemented.

In addition, we are working towards June 2018 compliance. We have already implemented many of the changes, but some parts are still missing. Implementation is currently demand-driven for Shopgun: we tend to implement things when there is a need for them.

Documentation

This is a big library. In order to ease development, we have provided a complete tutorial for GraphQL Erlang:

https://github.com/shopgun/graphql-erlang-tutorial

Also, the tutorial has a book which describes how the tutorial example is implemented in detail:

https://shopgun.github.io/graphql-erlang-tutorial/

NOTE: Read the tutorial before reading on in this repository if you haven't already. This README gives a very quick overview, but the canonical documentation is the book at the moment.

What is GraphQL

GraphQL is a query language for the web. It allows a client to tell the server what it wants in a declarative way. The server then materializes a response based on the client's query. This makes your development client-centric and client-driven, which tends to be a lot faster from a development perspective. A project is usually driven from the top-of-the-iceberg and down, so shuffling more onus on the client side is a wise move in modern system design.

GraphQL is also a contract. Queries and responses are typed and contract-verified on both the input and output side. That is, GraphQL also acts as a contract-checker. This ensures:

No client can provide illegal queries to the server backend. These are filtered out by the GraphQL layer.
No server can provide illegal responses to the client. These are altered such that the client gets a valid response according to the schema by replacing failing nodes with null-values.
The contract documents the API
The contract describes how the client can query data. This is the closest to HATEOAS we will probably get without going there.
Queries tend to be large and all-encompassing. This means we don't pay the round-trip time for each request/response like you do in, e.g., HTTP and HTTP/2 based systems where multiple queries are executed back to back and depend on each other. Almost every query can be handled in a single round trip.

Finally, GraphQL supports introspection of its endpoint. This allows systems to query the server in order to learn what the schema is. In turn, tooling can be built on top of GraphQL servers to provide development-debug user interfaces. Also, languages with static types can use the introspection to derive a type model in code which matches the contract. Either by static code generation, or by type providers.

Whirlwind tour

The GraphQL world specifies a typed schema definition. For instance the following taken from the Relay Modern specification:

interface Node {
  id: ID!
}

type Faction : Node {
  id: ID!
  name: String
  ships: ShipConnection
}

type Ship : Node {
  id: ID!
  name: String
}

type ShipConnection {
  edges: [ShipEdge]
  pageInfo: PageInfo!
}

type ShipEdge {
  cursor: String!
  node: Ship
}

type PageInfo {
  hasNextPage: Boolean!
  hasPreviousPage: Boolean!
  startCursor: String
  endCursor: String
}

type Query {
  rebels: Faction
  empire: Faction
  node(id: ID!): Node
}

input IntroduceShipInput {
  factionId: String!
  shipNamed: String!
  clientMutationId: String!
}

type IntroduceShipPayload {
  faction: Faction
  ship: Ship
  clientMutationId: String!
}

type Mutation {
  introduceShip(input: IntroduceShipInput!): IntroduceShipPayload
}

The schema is a subset of the Star Wars schema given as the typical GraphQL example all over the web. The GraphQL world roughly splits the world into input objects and output objects. Input objects are given as part of a query request by the client. Output objects are sent back from the server to the client.

This Erlang implementation contains a schema parser for schemas like the above. Once parsed, a mapping is provided by the programmer which maps an output type in the schema to an Erlang module. This module must implement a function

-spec execute(Context, Object, Field, Args) ->
    {ok, Response}
  | {error, Reason}.

which is used to materialize said object. That is, when you request a field F in the object O, a call is made to execute(Context, O, F, Args). The value Context provides a global context for the query. It is used for authentication data, origin IP addresses and so on, and the developer can extend it with any field they need. Args provides the arguments for the field. Look, for instance, at the type Mutation and the introduceShip field, which takes an argument input of type IntroduceShipInput!.

Materialization is thus simply a function call in the Erlang world. These calls tend to be used in two ways: Either they acquire a piece of data from a database (e.g., mnesia) and return that data as an Object. Or they materialize fields on an already loaded object. When execution of a query is processed, you can imagine having a "cursor" which is being moved around in the result set and is used to materialize each part of the query.

For example, look at the following query:

query Q {
  node(id: "12098141") {
      ... on Ship {
        id
        name
      }
  }
}

When this query executes, it starts from a developer-provided initial object, typically the empty map #{}. Since the node field is requested, a call is performed to match:

-module(query).

...
execute(Ctx, #{}, <<"node">>, #{ <<"id">> := ID }) ->
    {ok, Obj} = load_object(ID).

Now, since you are requesting the id and name fields on a Ship inside the node, the system will make a callback to a type-resolver for the Obj in order to determine what type it is. We omit that part here, but if it was something else, a faction say, then the rest of the query would not trigger. Once we know that id "12098141" is a Ship, we "move the cursor" to a ship and call the execute function there:

-module(ship).

-record(ship, { id, name }).

execute(_Ctx, #ship{ id = ID }, <<"id">>, _Args) ->
    {ok, ID};
execute(_Ctx, #ship{ name = Name }, <<"name">>, _Args) ->
    {ok, Name}.

Two materialization calls will be made. One for the field <<"id">> and one for the field <<"name">>. The end result is then materialized as a response to the caller.

Materialization through derivation

A common use of the functions is to derive data from existing data. Suppose we extend the ship in the following way:

type Ship {
  ...
  capacity: Float!
  load: Float!
  loadRatio: Float!
}

so a ship has a certain capacity and a current load in its cargo bay. We could store the loadRatio in the database and keep it up to date. But a more efficient way to handle this is to compute it from other data:

-module(ship).

-record(ship,
    { id,
      name,
      capacity,
      load }).

execute(...) ->
  ...;
execute(Ctx, #ship {
                capacity = Cap,
                load = Load }, <<"loadRatio">>, _) ->
    {ok, Load / Cap };
...

This will compute that field if it is requested, but not compute it when it is not requested by a client. Many fields in a data set are derivable in this fashion, especially when a schema changes and grows over time. Old fields can be derived for backwards compatibility and new fields can be added next to them.

In addition, it tends to be more efficient. A sizable portion of modern web work is about moving data around. If you have to move less data, you decrease the memory and network pressure, which can translate to faster service.
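As a minimal, self-contained sketch of the derivation above: the module name ship_ratio_sketch and the zero_capacity error reason are assumptions made for illustration, not part of the library. The only addition over the snippet above is a guard so that a ship with zero capacity yields an error tuple instead of a badarith crash.

```erlang
-module(ship_ratio_sketch).
-export([execute/4]).

-record(ship, {id, name, capacity, load}).

%% Derive loadRatio on demand instead of storing it.
%% A zero capacity is reported as an error (hypothetical reason atom)
%% rather than crashing on division by zero.
execute(_Ctx, #ship{capacity = Cap}, <<"loadRatio">>, _Args) when Cap == 0 ->
    {error, zero_capacity};
execute(_Ctx, #ship{capacity = Cap, load = Load}, <<"loadRatio">>, _Args) ->
    {ok, Load / Cap};
execute(_Ctx, #ship{name = Name}, <<"name">>, _Args) ->
    {ok, Name}.
```

The derived field costs nothing unless a client asks for it, since the clause only runs when <<"loadRatio">> is requested.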

Materializing JOINs

If we take a look at the Faction type, we see the following:

type Faction : Node {
  id: ID!
  name: String
  ships: ShipConnection
}

In this, ships is a field referring to a ShipConnection. A Connection type is the Relay Modern standard for handling a paginated set of objects in GraphQL. As in "Materialization through derivation", we derive this field by looking up the data in the database for the join and then producing an object which the ship_connection_resource can handle. For instance:

execute(Ctx, #faction { id = ID }, <<"ships">>, _Args) ->
    {ok, Ships} = ship:lookup_by_faction(ID),
    pagination:build_pagination(Ships).

where the build_pagination function returns some object which is a generic connection object. It will probably look something along the lines of

#{
  '$type' => <<"ShipConnection">>,
  <<"pageInfo">> => #{
      <<"hasNextPage">> => false,
      ...
  },
  <<"edges">> => [
      #{ <<"cursor">> => base64:encode(<<"edge:1">>),
          <<"node">> => #ship{ ... } },
      ...]
}

which can then be processed further by other resources. Note how we are eagerly constructing several objects at once and then exploiting the cursor moves of the GraphQL system to materialize the fields which the client requests. The alternative is to lazily construct materializations on demand, but when data is readily available anyway, it is often more efficient to just pass pointers along.
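One possible shape for such a helper is sketched below. The module name pagination_sketch is hypothetical, and the sketch assumes the whole result set is already loaded, so both hasNextPage and hasPreviousPage are always false. Cursors follow the base64-encoded "edge:N" shape shown above, and the '$type' value is hard-coded for brevity.

```erlang
-module(pagination_sketch).
-export([build_pagination/1]).

%% Build a generic connection object from a fully loaded list of nodes.
%% Assumption: there is only ever a single page.
build_pagination(Nodes) when is_list(Nodes) ->
    Indexed = lists:zip(lists:seq(1, length(Nodes)), Nodes),
    Edges = [#{ <<"cursor">> => cursor(I), <<"node">> => Node }
             || {I, Node} <- Indexed],
    {ok, #{ '$type' => <<"ShipConnection">>,
            <<"pageInfo">> => page_info(Edges),
            <<"edges">> => Edges }}.

%% Cursor for the edge at 1-based position I.
cursor(I) ->
    base64:encode(<<"edge:", (integer_to_binary(I))/binary>>).

page_info([]) ->
    #{ <<"hasNextPage">> => false, <<"hasPreviousPage">> => false,
       <<"startCursor">> => null, <<"endCursor">> => null };
page_info(Edges) ->
    #{ <<"cursor">> := Start } = hd(Edges),
    #{ <<"cursor">> := End } = lists:last(Edges),
    #{ <<"hasNextPage">> => false, <<"hasPreviousPage">> => false,
       <<"startCursor">> => Start, <<"endCursor">> => End }.
```

A real implementation would take the pagination arguments (first, after and so on) into account and slice the list before building edges.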

API

The GraphQL API is defined in the module graphql. All functionality is exported from that module. Do not call into other modules directly, as their functionality can change at any point in time, even between major releases.

The system deliberately splits each phase and hands it over to the programmer. This allows you to debug a bit easier and gives the programmer more control over the parts. A typical implementation will start by using the schema loader:

inject() ->
  {ok, File} = application:get_env(myapp, schema_file),
  Priv = code:priv_dir(myapp),
  FName = filename:join([Priv, File]),
  {ok, SchemaData} = file:read_file(FName),
  Map = #{
    scalars => #{ default => scalar_resource },
    interfaces => #{ default => resolve_resource },
    unions => #{ default => resolve_resource },
    objects => #{
      'Ship' => ship_resource,
      'Faction' => faction_resource,
      ...
      'Query' => query_resource,
      'Mutation' => mutation_resource
    }
  },
  ok = graphql:load_schema(Map, SchemaData),
  Root = {root,
    #{
      query => 'Query',
      mutation => 'Mutation',
      interfaces => []
    }},
  ok = graphql:insert_schema_definition(Root),
  ok = graphql:validate_schema(),
  ok.

This will set up the schema in the code by reading it from a file on disk. Each of the _resource names refers to a module which implements the backend code.

In order to execute queries on the schema, code such as the following can be used. We have a query document in Doc and we have a requested operation name in OpName and parameter variables for the given op in Vars. The variables Req and State are standard cowboy request and state tracking variables from cowboy_rest.

run(Doc, OpName, Vars, Req, State) ->
  case graphql:parse(Doc) of
    {ok, AST} ->
      try
          {ok, #{fun_env := FunEnv,
                ast := AST2 }} = graphql:type_check(AST),
          ok = graphql:validate(AST2),
          Coerced = graphql:type_check_params(FunEnv, OpName, Vars),
          Ctx = #{ params => Coerced, operation_name => OpName },
          Response = graphql:execute(Ctx, AST2),
          Req2 = cowboy_req:set_resp_body(encode_json(Response), Req),
          {ok, Reply} = cowboy_req:reply(200, Req2),
          {halt, Reply, State}
      catch
            throw:Err ->
                err(400, Err, Req, State)
      end;
    {error, Error} ->
        err(400, {parser_error, Error}, Req, State)
  end.

Conventions

In this GraphQL implementation, keys are of type binary() by default. This choice is deliberate, since it makes the code more resistant to atom() overflow and also avoids some conversions between binary() and atom() values in the system. A later version of the library might redesign this aspect, but we are somewhat stuck with it for now.

However, there are many places where you can input atom values and then have them converted internally by the library into binary values. This greatly simplifies a large number of data entry tasks for the programmer. The general rules are:

If you supply a value to the system and it is an atom(), the internal representation is a binary value.
If the system hands you a value, it is a binary() value and not an atom().

Middlewares

This GraphQL system does not support middlewares, because it turns out the system's design is flexible enough that middlewares can be implemented by developers themselves. The observation is that any query runs through the Query type and thus a query_resource. Likewise, any Mutation factors through the mutation_resource.

As a result, you can implement middlewares by using the execute/4 function as a wrapper. For instance you could define a mutation function as:

execute(Ctx, Obj, Field, Args) ->
    AnnotCtx = perform_authentication(Ctx),
    execute_field(AnnotCtx, Obj, Field, Args).

The reason this works so well is because we are able to use pattern matching on execute/4 functions and then specialize them. If we had an individual function for each field, then we would have been forced to implement middlewares in the system, which incurs more code lines to support.

More complex systems will define a stack of middlewares in a list and run them one by one. As an example, a clientMutationId is part of the Relay Modern specification and must be present in every mutation. You can build your mutation_resource such that it runs a maps:take/2 on the argument input, runs the underlying mutation, and then adds back the clientMutationId afterwards.
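The clientMutationId handling could look roughly like the following sketch. The module name mutation_mw_sketch, the execute_field/4 helper and the missing_client_mutation_id error reason are hypothetical, and the underlying mutation is a stand-in that only echoes the ship name back.

```erlang
-module(mutation_mw_sketch).
-export([execute/4]).

%% Wrapper: strip clientMutationId from the input, run the real mutation,
%% then put the id back on the payload.
execute(Ctx, Obj, Field, #{ <<"input">> := Input } = Args) ->
    case maps:take(<<"clientMutationId">>, Input) of
        {MutationId, Rest} ->
            case execute_field(Ctx, Obj, Field, Args#{ <<"input">> := Rest }) of
                {ok, Payload} when is_map(Payload) ->
                    {ok, Payload#{ <<"clientMutationId">> => MutationId }};
                Other ->
                    %% Errors pass through untouched.
                    Other
            end;
        error ->
            {error, missing_client_mutation_id}
    end.

%% Hypothetical underlying mutation; a real one would write to a database.
execute_field(_Ctx, _Obj, <<"introduceShip">>,
              #{ <<"input">> := #{ <<"shipNamed">> := Name } }) ->
    {ok, #{ <<"ship">> => #{ <<"name">> => Name } }};
execute_field(_Ctx, _Obj, _Field, _Args) ->
    {error, unknown_mutation}.
```

Because the wrapper is an ordinary function clause, the individual mutations never need to know about clientMutationId at all.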

Schema Definitions

This GraphQL implementation follows the Jun2018 specification for defining a schema. In this format, one writes the schema according to specification, including doc-strings. What was represented as tags in an earlier implementation of GraphQL for Erlang is now represented as a @directive annotation, as per the specification.

As an example, you can write something along the lines of:

"""
A Ship from the Star Wars universe
"""
type Ship : Node {
  "Unique identity of the ship"
  id: ID!

  "The name of the ship"
  name: String
}

And the schema parser knows how to transform this into documentation for introspection.

Resource modules

The following section documents the layout of resource modules as they are used in GraphQL, and what they are needed for in the implementation.

Scalar Resources

GraphQL contains two major kinds of data: objects and scalars. Objects are product types where each element in the product is a field. Raw data are represented as scalar values. GraphQL defines a number of standard scalar values: boolean, integers, floating point numbers, enumerations, strings, identifiers and so on. But you can extend the set of scalars yourself. The schema will contain something along the lines of

scalar Color
scalar DateTime

and so on. These are mapped onto resource modules handling scalars. It is often enough to provide a default scalar module in the mapping and then implement two functions to handle the scalars:

-module(scalar_resource).

-export(
  [input/2,
    output/2]).

-spec input(Type, Value) -> {ok, Coerced} | {error, Reason}
  when
    Type :: binary(),
    Value :: binary(),
    Coerced :: any(),
    Reason :: term().
input(<<"Color">>, C) -> color:coerce(C);
input(<<"DateTime">>, DT) -> datetime:coerce(DT);
input(Ty, V) ->
    error_logger:info_report({coercing_generic_scalar, Ty, V}),
    {ok, V}.

-spec output(Type, Value) -> {ok, Coerced} | {error, Reason}
  when
    Type :: binary(),
    Value :: binary(),
    Coerced :: any(),
    Reason :: term().
output(<<"Color">>, C) -> color:as_binary(C);
output(<<"DateTime">>, DT) -> datetime:as_binary(DT);
output(Ty, V) ->
    error_logger:info_report({output_generic_scalar, Ty, V}),
    {ok, V}.

Scalar Mappings allow you to have an internal and external representation of values. You could for instance read a color such as #aabbcc, convert it into #{ r => 0.66, g => 0.73, b => 0.8 } internally and back again when outputting it. Likewise a datetime object can be converted to a UNIX timestamp and a timezone internally if you want. You can also handle multiple different ways of coercing input data, or have multiple internal data representations.
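To make the color example concrete, here is a sketch of what a module behind color:coerce/1 and color:as_binary/1 might look like. The module name color_sketch and the invalid_color error reason are assumptions. Only the six-digit #rrggbb form is accepted, and channels are stored as floats in the range 0.0 to 1.0.

```erlang
-module(color_sketch).
-export([coerce/1, as_binary/1]).

%% Input direction: <<"#aabbcc">> -> #{r, g, b} with float channels.
coerce(<<"#", Hex:6/binary>> = Bin) ->
    case lists:all(fun is_hex/1, binary_to_list(Hex)) of
        true ->
            <<R:8, G:8, B:8>> = <<(binary_to_integer(Hex, 16)):24>>,
            {ok, #{ r => R / 255, g => G / 255, b => B / 255 }};
        false ->
            {error, {invalid_color, Bin}}
    end;
coerce(Other) ->
    {error, {invalid_color, Other}}.

%% Output direction: back to the external #rrggbb form (lowercase hex).
as_binary(#{ r := R, g := G, b := B }) ->
    Hex = [io_lib:format("~2.16.0b", [round(C * 255)]) || C <- [R, G, B]],
    {ok, iolist_to_binary([$# | Hex])}.

is_hex(C) ->
    (C >= $0 andalso C =< $9) orelse
    (C >= $a andalso C =< $f) orelse
    (C >= $A andalso C =< $F).
```

Both functions return tagged tuples, so they can be called directly from the input/2 and output/2 clauses shown above.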

Type resolution Resources

For GraphQL to function correctly, we must be able to resolve types of concrete objects. This is because the GraphQL system allows you to specify abstract interfaces and unions. An example from the above schema is the Node interface which is implemented by Ship and Faction among other things. If we are trying to materialize a node, the GraphQL system must have a way to figure out the type of the object it is materializing. This is handled by the type resolution mapping:

-module(resolve_resource).

-export([execute/1]).

%% The following is probably included from a header file in a real
%% implementation
-record(ship, {id, name}).
-record(faction, {id, name}).

execute(#ship{}) -> {ok, <<"Ship">>};
execute(#faction{}) -> {ok, <<"Faction">>};
execute(_Obj) ->
    {error, unknown_type}.

Output object Resources

Each (output) object is mapped onto an Erlang module responsible for handling field requests in that object. The module looks like:

-module(object_resource).

-export([execute/4]).

execute(Ctx, SrcObj, <<"f">>, Args) ->
    {ok, 42};
execute(Ctx, SrcObj, Field, Args) ->
    default.

The only function which is needed is the execute/4 function which is called by the system whenever a field is requested in that object. The 4 parameters are as follows:

Ctx - The context of the query. It contains information pertaining to the current position in the Graph, as well as user-supplied information from the start of the request. It is commonly used as a read-only store for authentication/authorization data, so you can limit what certain users can see.
SrcObj - The current object on which we are operating. Imagine we have two ships, a B-wing and an X-wing. Even if we request the same fields on the two ships, the SrcObj is going to be different. GraphQL often proceeds by having certain fields fetch objects out of a backing store and then moving the cursor onto that object and calling the correct object resource for that type. The SrcObj is set to point to the object that is currently being operated upon.
Field - The field in the object which is requested.
Args - A map of field arguments. See the next section.

Field Argument rules

In GraphQL, field arguments follow a specific pattern:

Clients have no way to input a null value. The only thing they can do is to omit a given field in the input. In particular, clients must supply a field which is non-null.
Servers always see every field in the input, even if the client doesn't supply it. If the client does not supply a field, and it has no default value, the server sees a null value for that field.

This pattern means there is a clear way for the client to specify "no value" and a clear way for the server to work with the case where the client specified "no value". It eliminates corner cases where you have to figure out what the client meant.

Resolution follows a rather simple pattern in GraphQL. If a client omits a field and it has a default value, the default value is input. Otherwise null is input. Clients must supply every non-null field.

Note: This limitation is lifted in the Jun2018 GraphQL specification, but this server doesn't implement that detail yet.

On the server side, we handle arguments by supplying a map of KV pairs to the execute function. Suppose we have an input such as

input Point {
    x: Float = 4.0
    y: Float
}

The server can handle this input by matching directly:

execute(Ctx, SrcObj, Field,
    #{ <<"x">> := XVal, <<"y">> := YVal }) ->
  ...

This will always match. If the client provides the input {} which is the empty input, XVal will be 4.0 due to the default value. And YVal will be null. If the client supplies, e.g., { x: 2.0, y: 7.0 } the map #{ <<"x">> => 2.0, <<"y">> => 7.0 } will be provided.
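The resolution rule can be illustrated with a small standalone sketch. This is not the library's internal code: the module name args_sketch, the {default, Value} and no_default markers, and the list-of-pairs field description are all assumptions made for the example.

```erlang
-module(args_sketch).
-export([resolve/2]).

%% FieldSpecs: [{Name, {default, Value} | no_default}] (hypothetical shape).
%% Input: the map of fields the client actually supplied.
%% Result: a map containing every declared field, so callers can match with :=.
resolve(FieldSpecs, Input) ->
    maps:from_list([{Name, value(Name, Default, Input)}
                    || {Name, Default} <- FieldSpecs]).

value(Name, Default, Input) ->
    case maps:find(Name, Input) of
        {ok, V} ->
            V;                       % client supplied a value
        error ->
            case Default of
                {default, D} -> D;   % omitted, schema default applies
                no_default -> null   % omitted, no default: server sees null
            end
    end.
```

For the Point input above, an empty client input resolves to x being 4.0 and y being null, while a fully supplied input passes through unchanged.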

Tips & Tricks

The execute function allows you to make object-level generic handling of fields. If, for example, your SrcObj is a map, you can do generic lookups by using the following handler:

execute(_Ctx, Obj, Field, _Args) ->
    case maps:get(Field, Obj, not_found) of
      not_found -> {ok, null};
      Val -> {ok, Val}
    end.

Another trick is to use generic execution to handle "middlewares" - See the appropriate section on Middlewares.

System Architecture

Most other GraphQL servers provide no type->module mapping. Rather, they rely on binding of individual functions to fields. The implementation began with the same setup, but it turns out pattern matching is a good fit for the notion of requesting different fields inside an object. Thus, we use pattern matching as a destructuring mechanism for incoming queries.

Schema

Internally, the system parses the schema into an ETS table, on which it can perform queries in parallel to satisfy multiple requests at the same time.

A schema injector allows the developer to parse a schema from a file or from memory, then bind execution modules to the schema's types. Once finished, the schema is finalized, which runs a lint check over the schema and rejects schemas which are nonsensical.

Query

A query is treated as a compiler chain, which is a design that fits Erlang well. Compilers rely a lot on pattern matching, so we can process a query symbolically by matching on it and gradually transforming it into a query plan which can then be executed.

A lexer tokenizes the query
A parser constructs an AST of the query from the token stream
A type checker walks the AST and attaches type information to the AST by looking up data in the schema ETS table. The pass also detects type and validation errors. The type checker is written in a bi-directional style so it flips between inference, in which we deduce the type of a term, and checking, in which we verify a term has a given type.
A validator performs additional linting. Many queries are type-correct and thus executable, but are still malformed because they have nonsensical parts in them. The validator phase rejects such queries.
A query plan is formed from the AST.
An executor runs the query plan.

Of these tasks, only the execution phase in the end is performance-critical. Clients can pre-load query documents to the server, which means the document acts as a stored procedure on the server side. The server can then do parsing, elaboration, type checking and validation once and for all at load time. In addition it provides a security measure: clients in production can only call a pre-validated set of queries if so desired.
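The compiler-chain idea can be sketched generically as a fold over named phases. This is an illustration only and not how the library wires its phases together; the module name pipeline_sketch and the {Name, Fun} phase representation are assumptions.

```erlang
-module(pipeline_sketch).
-export([run/2]).

%% Run named phases left to right. Each phase is {Name, Fun} where Fun
%% returns {ok, Next} or {error, Reason}. The first failure stops the chain
%% and is tagged with the name of the phase that rejected the input.
run(Phases, Input) ->
    lists:foldl(fun step/2, {ok, Input}, Phases).

step({Name, Fun}, {ok, Value}) ->
    case Fun(Value) of
        {ok, Next} -> {ok, Next};
        {error, Reason} -> {error, {Name, Reason}}
    end;
step(_Phase, {error, _} = Failure) ->
    %% An earlier phase already failed; skip the rest.
    Failure.
```

With this shape, pre-loading a document amounts to running every phase except the last one at load time and storing the result.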

User Interface

GraphQL has some very neat JavaScript tooling which plugs into the introspection of a GraphQL server and provides additional functionality:

GraphiQL - Provides a query interface with autocompletion, documentation, debugging, ways to execute queries and so on. It is highly recommended you add such a system in staging and production as it is indispensable if you are trying to figure out a new query or why a given query returned a specific kind of error.

Additionally, Relay Modern provides specifications for cache refreshing, pagination, mutation specifications and so on. It is recommended you implement those parts in your system as it is part of a de-facto standard for how GraphQL servers tend to operate.

Tests

The GraphQL project has an extensive test suite. We prefer adding regressions to the suite as we experience them. Some of the tests are taken from the official GraphQL repository and translated. More work is definitely needed, but in general new functionality should be provided together with a test case that demonstrates the new functionality.

The general tests are:

dungeon_SUITE which implements a "MUD" style dungeon backend. It is used as a way to handle most of the test cases we cook up ourselves. It is driven by a schema and uses a query document for its queries. If you don't know where to add a test, this is a good place.
enum_SUITE Taken from the official Node.js de-facto implementation, this suite uses the "colors" schema in order to verify certain hard-to-get-right properties about enumerated data types.
graphql_SUITE Main suite for things which don't fit elsewhere. Checks lexing/parsing, "Hello World" style queries and also introspection.
star_wars_SUITE An implementation from the specification of the Star Wars universe. Allows us to verify queries from the specification in our own code. Also part of the Node.js de-facto implementation, so it is easy for us to transplant a test from there to here.
validation_SUITE GraphQL contains a lot of different validation checks. We handle some of these in the type checker and some in a validation pass. The tests here are mostly verifying parts of the specification. It uses the "Pet" schema as a base.

graphql-erlang's Issues

5.7.2 Variable Default Values Are Correctly Typed

Handle this case.

(NOTE: I think this is already handled now in the type checker).

5.4.2.3.2 Abstract Spreads in Object Scope

Handle these cases

Sometimes, `__typename` is not valid in introspection

We need to investigate this. There are a couple of places where you cannot expand __typename as you should because the introspection part reports it as an invalid field. This is obviously a bug.

Reproduce the bug in a test case so we make sure we nail it for good.
Fix the bug.

Resolver error should propagate its failure correctly

The current system resolves the error but makes the error propagate too far. It should turn that error into a null value. This is a rather simple fix, which should improve error handling by quite a lot in the responses.

I found the problem toying with the star wars tutorial yesterday.

5.4.1.4 - Fragments must be used

The system must reject the following query, which has a fragment which is never used in the system.

fragment nameFragment on Dog { # unused
  name
}

{
  dog {
    name
  }
}

heavy lager dependencies

I am exploring using graphql on an embedded device, and the dependence on lager pulls in approximately 2MB of .beam (lager->goldrush->compiler), whereas graphql itself is only 0.38MB.

I see lager is used in only 4 places. Is depending on lager crucial here? I've never used lager before so I don't know what it is bringing to the table.

5.7.4 - All variable uses defined

Variable uses should propagate into Fragments as well. This is related to #70 and should probably be handled in the same patch. Many of the things can be pushed around in the type checker if you start building "function specifications" for fragments.

Invalid description results in unhelpful error message

Given this invalid annotation:

# Notice the uppercase "Text"
+description(Text: "Fail")

This makes graphql:load_schema fail with {error, {schema_canonicalize, {error, badarg}}}, which does not help the user track down the error.
I'm not sure what's the right fix for this one.

5.5.1 Input Object Field Uniqueness

The system must reject e.g.,

{
  field(arg: { field: true, field: false })
}

Because it isn't unique. This belongs as a check in type-checking when checking input objects, if it is not already there.
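A uniqueness check of this kind might be sketched as follows. It works on a plain list of {Name, Value} pairs in source order, which is an assumed representation and not the library's AST, and the module name uniq_sketch and the not_unique error reason are hypothetical.

```erlang
-module(uniq_sketch).
-export([check_unique/1]).

%% Fields: [{Name, Value}] in source order.
%% Returns ok, or an error naming the first field that repeats.
check_unique(Fields) ->
    check_unique(Fields, #{}).

check_unique([], _Seen) ->
    ok;
check_unique([{Name, _Value} | Rest], Seen) ->
    case maps:is_key(Name, Seen) of
        true -> {error, {not_unique, Name}};
        false -> check_unique(Rest, Seen#{ Name => true })
    end.
```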

5.7.6 - All Variable uses are allowed

In operations you can have variables $var1, $var2 and so on. When they are used, transitively, it is important that they are used in ways that don't break the type system. We are currently doing some of the checking work, but not the transitive follows into fragments. This has to be done.

5.7.1 Variable Uniqueness

A fairly straightforward case. We already have a uniqueness checker in the code base, we just have to call it before we proceed. So I think this task is among the low-hanging fruit.

Empty parenthesis in query definitions result in server crash

Defining a query node like this will crash the GraphQL server:

type Query {
  # …
  nodes(): [Node]!
}

This works:

type Query {
  # …
  nodes: [Node]!
}

Is this valid or should it error with a reason?

Passing null variable into fields with default argument

Not sure if this is a bug or a feature.

Given the schema:

type Query {
  articles(includeDeleted: Boolean = false): [Article]
}

And the query:

query ListArticles($includeDeleted: Boolean) {
   articles(includeDeleted: $includeDeleted) { title }
}

Calling the query without providing $includeDeleted will result in null being passed into articles( i.e: articles(includeDeleted: null) while includeDeleted was specified to default to false. Changing the above query to:

query ListArticles($includeDeleted: Boolean = false) {
   articles(includeDeleted: $includeDeleted) { title }
}

will achieve the desired result but that's a repetition of default value.

GraphQL commit: 135657f745254198f3d07e35fd3a63623144b119

Argument resolver crashes when omitting a nullable input field of type list

The schema snippet below sets the stage:

input I {
     nonNull: String!
     list: [String]
}

type Mutation {
     m(input: I!): ...
}

GraphQL crashes with runtime error {case_clause, null} in graphql_execute.erl when attempting to perform this mutation:

mutation M {
  m(input: {nonNull: "non-nullable string"}) {
    ...
  }
}

5.6.1 Directives Are Defined

Handle this case

Parser bugs in anonymous invocation

Execution of a test such as

query { hello }

Currently fails to get through the parser. This is wrong and should be remedied. It is currently not known when this error came into existence.

Reproduce the error in a test case
Write a fix for the parser

The bug is clearly in the parser since the parser rejects the above document. But it shouldn't reject the above document as it is valid. The document

{ hello }

is accepted and this could be a hint as to what is going on here.

5.4.2.3.3 Object Spreads In Abstract Scope

Handle these cases

Validation Tracking

Background

We need to have a place where we track validations. We begin this work by building the complete list, then mark what we already have covered. Things not covered are pulled into their own issues for further tracking.

Checklist

From the Oct2016 specification:

Subscription support

Is there any reason why subscription is not supported? All the code is already there, and it seems to be intentionally ignored in graphql_execute and graphql_schema_canonicalize.

Would a trivial pull request to enable it to have identical semantics to query be accepted? Or did I overlook something non-obvious with subscription?

5.7.3 Variables Are Input Types

I think this one is handled as well, but it does require a test in the validation suite.

Concurrent execution: add timeouts

Our (experimental) concurrent execution currently waits 750ms for some event to happen. This value is arbitrarily chosen, and if the deadline passes, we fail the query. We'd like each defer statement to tell the engine what deadline it expects. This will allow us to manage deadlines in the system and run with higher deadlines.

This issue tracks what there is to be done in order to solve this problem.

Introduce a deadline timeout to defer results
Gather all timeouts, pick the maximal one
Handle timeout extension if parts of the query require a larger timeout
Resolve deadlines by removing appropriate closures: invoke an {error, timeout} as the response result and feed that to the closure. It will then resolve itself properly, but as if an error occurred.
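
As a rough illustration of the steps above, the following sketch picks the largest requested timeout and resolves pending closures with {error, timeout}. The module name, the function names, and the representation of closures as one-argument funs are assumptions made for this example; they are not the engine's actual API.

```erlang
-module(defer_deadline_sketch).
-export([max_timeout/2, expire/1]).

%% Hypothetical helper: pick the largest timeout (in ms) requested by any
%% defer. Falls back to Default when no defer requested one.
max_timeout([], Default) -> Default;
max_timeout(Timeouts, _Default) -> lists:max(Timeouts).

%% Hypothetical helper: when the deadline passes, feed {error, timeout}
%% to each pending closure, as if its resolver had returned an error.
%% Each closure is assumed to be a fun of one argument.
expire(Closures) ->
    [Closure({error, timeout}) || Closure <- Closures].
```

With this shape, defer_deadline_sketch:max_timeout([250, 1000, 500], 750) selects 1000, and an empty list keeps the 750ms default.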

5.4.2.3 Fragment spread is possible

A fragment spread is written ...Frag, and the fragment it refers to is defined as fragment Frag on Dog { ... }. The fragment has a type (in this case Dog), and the context in which the spread occurs also has a type (not shown here). For a query to be sensible, they must match. In short, there must be a way for the fragment to eventually match. Otherwise the fragment is wrong.

There is code for this in the system already. The obvious place to check this is the type checking phase: look up the fragment type and expand it into its possible types (unions, interfaces, objects). Do the same with the spread and look for a valid overlap.

However, it isn't clear that this is currently handled, so:

Provide a test case for the validator
Eventually implement the validator, preferably as part of a type check
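
The overlap test described above could look roughly like this. It is a minimal sketch, assuming both the fragment type and the spread's scope have already been expanded into lists of concrete object type names; the module and function names are hypothetical and are not taken from the library.

```erlang
-module(spread_overlap_sketch).
-export([spread_possible/2]).

%% Both arguments are lists of concrete object type names that an
%% (already expanded) union, interface, or object type can stand for.
%% The spread is possible when the two sets share at least one type.
spread_possible(FragmentTypes, ScopeTypes) ->
    not ordsets:is_disjoint(ordsets:from_list(FragmentTypes),
                            ordsets:from_list(ScopeTypes)).
```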

Consider coercion of integers to floats in float context

If we return an integer in a position which expects a float, we currently do no conversion. But it is possible to just convert the integer in this case. Consider this.
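
One possible shape for such a coercion is sketched below. The module name, function name, and error term are assumptions for illustration only and do not reflect the library's current behaviour.

```erlang
-module(float_coerce_sketch).
-export([coerce_float/1]).

%% Floats pass through unchanged.
coerce_float(F) when is_float(F) -> {ok, F};
%% Integers are widened with the float/1 BIF. Note that float/1 raises
%% an error for integers too large to represent as a float.
coerce_float(I) when is_integer(I) -> {ok, float(I)};
%% Anything else is rejected with a hypothetical error term.
coerce_float(_) -> {error, not_a_float}.
```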

5.6.2 Directives Are In Valid Locations

Handle this case

tag 0.8.0 ?

Maybe you could tag the 0.8.0 version? Is there any reason why not?

Support user defined enum variant values?

Commit 6bdc1b6 introduces the variants/1 function in graphql_schema_parse, which assigns a numeric value to each enum variant, starting with 0 and incrementing by 1 for each consecutive variant. This reduces boilerplate code dramatically, but unfortunately, this setup:

Makes the ordering of variants in the .graphql schema file important
Requires the maintenance of a separate enum value "translation" for components that do not interact with objects through the graph.
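
To make the ordering concern concrete, here is a minimal sketch of positional numbering. It is not the code from commit 6bdc1b6; the module and function names are hypothetical and only mirror the behaviour described above.

```erlang
-module(enum_variants_sketch).
-export([number_variants/1]).

%% Pair each variant name with its position in the list, starting at 0.
%% Reordering the input list therefore changes the assigned values.
number_variants(Names) ->
    lists:zip(Names, lists:seq(0, length(Names) - 1)).
```

Here number_variants(['A', 'B']) yields [{'A', 0}, {'B', 1}], while number_variants(['B', 'A']) yields [{'B', 0}, {'A', 1}], so any component that stored the old numbers would silently disagree with the graph.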

Currently, this affects moose, as moose needs to be able to distinguish and search among different authentication method representations, which are parametrized by the AuthenticationMethodType enum. Prior to the new way of doing things(TM), variant values were explicitly defined and could be canonicalized as this example shows:

{enum, { ..., 
    values => #{ 
                'AUTHENTICATION_METHOD_FACEBOOK' =>
                    #{ value => gryphon_person:authentication_method_type(facebook),
                       description => "Authentication method facilitated by Facebook."
                    },
                  ...
        }
    }
}

@jlouis Can we come up with a solution allowing variant representation values to be determined or looked up outside of the graphql?

Execute: when completing scalar values, do not return {error, Reason}

This is wrong because it messes with the object. They become null as if the field was non_null, yet that might not be the case.

Get on hex.pm

Publish this package (and the tutorial) on hex.pm.

Wrong (deep) variable expansion

Consider the following query:

mutation C($n : String) { createUser(input:{name: $n}) { user { id name }}}

This erroneously makes a call like the following:

#{<<"name">> => {var,{name,34,<<"n">>}}}

which should have been expanded. To fix this, we need to understand what is happening in the expansion system inside the type checker.

Split the graph schema file in one file per object

Add the possibility to split the graph schema into different files; those files must be joined and evaluated as a single schema file.
As @jlouis suggested, we could use a directive at the top of each file, for example package my-graphql-erlang, so that every file that references that package ends up in my-graphql-erlang.graphql.

Handle queries which have parameters but no operation name

A query such as

query ($n : Int) {
    grabStuff(size: $n) {
       id
    }
}

is produced by Apollo. It fails because we expect something like query N(...) { ... } where the query has a name. Check:

This is allowed in the first place in the specification (Both 2016 and the current draft spec)
Write a test case for the right behaviour
Implement the change in the parser
Verify that the semantics are still correct in this case (the type checker and the execution phase have to process parameters as usual).

Missing query and mutation type result in unrelated error message

A schema with missing mutation and query types will result in the error schema_not_found, even though the schema file does exist. We should consider a better error for this scenario.

Resolvers should be able to return auxiliary data

We used to be able to send auxiliary data back from a resolver like so:

{ok, Result, [AuxiliaryData]}

This data would show up under the aux field in the query result map, but this functionality seems to be lost in a recent commit. We need to:

Make a unit test
Implement it for the new resolver code

5.7.5 All Variables Used

This task depends on Issue #107. Once that is done, then this is trivial to satisfy. You walk the document, and build a union set over the variables in it. A fragment contains a type signature which transitively knows about its variable uses.

Once the full set of used variables is built, subtract it from the set of variables the operation declares. If the result is the empty set, then every variable has at least one use.
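
The set subtraction could be sketched as follows, assuming the walk over the document has already produced a flat list of used variable names. The module and function names are hypothetical.

```erlang
-module(unused_vars_sketch).
-export([unused/2]).

%% Declared: variable names declared by the operation.
%% Used: variable names collected by walking the document, including
%% those reached transitively through fragments.
%% Returns the declared variables that are never used; an empty list
%% means the validation passes.
unused(Declared, Used) ->
    ordsets:subtract(ordsets:from_list(Declared),
                     ordsets:from_list(Used)).
```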

Make it possible to serve multiple graphql schemas on a node

Currently it is only possible to serve one GraphQL schema on an Erlang node. It would be useful to be able to run multiple GraphQL schemas on a single node for a couple of reasons:

In an HTTP context, different schemas could be served from different endpoints
In a test context, tests could be run in parallel

This would require all the internal lookups to be name-spaced.

Proposal: Rename atom null -> nil

Hi,

this might seem a little counter-intuitive at first, but hear me out:

Using graphql-erlang from Elixir is currently slightly pesky, since one has to either traverse the whole tree/list of results to replace null with nil or nastily patch JSON encoding to treat null as null instead of the default Atom -> binary "null".

I have been going back and forth with some hacky solutions, but thought maybe it's worth having this discussion upstream here.

I am happy to help by sending in a PR with a project-wide rename of the null atom to nil. It would give the folks over in Elixir land instant compatibility to bind to this excellent library.

Here is something to have a laugh about while pondering. (It's all ALGOL W's fault.)

N+1 SQL queries

The following query will require 1 SQL query to get the books and N queries to get each author.

{
    books(genre: "comedy") {
        title
        author {
            name
        }
    }
}

Is there a way to make only 1 or 2 SQL queries with graphql-erlang?

Crash while creating error message for type error in named query without operationName

That was a handful to type.

Basically, given a named query:

query FindSomething($input: Input!) {
  find(input: $input)
}

$input contains a type error (e.g: negative number for a uint type).
operationName is not given by the client and following graphql-erlang-tutorial, it is set to undefined.

Given this query, graphql will crash with:

#{class => error,
  error => {case_clause,{undefined}},
  module => my_web_handler_module,
  stacktrace =>
      [{graphql_err,'-path/1-F/1-0-',1,
                    [{file,"/home/bullno1/Projects/nm-exchange-api/_build/default/lib/graphql/src/graphql_err.erl"},
                     {line,43}]},
       {graphql_err,'-path/1-lc$^2/1-2-',2,
                    [{file,"/home/bullno1/Projects/nm-exchange-api/_build/default/lib/graphql/src/graphql_err.erl"},
                     {line,60}]},
       {graphql_err,path,1,
                    [{file,"/home/bullno1/Projects/nm-exchange-api/_build/default/lib/graphql/src/graphql_err.erl"},
                     {line,60}]},
       {graphql_err,mk,3,
                    [{file,"/home/bullno1/Projects/nm-exchange-api/_build/default/lib/graphql/src/graphql_err.erl"},
                     {line,20}]},
       {graphql_err,abort,3,
                    [{file,"/home/bullno1/Projects/nm-exchange-api/_build/default/lib/graphql/src/graphql_err.erl"},
                     {line,16}]},
       {graphql_type_check,check_input_object_fields,4,
                           [{file,"/home/bullno1/Projects/nm-exchange-api/_build/default/lib/graphql/src/graphql_type_check.erl"},
                            {line,232}]},
       {graphql_type_check,'-tc_params/3-fun-0-',4,
                           [{file,"/home/bullno1/Projects/nm-exchange-api/_build/default/lib/graphql/src/graphql_type_check.erl"},
                            {line,131}]},
       {lists,foldl,3,[{file,"lists.erl"},{line,1263}]}]}

Which I guess is while it's trying to build the error path, hence the undefined case clause.
After I give operationName a correct value, graphql reports the type error correctly without crashing.

GraphQL commit: 135657f745254198f3d07e35fd3a63623144b119

5.6.3 Directives Are Unique Per Location

Handle this case

Graphql builtins suffer from encoding issues (breaks server/clients)

Hi everyone,

I am running against the latest master version.

And I am seeing strange encoding problems when querying the default responses with GraphiQL.
For demonstration purposes I used your graphql-erlang-tutorial application.

So this is the response header that cowboy sends for the introspection query.
It states that content will be utf-8 encoded:

HTTP/1.1 200 OK
server: Cowboy
date: Tue, 19 Dec 2017 18:44:19 GMT
content-encoding: gzip
content-length: 4069
content-type: application/json; charset=utf-8
vary: accept

snip

But inside the response, documentation annotations can be found that have encoding issues.
Here is one example of the introspection of the SCALAR type Float:

{"description":"Floating point values, IEEE 754, but not �infty, nor NaN","enumValues":null,"fields":null,"inputFields":null,"interfaces":null,"kind":"SCALAR","name":"Float","possibleTypes":null}

When looking up the source for this string, I found this in src/graphql_builtins.erl:15:

    	description => "Floating point values, IEEE 754, but not ±infty, nor NaN" }},

So there is definitely something fishy going on. Unfortunately I do not know how to fix this quickly.
I think maybe there is an encoding issue when fetching graphql-erlang as a dependency (it definitely happens on a vanilla graphql-erlang-tutorial application).

This issue prevents some clients, and also my server, from rendering introspection queries/results.

Batch loading support

Not an issue per se, I'm just not sure how to do batch loading in graphql-erlang.

I'm trying to use this optimization: dataloader to batch up individual queries into a big one.

Basically, in dataloader, a call to dataloader.load(id) will return a promise. At the "next tick", all ids will be sent in a single batch query to save round trip time and all promises will be resolved.

While deferring is already possible in graphql-erlang, "joining" is not as the concept of a "next tick" doesn't make much sense in Erlang.

Is it possible to register a callback which will be called here: https://github.com/shopgun/graphql-erlang/blob/develop/src/graphql_execute.erl#L912? It can inform the batch loader to start sending.

Alternatively, is there a better way to implement batch loading?

Annotating values in an enum results in an error

Annotating a field of an enum in a schema will result in a crash when evaluating the graphql schema:

How to reproduce:

Add something like this to the schema:

enum Status {
  +description(text: "This will make the parser crash")
  SUCCESS
  FAILURE
}

Reload the graphql schema in an interface like graphiql.

5.2.2 Field Selection merging

The validator for this is not yet implemented (it is somewhat contrived and complex). This issue tracks its implementation.

5.4.2.3.1 Object Spreads In Object Scope

There is a test case for this one, but we are currently skipping the test case. Figure out why.

Enum output coercion: Make sure the enum is valid

Currently, there is a test in the enum SUITE, no_correct_internal_value, which is there to test we correctly handle output rules for enumerated values. We currently don't.

The fix is in the execution layer. When you return from a resolver of enum type, we must check that the returned value indeed matches the valid possible enum values. Otherwise, this is an error. Also, while here, we must lock down what the correct return value for an enum type is in the case where you are using the default resolver. Before this has gone in, we cannot document how enum resolution works.
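
A minimal sketch of the membership check described above follows. The module name, the function name, the error term, and the representation of valid values as a plain list are all assumptions for illustration; how the execution layer actually stores enum values is not specified here.

```erlang
-module(enum_output_sketch).
-export([check_enum/2]).

%% Value: what the resolver returned for a field of enum type.
%% Valid: the list of values the enum type allows.
%% Returns the value unchanged when it is allowed, otherwise an error.
check_enum(Value, Valid) ->
    case lists:member(Value, Valid) of
        true -> {ok, Value};
        false -> {error, {invalid_enum_value, Value}}
    end.
```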

Coerce default values

Currently, default values are not coerced. They should be.

Handle Fragment expansion with variables fully

Background

Suppose we have a fragment

fragment M on Monster {
  ...
  hitPoints(above: $foo)
  ...
}

Then this fragment is valid as a spread in any query Q which defines

query Q($foo : Int) { ... }

But not in a query

query Q($foo: String) { ... }

Also, note that if we have another fragment

fragment R on Room {
    description(language: $foo)
}

Then if language : Locale you are not allowed to mix fragments M and R in the same query.

Other observations

This holds transitively. If M refers to a subfragment I on Inventory and that fragment refers to $bar, then M also refers to $bar.
These tests must happen in the type checking phase.

Implementation

The reason this has been held off for a while is that it isn't that simple to implement:

Introduce the notion of a type signature for a fragment. This allows us to ask if a fragment "fits" when it is called by a fragment spread.
Inline spreads can just produce their signature and we can then check it afterwards.
A fragment is implemented as a function from its signature to its expansion. Thus, the function signature is verified by checking if we can "call" the fragment from the spread with a valid type.
When handling fragments, they can refer to other fragments. When this happens, we must run fragception and DFS the fragment world. We track already handled fragments in order to detect if we have a cycle in the fragment expansion. This runs in linear time over the fragments and is going to be reasonably fast. Also, it moves the cycle validation into the type checker where it belongs.
Value coercion still happens at the execution phase for variables. So this will still work as expected.
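
The depth-first cycle detection mentioned above might be sketched like this. It assumes the fragment references have been collected into a map from fragment name to the names of the fragments it spreads; the module name and the error term are hypothetical. For brevity the current path is kept as a list, so the membership test is not constant time, and a real implementation would likely use a set to stay linear.

```erlang
-module(frag_cycle_sketch).
-export([check/1]).

%% Graph: map from fragment name to the names of fragments it spreads.
%% Returns ok, or {error, {cycle, Name}} for a fragment that reaches itself.
check(Graph) ->
    try
        lists:foldl(fun(Name, Done) -> visit(Name, Graph, [], Done) end,
                    #{}, maps:keys(Graph)),
        ok
    catch
        throw:{cycle, Name} -> {error, {cycle, Name}}
    end.

%% Path holds the fragments currently being expanded; Done holds the
%% fragments whose expansion has already been fully checked.
visit(Name, Graph, Path, Done) ->
    case {lists:member(Name, Path), maps:is_key(Name, Done)} of
        {true, _} -> throw({cycle, Name});
        {false, true} -> Done;
        {false, false} ->
            Deps = maps:get(Name, Graph, []),
            Done1 = lists:foldl(
                      fun(Dep, Acc) -> visit(Dep, Graph, [Name | Path], Acc) end,
                      Done, Deps),
            Done1#{Name => true}
    end.
```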

Make it possible to define the format of null returns

Currently, when a node resolves to no data we return {ok, null}; this is fine for an Erlang and JSON context, but if we want to use another output format, or use graphql-erlang as an interface to another system that requires something else (like {ok, undefined}), we could benefit from making this configurable.

This will have to happen when graphql_execute finalises a value.
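
Until such a setting exists, a caller could rewrite the result after execution. The sketch below is a post-processing workaround rather than the configuration proposed here; the module and function names are hypothetical, and it assumes results are built from maps, lists, and scalar terms.

```erlang
-module(null_format_sketch).
-export([rewrite_null/2]).

%% Replace every occurrence of the atom null in a result term with Null.
rewrite_null(null, Null) -> Null;
%% Recurse into map values; keys are left untouched.
rewrite_null(M, Null) when is_map(M) ->
    maps:map(fun(_K, V) -> rewrite_null(V, Null) end, M);
%% Recurse into list elements.
rewrite_null(L, Null) when is_list(L) ->
    [rewrite_null(X, Null) || X <- L];
%% Binaries, numbers, and other atoms pass through unchanged.
rewrite_null(Other, _Null) -> Other.
```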

Support Query Handles

Rather than supplying a query document each time, support caching a query handle on the server side and let queries refer to this in the future. This speeds up queries as we can skip a good part of the parsing and validation each time.

5.4.2.3.4 Abstract Spreads in Abstract Scope

Handle these cases
