GSOC 2019 Ideas
This is the D Language Foundation's Google Summer of Code 2019 page.
If you are interested in working on a D project as part of GSoC 2019, as either a student or mentor, please contact the GSoC admins at scoial@dlang.org. Include "GSoC 2019" in the email subject line.
Contents
- 1 Timeline
- 2 Ideas
- 2.1 Graphics library for resource constrained embedded systems
- 2.2 Tabular data container (data frames)
- 2.3 C++ interops
- 2.4 Persistent Data Structures
- 2.5 Language Server Protocol for D
- 2.6 Mir Project
- 2.7 DUB: D's package manager
- 2.8 std.benchmark
- 2.9 Jupyter notebook D kernel
- 2.10 Functional Reactive Programming
- 2.11 Java Native Interface (JNI) library
- 2.12 ORM mapping
- 2.13 Propose your own project
- 3 Ideas From Previous Years
- 4 Tips for students
- 5 Tips for Mentors
Timeline
The timeline for GSoC 2019 can be found here.
Ideas
The D ecosystem is always in need of improvement through the revision of existing projects and the creation of new ones. The core projects like the(standard library) and the compilers (Compilers) are always in need of attention. Other important areas include GUI support, integration with other languages, improving and enhancing the tooling, IDE and editor support, and more.
Please add your project ideas to the list below, being as descriptive as you can in the summaries. Students interested in participating can use the ideas listed here as jumping off points, so the more information you provide, the easier it will be for them to get started. If your idea is particularly complex or nuanced, consider leaving contact information so that interested parties may follow up.
Graphics library for resource constrained embedded systems
Create a 2D rasterizer, rich drawing primitives, and 2D graphics library suitable for resource constrained embedded systems (e.g. ARM Cortex-M) to be used in industrial controls, home appliances, medical devices, consumer electronics, and IoT just to name a few. The end goal would be something similar to Segger's emWin. The library would be used to drive LDCs similar to https://www.adafruit.com/product/3396 Requirements:
- Hardware agnostic; should simply render to a frame buffer
- No dependencies (No Phobos, no C standard library, and no official D runtime).
- Consider using -betterC, but a custom minimal D runtime is also a viable option
Related work:
- https://github.com/TurkeyMan/color
- https://bitbucket.org/timosi/minlibd
- https://github.com/JinShil/stm32f42_discovery_demo
Proposed Project Mentors: TBA
Tabular data container (data frames)
Pandas, R and Julia have made data frames very popular. As D is getting more interest from data scientist (e.g. eBay or AdRoll) it would be very beneficial to use one language for the entire data analysis pipeline - especially considering that D (in contrast to popular languages like Python, R or Julia) - is compiled to native machine code and gets optimized by the sophisticated LLVM backend.
Minimum requirements:
- conversion to and from CSV
- multi-indexing
- column binary operations, e.g. `column1 * column2`
- group-by on an arbitrary number of columns
- column/group aggregations
Proposed Project Mentor: Ilya Yaroshenko
C++ interops
There are many experiments in transparently interfacing D with C++, notably:
- dpp by Atila Neves, that parses and translates headers on the fly
- Calypso by Elie Morisse that bridges the two languages at the AST level.
There is also a lot of work in bringing STL types to the DRuntime (see https://github.com/dlang/druntime/pull/2310).
D would greatly benefit by seamlessly integrating the C++ ecosystem. There are many ways to accomplish this goal and it should be discussed with the community. Andrei already commented on the importance of a tool like Calypso in this thread: https://forum.dlang.org/post/m9s4cd$2s1v$1@digitalmars.com
Proposed Project Mentor: Andrei Alexandrescu
Persistent Data Structures
There is an ongoing effort to reduce the reliance of Phobos on the GC but the GC is not going anywhere given that it has many advantages over manual memory management. The community would greatly benefit from a wide range of functional programming techniques that have their roots in the usage of immutable and persistent data structures that heavily rely on the GC. Persistent Data Structures are heavily used in Scala and Clojure and they allow the programmer to reason about interactivity, sharing and parallelism.
The focus of this project would be to create a varying range of persistent data structures (sets, vectors, maps, strings, trees, lists, etc...) that respects the range interface and the semantics of Phobos.
Related Work:
- https://www.cs.cmu.edu/~rwh/theses/okasaki.pdf
- https://www.worldcat.org/title/rrb-trees-efficient-immutable-vectors/oclc/820379112
- https://www.youtube.com/watch?v=7BFF50BHPPo
- https://www.infoq.com/presentations/Value-Values
- https://github.com/arximboldi/immer
Proposed Project Mentor: Andrei Alexandrescu
Language Server Protocol for D
The Language Server Protocol (LSP) defines the protocol used between an editor or IDE and a language server that provides language features like auto complete, go to definition, find all references etc. Adding features like auto complete, go to definition, or documentation on hover for a programming language takes significant effort. Traditionally this work had to be repeated for each development tool, as each tool provides different APIs for implementing the same feature (see the list of Editors and IDEs). A Language Server is meant to provide the language-specific smarts and communicate with development tools over a protocol that enables inter-process communication. The idea behind the Language Server Protocol (LSP) is to standardize the protocol for how such servers and development tools communicate. This way, single D Language Server can be re-used in multiple development tools, which in turn can support multiple languages with minimal effort.
Since summer of 2017 DMD is available as a DUB package and can be used for this challenge.
Proposed Project Mentors: Jacob Carlborg, Sebastian Wilzbach
Mir Project
The Mir project is developing numerical libraries for the upcoming numeric packages of the Dlang standard library. The are numerous projects an interested student could pursue:
- University project. Your GSoC project can be combined with your scientific research.
- ndslice>-<Julia integration
- ndslice>-<NumPy integration
- General purpose BetterC libraries
- I/O betterC Dlang library
- Async betterC Dlang library
- String and format betterC library
- mir-cpuid
- ARM support
- Advanced OS specific support
- mir-glas
- BLAS Level 2 subprograms
- One BLAS Level 3 subprogram. It is OK to do only one! But it must be faster then OpenBLAS, and not slower then Intel MKL.
- Multithreading support for BLAS Level 2
- Multithreading support for BLAS Level 3
You can get more details on projects related to Mir here.
Its Good To Know:
To work on the Mir you should be proficient with one of:
- C
- C++
- LLVM
- Fortran
- Experience with D is not essential.
To work on the Mir project requires a responsible and self-motivated student.
Proposed Project Mentor: Ilya Yaroshenko
DUB: D's package manager
DUB - D's package manager - is one of the central infrastructure tools in the D world as a convenient build and package management tool for the D community. With D gaining more and more popularity, it's also a key tool to guarantee D's further adoption. Thus there are many great ideas to improve DUB:
- separate compilation and caching
- extensibility through DUB
- control flow directives
- build system support for C/C++
- git support
- support dub install for distributions
- support building a {snap, Debian, Arch, ...} package out of DUB package
Proposed Project Mentors: Sönke Ludwig, Sebastian Wilzbach
std.benchmark
Ideally every function in Phobos should have its own benchmark suite which is run for every PR to check for performance improvements or regressions.
- Additionally support for the new big O annotation here would be nice.
- Existing work: here, and here, and here (QuickText-like testing).
Proposed Project Mentors: Sebastian Wilzbach
Jupyter notebook D kernel
The Jupyter Notebook is an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text. Uses include: data cleaning and transformation, numerical simulation, statistical modeling, data visualization, machine learning, and much more. Jupyter notebooks are extremely popular among data scientist as they allow interactive progress and exploring a solution step by step. With a bit of work, a statically compiled language can be used in an interactive notebook e.g. [Cling https://blog.jupyter.org/interactive-workflows-for-c-with-jupyter-fe9b54227d92] and Go with gophernotes. Apart from being useful to data scientist, a Jupyter D kernel would allow an intuitive exploration of the language for newcomers and people starting with new libraries.
Existing works: - drepl - simple REPL for D - PydMagic - allows to write PyD extension in a notebook.
Proposed Project Mentor: Sebastian Wilzbach
Functional Reactive Programming
Functional reactive programming (FRP) is a programming paradigm for reactive programming (asynchronous dataflow programming) using the building blocks of functional programming (e.g. map, reduce, filter). The best-known FRP API is the ReactiveX API whose implementation in D this project would focus. Note that this project is closely related to the std.eventloop project.
Proposed Project Mentors: Petar Kirov
Java Native Interface (JNI) library
D's features allow to write cut down the boilerplate of wrapper drastically. See the recently published excel-d for a comparison between the C++ implementation for registering a function in Excel and the D way. Similarly a solid JNI library would open up new many new use cases and allow the Java community to profit even easier from D's features.
Previous work: DJni, djvm Proposed Project Mentor: TBA
ORM mapping
SQLAlchemy is an object-relational mapper which provides "a full suite of well known enterprise-level persistence patterns, designed for efficient and high-performing database access, adapted into a simple and Pythonic domain language". While there has been some attempts in implementing an ORM in D - none of these got close to the simplicity and usability of SQLAlchemy (see Database Libraries). However, with D's CTFE queries could - like std.format's strings - already be checked at compile-time and optimizations could be be applied to the parser and serializer.
This project would be based on the proposed std.database abstraction and focus on creating a general-purpose ORM on top of it.
See also: Martin Nowak's DConf16 talk about Object-Relational Mapper
Existing work:
Proposed Project Mentor: TBA
Propose your own project
Do you have your own vision for improving D's success? That's great! Are you convinced that your project helps to advance open source technology related to the D programming language? Then simply get in touch with D's GSoC org admin (social (at) dlang (dot) org) and potential mentors for your project.
If you are searching for inspiration, you might want to have a look at the list of D Improvement Proposals (DIPs), the high-level vision for this semester or the wish list.
Ideas From Previous Years
These GSoC idea pages from the past years which might provide more inspiration for proposing your own idea:
- GSoC 2018 Ideas (not accepted)
- GSoC 2017 Ideas (not accepted)
- GSoC 2016 Ideas (accepted, 4 slots)
- GSoC 2015 Ideas (not accepted)
- GSoC 2014 Ideas (not accepted)
- GSoC 2013 Ideas (not accepted)
- GSoC 2012 Ideas (accepted, 3 slots)
- GSoC 2011 Ideas
You may also want to have a look at the Symmetry Autumn of Code ideas page.
Tips for students
Please see the "gsoc" articles at our official blog which contain experience reports from students of previous years.
Daniel Pocock has written a detailed blog about initiatives students can take if they want to have a serious chance of being selected in GSoC without a focus on one specific organization.
Tips for Mentors
If you are interested in mentoring, please check out the organization administrator and mentor manual for more information.