Who am I?

I'm Martin. I'm a software engineer located Taiwan. Specialized at building high performanmce sofware, AI deployment and messing with Linux. Now getting to OpenBSD and privacy releated software development. Here's a few technology I use daily:

Do send me a message if you want to know more about me or have any questions. I'm always happy to talk about anything. I'm avaliable over email (marty1885 \at protonmail) or GNU Jami (Link to my QR code). If it's something serious, please encrypt your message using my PGP key 76D1193D93E96444 or send it over Jami which is P2P and encrypted by default.

I'm open to contracted work. Do shoot me an email/Jami if I can be of any help. If you are interested in hiring me, here's my resume.

Author's profile. Photo taken in VRChat by my friend Tast+
Martin Chang
Systems software, HPC, GPGPU and AI. I mostly write stupid C++ code. Sometimes does AI research. Chronic VRChat addict

I run TLGS, a major search engine on Gemini. Used by Buran by default.

  • marty1885 \at protonmail.com
  • Matrix: @clehaxze:matrix.clehaxze.tw
  • Jami: a72b62ac04a958ca57739247aa1ed4fe0d11d2df

Work experience

Lumina Industries, Inc.

Director of Product Engineering[2022-10 ~ PRESENT]
Official title. But gained simply so I can sign product pacakges on the company's behalf. I'm still a software engineer, job doesn't change.

Native Application Lead[2022-01 ~ 2022-10]
Archicture, develop and coordinate the development of the company's native application. Including image pipeline, UI and AI systems.

C++ Developer[2021-09 ~ 2022-01]
Develop the company's application and algorithms. Introduced the team to modern C++ standards and tools like Address Sanitizer and FBInfer.

National Sun-Yet-Sen University

Research Assistant[2016-12 ~ 2021-06]
Research and co-work in the development machine learning accelerators. Compilers for model execution. And various other research projects.

Major Open Source Projects



Drogon is an production ready, extremely fast C++ web application framework. I'm a maintainer, wrote the coroutine subsystem, various bug fixes and joins community development. I have developed commerical product using it. Drogon easily handles C10K on an embedded system and C100K on a standard server.


Etaler is a high performance implementation of Hierarchical Temporal Memory, a biologically inspired machine learning model developed by Numenta. At the time of release, Etaler is more than 20x faster than the community developed HTM.core on CPU and another 2x faster on a GPU.


embree-arm is the world's first functional port on Intel's Embree ray tracing kernels for ARM processors. Though not maintained. This project inspired the embree-aarch64 project (using the same porting approach) to provide up-to-date ray tracing kernels for ARM.

Conference talks


I love games but why movies look so much better - An introduction to computer graphics and ray tracing.


Introduction to GPU computing with OpenCL - video


Building your own NumPy! Implementing NDArrays from scratch.


How to port/hack Arch Linux onto ARM devices.


High Performance Software

I build high performance software like ML libraries and search engines in C++. I've deployed a search engine written in highly optimized C++ commerically in a scalable infrastructure. I also developed TLGS, a search engine for the Gemini protocol (proxy).

Arch Linux

I'm a passionate Linux user and being on the bleeding edge is important for me. I use Arch Linux on my laptop, PC, workstations, servers and embedded systems. I've been using Linux since 2013 and knows a bit about Linux internal and managment.


ROOT is a high performanmce data analysis framework in C++. I have carried out Big Data analysis that usually required a cluster using it on a single server. I also integrated ROOT into IC development flow to analyze performance in real time.

Macline Learning/AI

Besided knowing how to use common ML libraries like sklearn and PyTorch. I'm an active member of the Numenta community. Helping the development and research of HTM theory.


Besides writing code for the CPU. OpenCL allows me to run faster using GPUs or even FPGAs. I also have experience using AMD's HIP platform to write portable C++ kernels.