Chiroptical’s Blog

Getting started with Erlang’s `maybe_expr`

2024-03-04T00:00:00+00:00

Assumptions

You are using rebar3 to build your project. You are using OTP 25.

Introduction

I am an Erlang beginner and I am currently building a Slack bot to learn more. Here is some code I wrote recently,

{ok, ChannelId} = map_utils:recursive_get([<<"channel">>, <<"id">>], Payload),
{ok, UserId} = map_utils:recursive_get([<<"user">>, <<"id">>], Payload),
{ok, TeamId} = map_utils:recursive_get([<<"team">>, <<"id">>], Payload),
{ok, ResponseUrl} = map_utils:recursive_get([<<"response_url">>], Payload),

The Payload here is a decoded JSON body from Slack. The map_utils:recursive_get/2 function takes the path to a JSON entry and extracts it if possible, given this JSON

{
  "channel": {
    "id": "value"
  }
}

If we ran this JSON through my HTTP handlers, this code would succeed,

{ok, <<"value">>} = map_utils:recursive_get([<<"channel">>, <<"id">>]),
{error, not_found} = map_utils:recursive_get([<<"hello">>, <<"world">>]),

When the ChannelId, UserId, etc are all extracted from the Payload properly everything is great. However, if any of the pattern matches fails everything seems to get dropped into the void. This is obviously problematic when you are building an application. Thankfully, I discovered maybe_expr!

With maybe_expr, the code will look more like this,

-record(interact_payload, {channel_id, user_id, team_id, response_url})

% ...

maybe
  {ok, ChannelId} ?= map_utils:recursive_get([<<"channel">>, <<"id">>], Payload),
  {ok, UserId} ?= map_utils:recursive_get([<<"user">>, <<"id">>], Payload),
  {ok, TeamId} ?= map_utils:recursive_get([<<"team">>, <<"id">>], Payload),
  {ok, ResponseUrl} ?= map_utils:recursive_get([<<"response_url">>], Payload)
  {ok, #interact_payload{channel_id = ChannelId, user_id = UserId, team_id = TeamId, response_url = ResponseUrl}}
else
  {error, not_found} ->
    logger:error(...),
    {error, not_found};
  {error, not_found, Reason} ->
    logger:error(...),
    {error, not_found}
end,

Instead of dropping anything into the void, the else clause can be used to pattern match out any failure cases. Here, we match {error, not_found} and {error, not_found, Reason} and log that we had an unexpected error.

This feature is currently “experimental” in OTP 25. However, it is becoming standardized over the next few OTP releases. See this highlight for more information.

Setting up rebar3

Credit to this forum entry for the details. With OTP 25, first create the file config/vm.args if it doesn’t exist and add,

-enable-feature maybe_expr

Then set this in your environment,

export ERL_FLAGS="-args_file config/vm.args"

Or, prepend ERL_FLAGS="-args_file config/vm.args" to your rebar3 commands. Reminder, you can skip this step in OTP 26.

Enabling the feature

In your Erlang files you only need to add (after your module definition),

-module(...).
-feature(maybe_expr, enable).

Done. Supposedly in OTP 27 you won’t need to do either of these steps!

More information

You can read the Erlang Enhancement Process proposal here.

If you’ve ever written Haskell before this is essentially ExceptT e IO a where the e can be literally anything. It is your job in Erlang to catch all the cases. You can add type checking to your Erlang code with something like eqWAlizer and type annotations via dialyzer. I was first exposed to type annotations in LYAH’s dialyzer introduction.

If there is anything you are curious about in Erlang, please ask me about it on BlueSky or the Erlanger’s Slack. I would like to write more blog posts and learn more about Erlang.

First day with typst, a markup based typesetting system

2023-03-31T00:00:00+00:00

I came across typst recently which looks like an interesting replacement to LaTeX. I don’t really do much collaborative editing anymore, but I really enjoy plain text presentations. I tried pollen as well, but I didn’t like the unicode symbols. What was my first presentation like using typst?

typst is available on the unstable nix channel and you can likely get it with nix-shell -p typst or follow the instructions on their github.

One annoying thing about LaTeX is you have to compile a bunch of times for your PDF to be correct. With typst, you can typst -w document.typ and it will watch the document for changes and recompile automatically. This is a really nice productivity boost.

Setting up the presentation,

#set page(                                                                 
  paper: "presentation-16-9",    
  margin: (    
    rest: 25pt    
  )    
)   

Here, we are setting parameters for the page. # denotes a “code expression”. I believe I could also build my own template to define margins, spacing, font, etc. in a separate file.

Next, the font,

#set text(    
  font: "JetBrains Mono",    
  size: 22pt    
)

This syntax is a bit odd, but it is syntax and LaTeX isn’t necessarily nicer in any way. You can lay out a slide like so,

= The slides title

// The slides content

#pagebreak()

The = denotes a header, you can generate smaller headers with additional =, i.e. ===. The // denotes a comment, most of what you need is from markdown, see the syntax guide. Interestingly, you can make this into a named function,

let slide(title, content) = [
  = #title
  #content
  #pagebreak()
]

The function syntax is a bit weird to me, but I also don’t fully understand the type system yet. As an example, here is another function,

  #let fig(location, width, gap, caption) = [
    #figure(
      image(location, width: width),
      numbering: none,
      gap: gap,
      caption: caption
    )
  ]

Note the difference in how I refer to the parameters in the body. I think the former #x are inserting “content blocks” and the latter are plain values and don’t require the #. Not exactly sure yet.

From here, you could generate a slide with a figure pretty easy.

#slide(
  [The slides title],
  [
    - Some unordered list item
    - Some other unordered list item
    fig(
      "figures/image.png", 
      350pt, // the width of the image, see function definition
      -2pt, // the captions are a bit far away from the images by default
      [ The caption for the figure. ]
    )
  ]
)

From here, you can build a basic presentation! Pretty cool.

I also wrote two other functions for links:

#let l(location) = link(location)[#text(blue)[#location]]
#let ld(location, description) = link(location)[#text(blue)[#description]]

I did try a two column #grid but the alignment was a bit wonky. I would like to spend a bit more time handling columnar layouts before attempting to show some code. Let me know what you think on Twitter

Discarding monadic results in Haskell

2021-09-01T00:00:00+00:00

Discarding Monadic Results in Haskell

I recently ran this poll on Twitter. The original poll and results,

I wasn’t expecting much from this poll but the comments turned out to be fantastic! Let’s summarize the problem, options, and discuss them a bit. The focus of the discussion will be if I would use it in my personal project. It isn’t a suggestion.

Motivation

module Main where    
    
f :: IO Int    
f = pure 1    
    
main :: IO ()    
main = do    
  -- business...    
  f    
  -- more business...    
  pure ()

The following warning is generated when you compile this with -Wall,

src/Main.hs:9:3: warning: [-Wunused-do-bind]
    A do-notation statement discarded a result of type ‘Int’
    Suppress this warning by saying ‘_ <- f’
  |
9 |   f
  |   ^

Typically, I also use -Werror and therefore the warning becomes an error. What are our options in this case?

Options

We need to discard the result of f. Here are all the suggested solutions (attributed to the first suggester),

Disable the warning -Wno-unused-do-bind @TechnoEmpress
void f (in the poll)
() <$ f @alex_pir
_ <- f (in the poll)
_ ← f @toastal
_descriptiveName <- f @MxLambda
(_ :: ResultType) <- f , in this case Int @vincenthz

Breakdown

Let’s look at a few of the options a bit.

void

void :: Functor f => f a -> f ()

I have always used this in my personal projects. It gets the job done, but isn’t particularly satisfying (hence the poll). The win here is that it only requires a Functor constraint and can be used beyond do notation. I wonder if void would be more compelling if it was named differently? Maybe discard or ignore?

The const equivalent

(<$) :: Functor f => a -> f b -> f a
(<$) = fmap . const

This is const lifted into a functorial context. It is more flexible than void and useful for the same reasons. It is provided, for free, by the Functor typeclass and is one I often forget about. That being said, I don’t feel particularly compelled to start using () <$ ... over void.

Underscores

The options,

_ <- f
(_ :: Int) <- f
_descriptiveName <- f

The first line is saying “match something, but I don’t care what”. This is equivalent to void but preferred by more respondents. There is one exception to this preference (expressed in the responses as well),

do
  -- business...
  _ <- finalMonadicComputation
  pure ()

I personally think this should be void in almost all cases. The latter two lines are much more interesting to consider and make context even more important. Specifying the underscore’s type, i.e. _ :: Int, does add some additional type safety if the monadic computation changes. However, in most cases, changing the monadic computation would at least point me to the underscore (thanks GHC) so I can reconsider my choices. Adding a descriptive name is never a bad thing, but sometimes it is difficult to come up with a good name or the function names are clear enough. I think both of these are interesting and I will probably use some variation of them in the future.

Bonus: with ScopedTypeVariables you can remove the parentheses.

Unicode

Honestly, I don’t even know how to enter a unicode arrow on my keyboard. Cool suggestion nonetheless.

Disable the warning

Here I am appeasing the compiler for -Wall -Werror and Hécate is playing an entirely different game. I think this is interesting and I might try it out in my personal projects. However, you do lose a signal that the monadic computation returns something. In Haskell, we often use descriptiveFunctionName_ to indicate that a function returns () and if you follow that convention you could use that as a signal. Do I really need this signal? I am not so sure anymore.

Wrapping up

This poll generated a surprising response. The results were both fun, interesting, and will hopefully make me think more carefully about context. I hope you enjoyed it as much as I did.

Find typos or have suggestions? My DMs are always open @chiroptical.

Like the content? Follow me on Twitch and subscribe on Youtube

Simple Scaleable Preprocessing with PyTorch and Ray - 0

2020-05-20T00:00:00+00:00

Simple Scaleable Preprocessing With Pytorch and Ray

Background

I have been using PyTorch for a few months now and I really like the Dataset and DataLoader workflow (see torch.utils.data). I realized I might be able to use this workflow for every step in my Machine Learning pipeline, i.e. preprocessing, training, and inference. I further realized I could use Ray to coordinate multi-node parallelism with little changes to my original code.

Escape Hatch: if you would rather explore the code with no explanation there is a Jupyter Notebook on Github

I believe most folks are using Dataset/DataLoader to handle training and inference pipelines but let’s consider a more general preprocessing workflow. A data scientist needs to write a function which processes their entire data set, the function has the approximate signature:

InputFile -> (OutputFiles, Metadata)

Here, InputFile is an input file in your dataset. The function may produce one, or more, OutputFiles and some Metadata related to the operation performed. As a practical example, I often have to split large audio files into multiple audio files of a fixed size and retain some metadata (source audio, destination audio, labels).

In this blog post, I’ll discuss how to get PyTorch’s DataSet and DataLoader workflow running in parallel for this general use case. I will also go over some of the mistakes I made while first exploring this workflow. I will assume the reader knows basic Python.

Why should you care?

I believe this workflow is really easy to teach to beginners. A user only needs to know how to write a function to process an input file and the relationship between batches and parallelism. With the exception of the collate_fn (explained later) the code is essentially boilerplate. If you can implement a Dataset the parallelism comes almost for free which is a massive win for beginners.

Up and Running

I am going to build an example data set which mimics the audio splitting example I introduced. I will have a dataset.csv file which contains the following:

input
a.txt
b.txt
c.txt
d.txt

Each TXT file will contain a word (simple, scaleable, preprocessing, and pytorch respectively). The files will be located in an inputs/ directory. The goal is to split each word into parts of a certain number of characters and overlap, e.g.

a = "hello"
b = split_word(a, num_chars=2, overlap=1)
assert b == ["he", "el", "ll", "lo"]
c = split_word(a, num_chars=3, overlap=2)
assert c == ["hel", "ell", "llo"]

We can build a Dataset which performs this action on all of the input files. First, let’s generate a list of input files. I’ll use the built-in CSV library:

import csv

with open("dataset.csv", "r") as csv_file:
    reader = csv.DictReader(csv_file)
    input_files = [f"inputs/{row['input']}" for row in reader]

assert input_files == ["inputs/a.txt", "inputs/b.txt", "inputs/c.txt", "inputs/d.txt"]

To use Dataset, you’ll need PyTorch (e.g. pip3 install torch==1.5.0)

from torch.utils.data import Dataset

class WordSplitter(Dataset):
    def __init__(self, inputs, num_chars=2, overlap=1):
        self.inputs = inputs
        self.num_chars = num_chars
        self.overlap = overlap
        
    def __len__(self):
        return len(self.inputs)
    
    def __getitem__(self, idx):
        filename = self.inputs[idx]
        
        with open(filename, "r") as f:
            word = f.read().strip()
        
        return split_word(
            word,
            num_chars=self.num_chars,
            overlap=self.overlap
        )

For the Dataset to work, we need to define 3 “dunder” methods __init__, __len__, and __getitem. The __init__ function stores the input files and parameters needed to run split_word. The __len__ function returns the length of input_files. The __getitem__ function is where the computation happens. First, we extract the file at the given index. Second, we read the word from the file and remove any whitespace sorrounding the word. Finally, we feed our word to split_word with the appropriate parameters. Let’s see if it works:

word_splitter = WordSplitter(input_files, num_chars=3, overlap=2)
assert word_splitter[0] == ['sim', 'imp', 'mpl', 'ple']

Awesome. It is really important to make sure your Dataset works before moving on to the next steps. Remember our signature from before:

InputFile -> (OutputFiles, Metadata)

Think of the __getitem__ method in WordSplitter as inputting an InputFile, not writing any OutputFiles, and producing Metadata related to the operation. In the realistic audio splitting example the OutputFiles could be written to an outputs/ directory. We can now wrap this into a DataLoader and run our analysis in parallel!

from torch.utils.data import DataLoader

loader = DataLoader(
    word_splitter,
    batch_size=1,
    shuffle=False,
    num_workers=len(word_splitter),
)

The DataLoader bundles our work into batches to be operated on. The DataLoader takes in the word_splitter Dataset object we initialized previously. When we set batch_size=1, the loader will split our work into 4 total batches where each batch contains 1 file (batch_size=2 means 2 batches each with 2 files). With 4 batches it is possible to split the work over 4 cores on our machine by setting num_workers=len(word_splitter). Important: with batch_size=4 there is only 1 batch to process and therefore no parallelism can be extracted (i.e. setting num_workers will have no effect). The shuffle=False argument asks the loader to process inputs in order (the default). The loader object behaves like other iterators, i.e. we can print the results in a for loop:

for metadata in loader:
    print(metadata)

Let’s look at the output:

[('sim',), ('imp',), ('mpl',), ('ple',)]
[('sca',), ('cal',), ('ale',), ('lea',), ('eab',), ('abl',), ('ble',)]
[('pre',), ('rep',), ('epr',), ('pro',), ('roc',), ('oce',), ('ces',), ('ess',), ('ssi',), ('sin',), ('ing',)]
[('pyt',), ('yto',), ('tor',), ('orc',), ('rch',)]

Hmm… Something looks weird, each string is embedded in a tuple. The issue is PyTorch uses a collation function which is designed for their Tensor type. It doesn’t work great in this case. Luckily, we can define our own to fix this! In the following code I will use ... to represent code shown above. First, we need to figure out what the input to collate_fn even looks like. Add the collate_fn to WordSplitter

 class WordSplitter(Dataset):
 	...
    
    @classmethod
    def collate_fn(*batch):
        print(f"BATCH: {batch}")
        return []

The @classmethod decorator allows us to call WordSplitter.collate_fn (you’ll see it in a moment). I use *batch to tuple up all of the inputs if the arity is greater than one. The collate_fn isn’t complete but this allows us to inspect our inputs to the function. Second, we add our new function to the DataLoader:

loader = DataLoader(
	...,
    collate_fn=WordSplitter.collate_fn,
)

Note, you don’t want to run this test over your entire data set. I would suggest doing this on a small subset of inputs. If we loop over the loader again,

BATCH: (, [['sim', 'imp', 'mpl', 'ple']])
BATCH: (, [['sca', 'cal', 'ale', 'lea', 'eab', 'abl', 'ble']])
BATCH: (, [['pre', 'rep', 'epr', 'pro', 'roc', 'oce', 'ces', 'ess', 'ssi', 'sin', 'ing']])
BATCH: (, [['pyt', 'yto', 'tor', 'orc', 'rch']])
[]
[]
[]
[]

Let’s modify batch_size=2 in the loader and see what happens when there is actual batching,

BATCH: (, [['sim', 'imp', 'mpl', 'ple'], ['sca', 'cal', 'ale', 'lea', 'eab', 'abl', 'ble']])
BATCH: (, [['pre', 'rep', 'epr', 'pro', 'roc', 'oce', 'ces', 'ess', 'ssi', 'sin', 'ing'], ['pyt', 'yto', 'tor', 'orc', 'rch']])
[]
[]

Okay, so PyTorch returns something like (DatasetObject, [metadata0, metadata1, ...]). All we need to do is extract the list of metadata from the tuple and return it, i.e.

@classmethod
def collate_fn(*batch):
    return batch[1]

In the for loop we need to additionally loop over the returned list of metadata, i.e.

for metadatas in loader:
    for metadata in metadatas:
        print(metadata)

Result with batch_size=1,

['sim', 'imp', 'mpl', 'ple']
['sca', 'cal', 'ale', 'lea', 'eab', 'abl', 'ble']
['pre', 'rep', 'epr', 'pro', 'roc', 'oce', 'ces', 'ess', 'ssi', 'sin', 'ing']
['pyt', 'yto', 'tor', 'orc', 'rch']

With batch_size=2,

['sim', 'imp', 'mpl', 'ple']
['sca', 'cal', 'ale', 'lea', 'eab', 'abl', 'ble']
['pre', 'rep', 'epr', 'pro', 'roc', 'oce', 'ces', 'ess', 'ssi', 'sin', 'ing']
['pyt', 'yto', 'tor', 'orc', 'rch']

With batch_size=4,

['sim', 'imp', 'mpl', 'ple']
['sca', 'cal', 'ale', 'lea', 'eab', 'abl', 'ble']
['pre', 'rep', 'epr', 'pro', 'roc', 'oce', 'ces', 'ess', 'ssi', 'sin', 'ing']
['pyt', 'yto', 'tor', 'orc', 'rch']

Heck yes, this is exactly what we want! You could easily write this metadata somewhere for further use. The key thing to remember here is that the parallelism happens over batches, in this case the maximum possible cores used with varying batch sizes:

`batch_size`	cores
1	4
2	2
4	1

The full code is available in a Jupyter Notebook on Github. This concludes part 0. Next time we’ll look into Ray and let it coordinate the Dataset/DataLoader workflow over multiple nodes!

If you have any suggestions or improvements please message me on Twitter @chiroptical or submit an issue on Github.

Edits

05/20/2020: Use snake-case over camel-case for wordSplitter

Path to Beginnery in Functional Programming with Haskell - 1

2018-10-18T00:00:00+00:00

Path To Beginnery in Functional Programming with Haskell

See the first post in this series for an introduction to this series. Quick recap: I am trying to track my path completing Haskell programming projects from books I am reading. Feel free to message me on Twitter @chiroptical with any corrections or suggestions on new topics.

Project 1

This is a short problem, but I was getting stuck on a foldr implementation. I wanted to write down the problem, reductions, correct solution, and some alternate implementations to increase my understanding.

Definition of Problem

Implement myMaximumBy using a fold. myMaximumBy takes a comparison function, of type (a -> a -> Ordering), and returns the greatest element of the list based on the last value in the list which returned GT. Some examples:

Prelude> myMaximumBy (\_ _ -> GT) [1..10]
1
Prelude> myMaximumBy (\_ _ -> LT) [1..10]
10
Prelude> myMaximumBy compare [1..10]
10

Solving The Problem

The base case, or accumulator, is simply the first value in the list. My initial thought is that given an empty list our function should return an error. Side note: after additional thought I decided to implement a version which returns Maybe a, but I will show that in the Practical Considerations section. If given a list with one element, simply return that element. Next we need to define our folding function (for a foldr),

folder :: (a -> a -> Ordering) -> a -> a -> a
folder f x acc = if f x acc == GT then x else acc

and the full foldr with pattern matches for empty and single-item lists,

myMaximumBy :: (a -> a -> Ordering) -> [a] -> a
myMaximumBy _ [] = error "Cannot myMaximumBy on empty list!"
myMaximumBy _ [x] = x
myMaximumBy f xs = foldr (folder f) (head xs) $ tail xs

For a novice, this might look like working code as it will type check! However, it doesn’t work correctly. A foldr breaks down like this for [a] with 3 items:

foldr g acc [a, a', a'']
-- ==
-- g a (g a' (g a'' acc))

Let’s take the example where,

-- Omitting types
g = folder f
f = \_ _ -> GT

-- Reduction (g x x' = x, always!)
-- 1. g a (g a' a'')
-- 2. g a a'
-- 3. a

Which is not what we are looking for! We actually want to return a''. To do that, we need foldl,

folder :: (a -> a -> Ordering) -> a -> a -> a
folder f acc x = if f acc x == GT then acc else x

myMaximumBy :: (a -> a -> Ordering) -> [a] -> a
myMaximumBy _ [] = error "Cannot myMaximumBy on empty list!"
myMaximumBy _ [x] = x
myMaximumBy f xs = foldl (folder f) (head xs) $ tail xs

-- Reduction
-- 1. f (f a a') a''
-- 2. f a' a''
-- 3. a''

Practical Considerations

Implement `... -> Maybe a` Version

Let’s remove the version of myMaximumBy which errors out by returning Nothing when given an empty list and a Maybe a otherwise.

folder :: (a -> a -> Ordering) -> Maybe a -> a -> Maybe a
folder f (Just acc) x = if f acc x == GT then (Just acc) else (Just x)
folder _ _ _ = Nothing

myMaximumBy :: (a -> a -> Ordering) -> [a] -> Maybe a
myMaximumBy _ [] = Nothing
myMaximumBy _ [x] = Just x
myMaximumBy f xs = foldl (folder f) (Just $ head xs) $ tail xs

I don’t think folder f _ x pattern in necessary, but it definitely doesn’t hurt.

Implement `myMinimumBy`

For myMinimumBy you simply replace GT in folder with LT. With a little abstraction, you can write both in a nice point-free style.

folder :: Ordering -> (a -> a -> Ordering) -> Maybe a -> a -> Maybe a
folder o f (Just acc) x = if f acc x == o then (Just acc) else (Just x)
folder _ _ _ _ = Nothing

myOrderingBy :: Ordering -> (a -> a -> Ordering) -> [a] -> Maybe a
myOrderingBy _ _ [] = Nothing
myOrderingBy _ _ [x] = Just x
myOrderingBy o f as = foldl (folder o f) (Just $ head as) $ tail as

myMaximumBy :: (a -> a -> Ordering) -> [a] -> Maybe a
myMaximumBy = myOrderingBy GT

myMinimumBy :: (a -> a -> Ordering) -> [a] -> Maybe a
myMinimumBy = myOrderingBy LT

Wrapping Up

This wasn’t a particularly difficult problem or solution, but it was one of the first cases where my code looked correct, type-checked, and failed. It is really important to understand the difference between foldr and foldl. I am starting to really enjoy point-free style in Haskell. When understood, it is terse and beautiful.

Edits made on 10/18/18 cleaning up patterns with unneccesary named parameters. Replace (x:[]) with [x].

C++ Recursive Template Metaprogramming: Fibonacci Numbers

2018-07-01T00:00:00+00:00

Background

After a brief dive into Scala, I am back to writing C++. However, I do have a much better appreciation for functional programming and recursion. I am far from an expert at either, but I am interested in increasing my programming skills. I decided to revive my blog and try to post things I find fun or interesting. I am currently reading “Effective Modern C++” by Scott Meyers and continually come across Metaprogramming online. I was poking around Stack Overflow and I found this post which asks about tail recursion in Template Metaprogramming (TMP). I thought this was interesting and decided to see if I could write the naive recursive Fibonacci number generator using TMP.

I had already written this in Scala, which looks like:

import scala.annotation.tailrec
def fib(n: Int): Int = {
    @tailrec
    def loop(iter: Int, prev: Int, next: Int): Int = {
        if (iter >= n) prev
        else loop(iter + 1, next, prev + next)
    }
    loop(0, 0, 1)
}
fib(10)

However, fib(10) will execute at runtime and the Java Virtual Machine occurs additional runtime overhead each time you run the program. A neat benefit of TMP in C++ is the compiler can compute fib(10) and then each invocation of the program is as simple as printing an integer. My first implementation in C++, looked like:

#include 
#include 

namespace impl {

    template<int64_t n, bool isPositive>
    struct fib_impl {
        static constexpr int64_t val = fib_impl<n - 1, isPositive>::val + fib_impl<n - 2, isPositive>::val;
    };

    template<>
    struct fib_impl<1, true> {
        static constexpr int64_t val = 1;
    };

    template<>
    struct fib_impl<0, true> {
        static constexpr int64_t val = 0;
    };

    // If calling fib<-1>::val it will try to do the recursion infinitely
    // -> this template short circuits that recursion
    template<int64_t n>
    struct fib_impl<n, false> {
        static constexpr int64_t val = -1;
    };

} // namespace impl

template<int64_t n>
struct fib {
    static_assert(n >= 0, "Error: fib can't be called with a negative integer");
    static constexpr int64_t val = impl::fib_impl<n, (n >= 0)>::val;
};

int main() {
//    static_assert(fib<-1>::val); // This will fail.
//    static_assert(fib<10>::val == 55); // Make sure it works at compile time!
    std::cout << fib<91>::val << '\n';
    return 0;
}

I want the interface of fib to accept only a positive integer, therefore we abstract away whether, or not, the integer is positive with impl::fib_impl. In this implementation, you need 3 template specializations. Two are the termination conditions: 0 and 1; the other provides protection from an infinite recursion when you give a negative number to fib. Even though you get an error from the static_assert(fib<-1>::val), the compiler still tries to create infinite templates. Luckily, your compiler will protect you from creating literally infinite templates (GCC 7.2.1 allowed 900 to be generated, use -ftemplate-depth= to change it). This implementation isn’t tail recursive because the recursion isn’t in the tail position. The recursive call,

fib_impl<n - 1, isPositive>::val + fib_impl<n - 2, isPositive>::val

is shaped like recursive_template(...) + recursive_template(...), but must look like: recursive_template(...) to be tail recursive. You can verify this by modifying the Scala code. In C++, I believe the only way to find out if tail recursion is actually applied is looking at the assembly for loops. Unfortunately, this is done at compile time and you can’t review the compile time assembly (to my knowledge). The tail recursive implementation is:

#include 
#include 

namespace impl {

    template <int64_t n, int64_t prev, int64_t next, bool isPositive>
    struct fib_impl {
        static constexpr int64_t val = fib_impl<n - 1, next, prev + next, isPositive>::val;
    };

    template <int64_t prev, int64_t next>
    struct fib_impl<0, prev, next, true> {
        static constexpr int64_t val = prev;
    };

    template <int64_t n, int64_t prev, int64_t next>
    struct fib_impl<n, prev, next, false> {
        static constexpr int64_t val = -1;
    };

} // namespace impl


template <int64_t n>
struct fib {
    static_assert(n >= 0, "Error: fib can't be called with negative numbers!");
    static constexpr int64_t val = impl::fib_impl<n, 0, 1, (n >= 0)>::val;
};

int main() {
//    static_assert(fib<-1>::val); // This will fail.
//    static_assert(fib<10>::val == 55); // Make sure it works at compile time
    std::cout << fib<91>::val << '\n';
    return 0;
}

Great, now the recursive call is in the tail position. Additionally, we only need 2 template specializations. The one where n = 0 and the infinite template recursion protection for negative integers. I compiled both versions with GCC 7.2.1 using the C++11 standard (which is necessary for constexpr) 10 times and measured the average compile time. It was essentially the same (about 0.2s). The tail recursive version has a major downside though: it overflows a int64_t faster than the non-tail recursive version. The largest value of n for the non-tail recursive version was 92 and for the tail recursive version was 91. The reason for this is because the template recursion for fib<92>::val contains a prev + next which would contain a value to large to fit in int64_t.

This code was an academic exercise, but I think it is neat. This is my first experience with TMP and I am very interested to learn more. Feel free to message me, or follow me, on Twitter with constructive criticism or for future blog posts.

Path to Beginnery in Functional Programming with Haskell

2018-06-16T00:00:00+00:00

Path to Beginnery in Functional Programming with Haskell

Introduction to the Series

I have done a little functional programming in Scala. I tend to write short scripts to help myself and users on the University of Pittsburgh’s compute resources. Because of the JVM startup and length of my scripts it just made more sense to use Python. I want to embrace FP completely and decided something like Haskell would fit my needs better. Additionally, I did a PhD in Theoretical Chemistry so I am not averse to math and theory. In fact, I would like to continue working with math and theory without writing Fortran. So, I recently started working through the Learn You a Haskell book.

In this blog series, I want to document my path to “Beginnery” in Haskell. I have no formal training with Computer Science and I wouldn’t call this series educational. If it helps anyone that would be a great bonus. I just want to be able to look back and track my own progress. I don’t really want to accept comments on this website, please direct message me on Twitter @chiroptical. and I will try to fix any mistakes, accept feedback for my crappy code, or potentially accept new problems.

Here’s the format I want to follow: Take a project from somewhere and try to solve the problem without reading the solution. Then, attempt to add some additional functionality to the project. Finally, read the solution and hopefully learn something.

Project 0

Definition of the Problem

Let’s consider two paths from a starting point to an end point. We’ll label the paths A and B. Along the paths there are additional paths which connect A to B. Here’s a pictoral representation:

Path A would be along the top and B along the bottom. We want to determine the shortest path from Start to End.

Breaking up the Problem

I will break the problem up into sections which look like:

Where $ a_1 $ is the length along path A, $ b_1 $ is the length along path B, $ c_1 $ is the length along the cross section of the paths, and the $ \bullet $ represents the current path. The knowledge of the current path is helpful because there are two potential combinations of sections:

To determine the next shortest path you would evaluate:

starting on path A: take minimum of $ a_2 $ and $ c_1 + b_2 $
starting on path B: take minimum of $ b_2 $ and $ c_1 + a_2 $

Solving the Problem

This is the layout of a folding problem. If you are unfamiliar with folds the basic idea is that we will layout our sections into a list and define a function which will combine them one by one into a new data structure until we reach the End. The definition of our data structures:

type RoadSec = (Int, Int, Int)
type Path = (Int, String, Int)

A RoadSec is a tuple of $ (a_i, b_i, c_i) $. The Path is the data structure we will use to combine on. The first element is simply the distance travelled (m for minimum distance). The second element is the actual path taken (p for path), e.g. abaa means:

Start along A
Take the next cross section to B
Take the next cross section to A
At the next cross section, continue on A

If you look at the 2 possible combinations of sections above, you will notice we don’t use $ c_2 $! The final element in the Path tuple is $ c_1 $ for the next section combination.

To do a fold, we first need to define the base case. The base case is what you want to combine on, in our case the initial Path. To construct it, take the first RoadSec $ (a_0, b_0, c_0) $ and construct the initial Path: (minimum distance of $ a_0 $ and $ b_0 $, the path to take "a" or "b", and the cross section for the next combination $ c_0 $).

Next, you have to define a function to combine the initial Path with a RoadSec to make the next Path:

foldRoadSecs :: Path -> RoadSec -> Path
foldRoadSecs (m, p, c1) (a2, b2, c2)
  | head p == 'a' = (m + min a2 crossB, takeWhich a2 crossB ++ p, c2)
  | otherwise = (m + min b2 crossA, takeWhich crossA b2 ++ p, c2)
  where
    crossB = c1 + b2
    crossA = c1 + a2

Reminder: m is the overall minimum distance, p is the overall path taken, c1 is the cross section from the previous combination, (a2, b2, c2) is the next RoadSec. If we are on path "a", we want to take the minimum of continuing on path A or crossing over (i.e. crossB). Starting on path "b" is obviously similar and the only other option, hence otherwise. To figure out what path we are on, we take the first entry in p. It turns out prepending to a list is faster than appending in Haskell (i.e. p is stored backwards). I have read this observation about Haskell in a few different places and I trust that in most cases it is true. However, never forget: “premature optimization is the root of all evil” and I should measure to be sure. The function takeWhich generates the string to add to the path and c2 is used as c1 in the next combination. Now, assuming we have a list of RoadSec we can simply do a foldl:

minPath :: [RoadSec] -> Path
minPath ((a, b, c): xs) = foldl foldRoadSecs (min a b, takeWhich a b, c) xs

Notice how I separate the base case from the rest of the list. Also, foldl takes a function B -> A -> B (e.g. foldRoadSecs), a B (Path), and a [A] (list of A, RoadSec) and generates a B (Path). Now, we can take the first element of the resulting Path to get the minimum distance travelled, reverse the second element to get the path you should travel, and the third element is not useful in this case.

Practical Considerations

Data Layout

I changed the layout of the data compared to the book because I thought it was more readable. Our program will read a CSV file with exactly 3 columns, i.e.:

[ a_0,b_0,c_0
a_1,b_1,c_1
…
a_n,b_n,c_n
]

Each line represents a section and there are n sections.

Reading the Data

I decided that the program should take the CSV as STDIN. I came up with:

main :: IO ()
main = do
  contents <- getContents
  let path = minPath $ csv3ToRoadSecs (lines contents)

Starting from the right, lines contents generates a [String] which is fed to csv3ToRoadSecs, defined as:

csv3ToRoadSecs :: [String] -> [RoadSec]
csv3ToRoadSecs = map (toRoadSec . map (read::String->Int) . splitOn ",")

This took me longer than I want to admit to get correct, but that is okay. Starting with the function composition: toRoadSec . map (read::String->Int) . splitOn ",", on the right: splitOn "," takes a String and generates a [String] splitting on commas. Then we map read on each String to generate an Int, finally we convert the [Int] to a RoadSec. Then we map the function composition over the [String] from lines contents. It isn’t too hard when you follow the types.

String and [Char]

Conceptually, I wanted to represent p in Path as [Char] but some of the type checking became problematic. String is exactly the same as [Char], but [Char] didn’t operate as I expected. This could have been a misunderstanding on my part, but it was easier to use String everywhere. I think this is a minor point, but I did get hung up on the type checking.

Adding a Feature

The feature I decided to add was generating a pictoral representation of the path travelled. Basically, it is another folding problem which is great because I know how to solve those. Our [A] in this case is the p from the final Path and the B will be a PictoralRepr, defined as:

type PictoralRepr = (String, String, String, String)

Where the first entry will contain the current path similarly to the first element of Path, but we don’t need to keep the overall path! Only the most recent one. The next 3 will build a top, middle, and bottom row as a String. For example, if we had p = "aa" we would want to generate the representation:

-|-

and, p = "ab" would generate:

-|
 |
 |-

Our base case is simply the starting path ("a" or "b") and three empty strings to hold the representations. Our folding function:

foldPaths :: PictoralRepr -> String -> PictoralRepr
foldPaths (p, t, m, b) n
  | p == "a" && n == "a" = (n, "|-" ++ t, "  " ++ m, "  " ++ b)
  | p == "a" && n == "b" = (n, "|-" ++ t, "| " ++ m, "| " ++ b)
  | p == "b" && n == "b" = (n, "  " ++ t, "  " ++ m, "|-" ++ b)
  | otherwise = (n, "| " ++ t, "| " ++ m, "|-" ++ b)

Where p is the current path, t is the top representation, m is the middle representation, b is the bottom representation, and n is where we are headed next. There are four possibilities, "aa", "ab", "bb", "ba". Depending on the combination we prepend the new strings to the representations (remember prepend faster than append) and the final representation should be reversed. Finally, the actual foldl call:

pictoralRepr :: String -> PictoralRepr
pictoralRepr (x:xs) = foldl foldPaths ([x], "", "", "") (tail . splitOn "" $ xs)

The input is a String and we take off the first Char which is why [x] is necessary (reminder: String made type checking easier). The composition tail . splitOn "" converts a String (xs) into [String]. It’s easier to show you why the tail is necessary:

$ stack ghci
Prelude> import Data.List.Split (splitOn)
Prelude Data.List.Split> a = "babb"
Prelude Data.List.Split> splitOn "" a
["","b","a","b","b"]

splitOn adds an empty string in the head position. Finally, there is one issue. The final path is never added to the representation and we need to deal with it specially. Luckily we already have our folding function, so it is easy as:

let repr = pictoralRepr (pathStr path)
let lastRepr = foldPaths repr (prev repr)

Where pathStr extracts the p from a Path. prev extracts the “previous” position which would have been used to make the next combination. We can simply using our folding function on repr itself to generate the last part of the representations.

Compile and Run

I am using stack, which I know very little about thus far. I compiled the code with:

$ stack exec -- ghc -dynamic v2.hs
[1 of 1] Compiling Main             ( v2.hs, v2.o )
Linking v2 ...

Run on the example in the book which generates p = "babb":

./v2 < toLondon.csv
"Minimum distance travelled: 75"
"Path travelled: babb"
"Take the following path:"
" |-|    "
" | |    "
"-| |-|-|"

Sweet! I generated a larger random CSV file with Python, numpy, and pandas (obviously I should be using Haskell!):

$ ipython
In [1]: import pandas as pd
In [2]: import numpy as np
In [3]: df = pd.DataFrame(np.random.randint(0, 100, size=(100, 3)))
In [4]: df.to_csv("test.csv", index=False, header=False)

Run the code:

$ ./v2 < test.csv
"Minimum distance travelled: 4143"
"Path travelled: bbbbbbbbaaaaaaaaaaabbbbbbbbbbbbbaaabbbbbbbbbbbbbbbbbbaaaaaaabbbbbbaaaaaaaaaaaaaabbbbbbbbbbbaabbbbbbb"
"Take the following path:"
"               |-|-|-|-|-|-|-|-|-|-|-|                         |-|-|-|                                   |-|-|-|-|-|-|-|           |-|-|-|-|-|-|-|-|-|-|-|-|-|-|                     |-|-|              "
"               |                     |                         |     |                                   |             |           |                           |                     |   |              "
"-|-|-|-|-|-|-|-|                     |-|-|-|-|-|-|-|-|-|-|-|-|-|     |-|-|-|-|-|-|-|-|-|-|-|-|-|-|-|-|-|-|             |-|-|-|-|-|-|                           |-|-|-|-|-|-|-|-|-|-|-|   |-|-|-|-|-|-|-|"

Wrapping Up

This was a fun little project. I am feeling a lot more comfortable about folding. The entire code with all of the helper functions can be found in this gist. I can’t stop you from making comments, but understand I am a novice. I will learn better techniques as I continue.

Hope you enjoyed this post, let me know what you think on Twitter @chiroptical

Building Tensorflow GPU Images for HPC

2018-01-21T00:00:00+00:00

Background

I am an HPC administrator for Pitt. A common trend lately is users asking for the newest Tensorflow release the second it is available. However, as many of you probably know, compiling Tensorflow can be a bear. My daily Linux distribution of choice is Arch Linux which is a bleeding edge distribution and Tensorflow 1.4.1 is as easy as sudo pacman -S python-tensorflow-opt-cuda (note, I use the pip package below). However, I have found that building GPU enabled containers is a little tricky because if the underlying NVIDIA libraries don’t match it will never run on the GPU. First, we should talk about building HPC containers

Enter Singularity

Singularity (http://singularity.lbl.gov) is a very powerful tool for reproducible research as well as portable software. It is available on the Arch Linux User Repository (AUR) as singularity-container and can be installed with yaourt -S singularity-container (I have always used yaourt AUR package manager, others exist). If you need to build from source:

git clone https://github.com/singularityware/singularity.git
cd singularity
./autogen.sh
./configure
make
make install
make test

I have never had an issue compiling and installing this code. On Red Hat Enterprise Linux, you will need to sudo make install for everything to work. If you don’t have access to install with sudo, add ` –prefix= --disable-suid` to the configure line (via [Issue 1258](https://github.com/singularityware/singularity/issues/1258)). I am not going to go over the basics of Singularity, check out their documentation for that (or hit me up on Twitter and I'll write about it).

Setting Up

As I mentioned previously, the tricky part is getting the libraries to match. I installed NVIDIA Drivers/CUDA using an orchestration tool called Warewulf (same developer as Singularity!). On my compute nodes, these libraries are included in the following packages:

NVIDIA-Linux-x86_64-384.59.run
cuda_8.0.44_linux.run (ships with CuDNN 5)

For TF 1.4.1, you also need CUDNN 6: cucudnn-8.0-linux-x64-v6.0.tgz. Ideally, you would download all of these from the web. In the case of the slightly older Driver/CUDA, I had a harder time finder these online. I keep the source on my Warewulf master for safe keeping. You may need to ask your HPC administrator for these packages.

Build File

With Singularity, you have some options for building containers and I chose to use a bootstrap file. By convention it is titled Singularity and is essentially a Bash script. First, I will paste the entire file and then break it down section by section.

Bootstrap: docker
From: base/archlinux

%runscript
    exec python $*

%setup
    # Mirror list
    echo 'Server = http://mirror.cs.pitt.edu/archlinux/$repo/os/$arch' > $SINGULARITY_ROOTFS/etc/pacman.d/mirrorlist
    echo 'Server = http://mirrors.rit.edu/archlinux/$repo/os/$arch' >> $SINGULARITY_ROOTFS/etc/pacman.d/mirrorlist
    echo 'Server = http://mirror.es.its.nyu.edu/archlinux/$repo/os/$arch' >> $SINGULARITY_ROOTFS/etc/pacman.d/mirrorlist
    echo 'Server = http://mirrors.rutgers.edu/archlinux/$repo/os/$arch' >> $SINGULARITY_ROOTFS/etc/pacman.d/mirrorlist

    # NVidia
    VERSION=384.59
    sh NVIDIA-Linux-x86_64-$VERSION.run -x
    mv NVIDIA-Linux-x86_64-$VERSION $SINGULARITY_ROOTFS/usr/local
    cp links.sh $SINGULARITY_ROOTFS/root

    # CuDNN
    mkdir $SINGULARITY_ROOTFS/usr/local/cuda
    cp -R cudnn/* $SINGULARITY_ROOTFS/usr/local/cuda

    # CUDA
    dir=$(pwd)
    sh $dir/cuda_8.0.44_linux.run -extract=$dir/cuda
    $dir/cuda/cuda-linux64-rel-8.0.44-21122537.run --noexec --keep
    cp -R $dir/pkg/lib64/* $SINGULARITY_ROOTFS/usr/local/cuda/lib64

    # Cleanup
    rm -rf cuda pkg

%environment
    export LD_LIBRARY_PATH=/usr/local/cuda/lib64:/usr/local/NVIDIA-Linux-x86_64-384.59:$LD_LIBRARY_PATH
    export PATH=/usr/local/NVIDIA-Linux-x86_64-384.59:$PATH
    unset XDG_RUNTIME_DIR

%labels
    AUTHOR barrymoo

%post
    # Process NVIDIA links
    sh /root/links.sh 384.59

    # Install python and pip
    pacman -Syy --noconfirm python python-pip

    # Install tensorflow
    pip install --upgrade tensorflow-gpu

Metadata

This is the only section which doesn’t start with a %

. In this case I use Bootstrap: docker and From: base/archlinux. I am telling singularity to start with an Arch Linux base image from DockerHub. In the next sections, I will modify that container.

%runscript

This tells the container that when a user runs singularity run tensorflow-gpu.img ... to run python (within the container) with ... as arguments. There is a lot more clever things one can do with this section, but my users basically need to run python

%setup

I will break this down into a few steps:

Generate a list of mirrors for pacman. In the %post section, we need to install things via pacman and will need to refresh mirrors. Note the use of $SINGULARITY_ROOTFS! In this section we are running from outside the container! To refer to the root filesystem of the container we use this environment variable.
Install the NVIDIA driver which matches the compute nodes. Copy in the script links.sh, which I borrowed from @clemsonciti on GitHub (thanks!), which we need for inside the container.
I already had CuDNN extracted in this directory, simply make the /usr/local/cuda directory and copy CuDNN in.
Install CUDA. CUDA comes with 3 components, I only want the CUDA libraries.
Finally clean up the stuff we no longer need.

%environment

This generates a file /environment inside the container which singularity runs when setting up the environment.

%labels

More metadata. There is probably more useful information I could put in here, but I don’t plan to distribute this container (it is specific to my current compute node environment).

%post

Unlike %setup this section is executed inside the container! Again the steps:

Run the links.sh script for our driver version.
Install python and pip.
Install tensorflow-gpu via pip.

Here, you could install other packages. For example, I know my users will use cython (installed via pip) and gcc (installed via pacman).

Bootstrap the Container

You will need root to build the container. Therefore, it makes sense to use your own computer: sudo singularity build tensorflow-gpu.img Singularity. After it is done building,

$ singularity run tensorflow-gpu.img hello-world.py # a stupid simple hello TF script
... Errors complaining it can't run on GPU due to driver mismatch ...
b'Hello TF'
42

From my cluster:

$ singularity run tensorflow-gpu.img hello-world.py
...
Creating TensorFlow device (/device:GPU:0) -> (device: 0, name: GeForce GTX
1080, pci bus id: 0000:81:00.0, compute capability: 6.1)
...
b'Hello TF'
42

Fantastic! We are running TF 1.4.1 on a compute node without compiling anything!

Wrap Up

I think Singularity is fantastic. I spent a lot of time mucking around with compiling TF by hand in our HPC environment. To be fair, I also spent a lot of time mucking around with building Singularity images on GPUs. However, every time a new release of TF comes around I can simply update the container and stop compiling it by hand. As usual, if anyone thinks what I am doing is stupid and you have a better way. Message me on Twitter.

Free SSL certificates on Digital Ocean with Letsencrypt

2017-12-31T00:00:00+00:00

Quick Start

Important Notes:

Using Digital Ocean droplet 5$/month tier
CentOS 7
Apache (I am assuming this is already installed and running)
May need sudo in front of these commands if you created a non-root user

Install EPEL (Project Site):

  wget https://dl.fedoraproject.org/pub/epel/epel-release-latest-7.noarch.rpm # may need `yum install wget`
  yum install epel-release-latest-7.noarch.rpm

Install certbot (Project Site for CentOS/Apache):

  yum install python-certbot-apache
  certbot --apache # follow prompts, I force SSL

Auto-renewal (because we are lazy, modified slightly from Arch Wiki Let’s Encrypt):

The one-shot service which runs the renewal /etc/systemd/system/certbot.service:

  [Unit]
  Description=Let's Encrypt renewal
        
  [Service]
  Type=oneshot
  ExecStart=/usr/bin/certbot renew --quiet --agree-tos

The certbot.service will check for an expired certificate and install a new one only if necessary. Certbot recommends that you check twice a day with a random 60 minute delay, we do that with /etc/systemd/system/certbot.timer:

  [Unit]
  Description=Daily renewal of Let's Encrypt's certificates
        
  [Timer]
  Persistent=true
  OnBootSec=10min
  OnUnitActiveSec=12hour
  RandomizedDelaySec=1hour
        
  [Install]
  WantedBy=timers.target

Enable and start the timer:

  systemctl enable certbot.timer
  systemctl start certbot.timer

Shortly after posting this I noticed that URLs which don’t exist yield an SSL error. I had to correct the vhost in /etc/httpd/conf.d/le-redirect-www.chiroptical.com\:443.conf. Just change the line: ServerAlias www.chiroptical.com to ServerAlias www.chiroptical.com chiroptical.com (obviously using your site name). This may have been because I already set this up previously using the non-certbot version of letsencrypt, but I don’t remember where I did that.

Congratulations on your automatically renewed and free SSL certificates :)

Chiroptical’s Blog

Getting started with Erlang’s `maybe_expr`

Assumptions

Introduction

Setting up rebar3

Enabling the feature

More information

First day with typst, a markup based typesetting system

Discarding monadic results in Haskell

Discarding Monadic Results in Haskell

Motivation

Options

Breakdown

void

The const equivalent

Underscores

Unicode

Disable the warning

Wrapping up

Simple Scaleable Preprocessing with PyTorch and Ray - 0

Simple Scaleable Preprocessing With Pytorch and Ray

Background

Why should you care?

Up and Running

Edits

Path to Beginnery in Functional Programming with Haskell - 1

Path To Beginnery in Functional Programming with Haskell

Project 1

Definition of Problem

Solving The Problem

Practical Considerations

Implement ... -> Maybe a Version

Implement myMinimumBy

Wrapping Up

C++ Recursive Template Metaprogramming: Fibonacci Numbers

Background

Path to Beginnery in Functional Programming with Haskell

Path to Beginnery in Functional Programming with Haskell

Introduction to the Series

Project 0

Definition of the Problem

Breaking up the Problem

Solving the Problem

Practical Considerations

Data Layout

Reading the Data

String and [Char]

Adding a Feature

Compile and Run

Wrapping Up

Building Tensorflow GPU Images for HPC

Background

Enter Singularity

Setting Up

Build File

Metadata

%runscript

%setup

%environment

%labels

%post

Bootstrap the Container

Wrap Up

Free SSL certificates on Digital Ocean with Letsencrypt

Quick Start

Implement `... -> Maybe a` Version

Implement `myMinimumBy`