Pete Keen

Using a Static JSON File in Home Assistant

2024-01-08T00:00:00+00:00

Recently I found myself needing to bring some JSON from a file into a Home Assistant sensor. Specifically, the electricity rates for my power company are woefully out of date on OpenEI so I decided I could just maintain the data myself.

Home Assistant doesn't have a direct way to read JSON data from a file into a sensor. There's the File platform which has a promising name but is actually a trap. File is meant for use cases where something writes to, say, a CSV file continuously and you just want to read the most recent line. It specifically does not read the whole file.

After a lot of searching I came across the Command Line platform. The integration does a number of things, but for our purposes it lets you periodically run a command within the context of the Home Assistant container and bring the output back into Home Assistant as a sensor.

Let's say you have a JSON file named rate.json in your Home Assistant configuration directory:

{
  "name": "Base Rate",
  "rate": 0.15
}

You can bring that into a sensor with the following snippet in your configuration.yaml file:

command_line:
  - sensor:
      name: "Electricity Rate"
      command: 'cat rate.json'
      value_template: "{{ value_json['rate'] }}"
      unit_of_measurement: "USD/kWh"
      json_attributes:
        - name
        - rate

This config does a couple things. The command key specifies what command HA should run, in this case cat to read the file to stdout. value_template extracts the rate key from the file into the sensor's value. The json_attributes list pulls the listed keys into attributes on the sensor, which you can later access from a template using state_attr(). I have also specified unit_of_measurement here just because the Energy reporting system needs that if you want to use this as an input.
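To make the split between the sensor's value and its attributes concrete, here is a rough Python sketch of what happens to the command's output. This is not Home Assistant's actual implementation; the function name parse_sensor_output is hypothetical and exists only for illustration.

```python
import json


def parse_sensor_output(stdout, value_key, attribute_keys):
    """Roughly mimic value_template and json_attributes for JSON output."""
    data = json.loads(stdout)
    # value_template: "{{ value_json['rate'] }}" picks one key as the state.
    state = data[value_key]
    # json_attributes copies the listed keys into the sensor's attributes.
    attributes = {key: data[key] for key in attribute_keys if key in data}
    return state, attributes


# Same content as rate.json above, as `cat` would print it.
output = '{"name": "Base Rate", "rate": 0.15}'
state, attributes = parse_sensor_output(output, "rate", ["name", "rate"])
print(state)
print(attributes)
```

The state is the single value the Energy dashboard cares about, while the attributes carry the extra keys along for templates.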

So, the above is great if you have one static set of attributes to bring in, but sensor values can be at most 255 characters. What if you have a bigger file that you need to pull just a little data out of?

Let's say we have this slightly bigger file rates.json:

[
  {
    "id": "d1-11_summer_on_peak",
    "name": "Summer Peak",
    "months": [6, 7, 8, 9],
    "days": [1, 2, 3, 4, 5],
    "hours": [15, 16, 17, 18],
    "rate": 0.23525,
    "peak": true
  },
  {
    "id": "d1-11_summer_off_peak",
    "name": "Summer Off-Peak",
    "months": [6, 7, 8, 9],
    "days": [0, 1, 2, 3, 4, 5, 6],
    "hours": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23],
    "rate": 0.17859,
    "peak": false
  },
  {
    "id": "d1-11_winter_on_peak",
    "name": "Winter Peak",
    "months": [1, 2, 3, 4, 5, 10, 11, 12],
    "days": [1, 2, 3, 4, 5],
    "hours": [15, 16, 17, 18],
    "rate": 0.17879,
    "peak": true
  },
  {
    "id": "d1-11_winter_off_peak",
    "name": "Winter Off-Peak",
    "months": [1, 2, 3, 4, 5, 10, 11, 12],
    "days": [0, 1, 2, 3, 4, 5, 6],
    "hours": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23],
    "rate": 0.1658,
    "peak": false
  }
]

This file describes the rate structure that's in effect at my house, DTE rate D1.11. Each entry in the array is a rate, and the first matching rate based on month, weekday, and hour is the effective rate.

Ordinarily one might reach for a HACS integration or something, but the Home Assistant container has another trick up its sleeve: it bundles jq.

jq is a tool for querying and manipulating JSON streams. This essay isn't meant to be a jq tutorial so we're not going to go in depth into what this query in rate_filter.jq does, but in broad strokes it picks the first matching rate from the input file and extracts just the name, rate, and peak keys.

map(select(
    (.months[] | contains($ARGS.positional[0] | tonumber))
    and (.days[] | contains($ARGS.positional[1] | tonumber))
    and (.hours[] | contains($ARGS.positional[2] | tonumber))
))[0] | {name, rate, peak}

Here's a modified command line sensor that runs jq appropriately:

command_line:
  - sensor:
      name: "DTE Rate"
      command: 'jq -f rate_filter.jq rates.json --args {{ now().month }} {{ now().isoweekday() % 7 }} {{ now().hour }}'
      value_template: "{{ value_json['rate'] }}"
      unit_of_measurement: USD/kWh
      json_attributes:
        - name
        - rate
        - peak

The relevant change here is to the command key, which now invokes jq with the -f argument to pass the filter as a file rather than trying to quote everything properly within HA, then passes the actual rates.json file, then treats the rest of the arguments as positional args. These are accessed within rate_filter.jq as $ARGS.positional[0] etc.
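If jq isn't your thing, the same first-match selection can be sketched in Python. This is only an illustration of the logic, not something Home Assistant runs. RATES is an abbreviated stand-in for rates.json, with the off-peak hours written as a range for brevity, and pick_rate is a hypothetical name.

```python
from datetime import datetime

# Abbreviated stand-in for rates.json: two of the four entries, same shape.
RATES = [
    {"name": "Summer Peak", "months": [6, 7, 8, 9], "days": [1, 2, 3, 4, 5],
     "hours": [15, 16, 17, 18], "rate": 0.23525, "peak": True},
    {"name": "Summer Off-Peak", "months": [6, 7, 8, 9],
     "days": [0, 1, 2, 3, 4, 5, 6], "hours": list(range(24)),
     "rate": 0.17859, "peak": False},
]


def pick_rate(rates, when):
    """Return name, rate, and peak from the first entry matching `when`."""
    # Same conversion as the template: Sunday -> 0, Monday -> 1, ... Saturday -> 6.
    day = when.isoweekday() % 7
    for entry in rates:
        if (when.month in entry["months"]
                and day in entry["days"]
                and when.hour in entry["hours"]):
            return {key: entry[key] for key in ("name", "rate", "peak")}
    return None  # nothing matched


weekday_peak = pick_rate(RATES, datetime(2024, 7, 10, 16))  # a Wednesday
sunday = pick_rate(RATES, datetime(2024, 7, 14, 16))        # a Sunday
print(weekday_peak["name"])
print(sunday["name"])
```

Order matters: the off-peak entries also cover the peak hours, so they only work as a fallback because the peak entries come first.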

With this set up I can access the current electric rate within my Home Assistant in a way that is compatible with the Energy dashboard, which is completely local, and which should be easy to maintain in the future.

The above isn't at all specific to electric rates, by the way. This technique should work for any data that you need in HA but is more complicated than a plain input can work with.

Using Eufy Permanent Outdoor Lights with WLED

2023-11-28T00:00:00+00:00

For as long as I've known her, my wife has been very into decorating for the holidays. She loves it. The traditions, the cherished ornaments, the stockings on the wall, etc etc.

I'm ok with decorating. It's fine. It's not my favorite thing.

The one thing I don't like at all, though, is putting up outdoor lights. Getting up on the roof at our previous house was a complete nightmare that I absolutely refused to do, so we just strung some lights around the porch and called it good.

After moving into our current house I had the idea that I would install "permanent" outdoor lights, but then I was worried that I would need to take them down when we got the roof and gutter replaced at some point so I kept deferring.

This year, though, an internet friend mentioned Eufy Permanent Outdoor Lights.

These are in the style of the permanent outdoor lights that Govee has been distributing for a few years now, with a couple important distinctions:

They are RGBW, meaning they have a whole separate white LED along with the RGB LED. I can't stand the cool white that standard RGB LEDs try to synthesize, especially for holiday decorations.
They (at the time) were only $200 per 100 foot section. I would need two, so cheaper is better.
The light strings are supported by WLED.

That last bit was the most interesting part. Over the years I've gotten more curmudgeonly about IoT devices, and at this point I'll only buy cloud-controlled IoT devices if they're super compelling or let me host the controller myself somehow.

After reading that article by Robert at The Hook Up I was convinced that the Eufy lights were the thing to get, and I was further convinced that I definitely didn't want to run the stock controllers past testing the light strings.

Parts

The bounding box for my house is a rectangle that's roughly 150 feet by 25 feet, so I knew I would need two 100ft light kits. I also wanted an off the shelf WLED controller and a power supply that I wouldn't have to worry about if I stuck it to the wall in my garage.

One interesting wrinkle about the Eufy light strings is that they're 36 volt, rather than the more common 5 or 12 volt strings. As far as I can tell the chips don't come in a 36 volt part so Eufy must have a buck converter built into each puck.

Anyway, after a bunch of trial and error, here's my shopping list (Amazon links, but not affiliate links):

2x Eufy Permanent Light Kit (I used coupon code EUFYLIGHT but I have no idea if that still works).
1x Ericsity WLED LED Light Strip Controller
1x ALITOVE 36V Power Supply 4A (each stock power supply is 1.5A so I figured this would be good enough)
1x SUPERNIGHT LED Pixel Strip Amplifier
1x 36V to 12V buck converter

Installation

Each light puck has a 3M sticky pad pre-applied, but Eufy says in plain language that those are just to temporarily hold the pucks in place while you secure them with real screws. I knew going into this that I didn't actually want a truly permanent install, so screws were a no go.

Instead, I 3D printed a bunch (way too many) of this little mounting bracket and used small 3M outdoor Command Strips to stick them to the soffit. The outdoor Command Strip variant is rated down to -20°F, which seems reasonable for my part of Michigan, and so far they've held up well.

The controller is pretty boring. It consists entirely of an Espressif ESP8266 (4MB flash version) microcontroller running WLED and a tiny bit of power regulation. The input power rails are connected directly to the light strips with no fuses or anything and the 3.3V data outputs are weakly level shifted up to 5V.

At first I tried hooking the light strips directly to the controller, but any time I tried to do anything other than turning the white channel on, the lights freaked out. After watching a couple YouTube videos, the symptom seemed to match the data line experiencing some voltage drop before getting to the first LED puck.

I wanted to have the controller and power supply inside my garage and just run 18 AWG thermostat wire out an existing hole in the outside wall to the light strips, along with using half of the extension that the kits came with. This added a good 8 feet of wire between the controller and the first puck, which I guess is just too much. After adding the amplifier that symptom went away and they've been running reliably ever since.

In addition to adding the amplifier I added a buck converter to bring the voltage for the controller and amplifier down to 12V. The controller seemed to be happy running at 36V but I didn't really want it to blow up and have to re-engineer everything while the garage is super cold.

I had a Unifi Flex Utility box lying around from when I was using Unifi switches more extensively, so I broke all the interior plastic out of it and used regular Command Strips to mount the components.

Software

The controller shipped with WLED 0.14.0-b3 which worked fine, but I went ahead and upgraded it to 0.14.0 which must have been released sometime after manufacturing.

For a bit I tried messing with esphome but the built-in effects in WLED are just really good, and being able to configure things on-device rather than recompiling the firmware is pretty compelling. I might try esphome again at some point.

I also tried using Home Assistant to control WLED but for some reason the combination of this board, this version of WLED, and the current version of Home Assistant results in HA being way too chatty with the controller and causing random reboots. At first I thought it was the power supply but no, turns out it's a known issue with the HA integration.

The Eufy strings need these WLED settings:

LED type: SK6812 RGBW
Color order: BRG, swap W & G
Auto-calculate white channel from RGB: Accurate

Caveats

There are a couple things I would do differently if I were to do this project again.

I would get a bigger power supply. With the 4A brick I bought, if I try to run both the color and white channels at full brightness the controller senses a brownout and restarts. A Mean Well 36V 10A power supply with screw terminals is not that much more expensive than the brick and would give a ton more headroom for brightness.
I would use a different controller, either a QuinLED or something similar with an ESP32. The ESP32 has fewer weird limitations than the ESP8266 and seems to be more reliable, and the QuinLED controllers have all of the level shifting and boosting built in so I (probably) wouldn't have to have the separate amp. The QuinLED boards also can have more output channels.

I don't think I'm going to rebuild this controller setup any time soon, but I might add more strings to wrap around the back of the house and would probably build a separate controller for it.

Using a Secondary Klipper for Printer Power Control

2023-09-19T00:00:00+00:00

I recently got a 3D printer, a Sovol SV06 Plus. Remarkably, this has been a net positive for almost every aspect of my life. My kids are into it, my spouse is browsing filament colors, and I can print whatever little project boxes my heart desires.

We're not here to talk about that, though. We're here to talk about controlling the power for this printer in possibly the most ridiculous way possible.

Printer Software

The software stack for my printer consists of:

Klipper is the firmware and software that takes the sliced 3D models and tells the printer how/where/when to move and extrude.
Moonraker sits athwart Klipper and mediates API requests from higher up layers to Klipper's somewhat weird JSON-over-BSD-sockets API. Importantly, it also has the capability to control printer power a few different ways.
Mainsail is a web-based frontend for Moonraker that allows one to upload, select, and print gcode files, along with controlling every other aspect of the printer itself.

Nearly all of this stack runs on a Dell Wyse 5070 thin client that I have sitting next to the printer. The only thing that runs on the printer's own controller board is the Klipper firmware.

Klipper is interesting when compared to more conventional 3D printer firmware like Marlin, insofar as it doesn't try to do everything on the limited embedded microcontroller on the printer controller board. Instead, it splits the work in two: the firmware exposes the printer hardware over a serial port using a compact binary API, and the "host" software (written primarily in Python) runs on a more powerful computer, typically a Raspberry Pi. The host software interprets gcode, turning it into an optimized set of movement commands and shipping those commands to the printer hardware on a precise timeline.

What does this have to do with power control?

The biggest difference between my setup and most Klipper setups is that I'm running the Klipper host software on a used thin client PC (the Dell Wyse 5070 mentioned above) instead of a Raspberry Pi, which means I don't have direct access to GPIO pins (a GPIO pin can, say, flip a relay on and off). GPIO is one way that Moonraker can control power, but not the only way. Moonraker can instead use a generic HTTP client to talk to some other thing that can control power, for example Home Assistant or Tasmota-powered smart switches.

I saw this, and I says to myself, I says "Self, why not make one Moonraker talk to another Moonraker which talks to another Klipper which talks to a microcontroller dedicated to GPIO which then flips a relay to control my printer power?"

The advantages of this setup:

I can use the same secondary Klipper (in this case a Raspberry Pi Pico) to control power/lights for anything on my bench separately from that machine's specific controller.
I don't have to switch to a Raspberry Pi.

That's weird. Just switch to a Raspberry Pi.

No.

Sigh. Ok fine. Show us the configs.

The config is remarkably small, once you get two Klippers and two Moonrakers installed. I used KIAUH which has the ability to do this baked right in.

In printer_1_data/config/moonraker.cfg I have appended the following bits:

[power SV06_Plus]
type: http
locked_while_printing: true
restart_klipper_when_powered: true
restart_delay: 2
on_url: http://localhost:7126/machine/device_power/device?device=sv06_plus&action=on
off_url: http://localhost:7126/machine/device_power/device?device=sv06_plus&action=off
status_url: http://localhost:7126/machine/device_power/device?device=sv06_plus
request_template:
  {% if command in ["on", "off"] %}
    {% do http_request.set_method("POST") %}
    {% do http_request.set_body("") %}
  {% endif %}
  {% do http_request.send() %}
response_template:
  {% set resp = http_request.last_response().json() %}
  {resp["result"]["sv06_plus"]}

This is pretty dense but basically it's saying, use the generic HTTP client with three URLs: on, off, and status. Requests need to be a POST if we're telling the remote Moonraker to turn the printer on or off. The response_template extracts the status that the secondary Moonraker returns from each command.
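To show what those three URLs amount to on the wire, here's a small Python sketch that builds the same requests and pulls the state out of a response. It assumes the secondary Moonraker is listening on localhost:7126 as in the config above. The helper names are hypothetical, and the sample response body is made up to match the shape the response_template expects.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request

BASE_URL = "http://localhost:7126/machine/device_power/device"


def build_power_request(device, action=None):
    """Build a status request (GET) or an on/off request (POST with empty body)."""
    params = {"device": device}
    if action is None:
        return Request(f"{BASE_URL}?{urlencode(params)}", method="GET")
    params["action"] = action
    return Request(f"{BASE_URL}?{urlencode(params)}", data=b"", method="POST")


def parse_power_state(body, device):
    """Equivalent of resp["result"]["sv06_plus"] in the response_template."""
    return json.loads(body)["result"][device]


request = build_power_request("sv06_plus", "on")
print(request.get_method(), request.full_url)

# Illustrative response body, not captured from a real Moonraker.
sample = '{"result": {"sv06_plus": "on"}}'
print(parse_power_state(sample, "sv06_plus"))
```

Passing the request to urllib.request.urlopen would actually send it; the sketch stops short of that so it runs without a printer attached.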

Three additional settings control the safety interlock, so the printer can't get turned off in the middle of a print, and tell the primary Moonraker to restart the primary Klipper when the power gets switched on.

For printer_2_data/config/moonraker.cfg I added the following:

[power sv06_plus]
type: klipper_device
object_name: output_pin relay1

This states that the secondary Moonraker can control the power of a device named sv06_plus by telling the secondary Klipper to toggle an output pin named relay1.

Finally, this is printer_2_data/config/printer.cfg:

[mcu]
serial: /dev/serial/by-id/usb-Klipper_rp2040_E661AC8863625425-if00 #don't copypasta this, find your own

[printer]
kinematics: none
max_velocity: 1000
max_accel: 1000

[output_pin relay1]
pin: gpio4

In other words, the MCU is a Raspberry Pi Pico, I have a "printer" that can't actually print, and I have a single output pin on gpio4 named relay1.

I built and uploaded the Klipper firmware to the Pico following the package instructions and now Mainsail has a little toggle to control the printer power! And it works!

Pretty cool, right?

Updated 2023-09-26 to use Moonraker's native power APIs rather than toggling the relay directly.

Backyard Fiber Update 2

2023-06-05T00:00:00+00:00

You ever have one of those moments where you realize that what you've actually been doing is not at all what you thought you were doing?

Yeah. I had one of those a couple weeks ago.

See, I thought I was just installing some fiber in the back yard to get my shed gear off of a lightning-prone, shallowly buried cat5e connection. What I actually ended up doing is rebuilding my home network (almost) from scratch while maintaining a passable SLA. A real Ship of Theseus type of deal.

Projects within Projects

Here's where my project checklist sits as of today:

As you can see, there ended up being a lot of projects even after the conduits were installed, and there's still a bunch left to do.

The biggest chunks were:

Pull fiber from media center (MC) structured media enclosure to office structured media enclosure (SME)
Set up both media enclosures
Pull fiber from office SME to shed
Set up the shed
Move a bunch of existing equipment around

On top of that I decided to rectify a couple other things that bothered me about the existing network setup. When we moved into this house there were two slightly dodgy cat5e connections from a random room in the house to a very small cabinet in the office, along with more cat5e terminated as phone jacks in a few other locations. I was able to use those existing wires to set up a functional but topologically and aesthetically displeasing network that I lived with for almost three years.

The old topology was sort of barbell shaped. I had a bunch of stuff in the office, a bunch of stuff in the media closet, and an annoyingly vital connection between the two via an underpowered 8 port switch in the basement. It also was very branchy with lots of switches and connections over cables that I didn't really trust.

To fix all of those problems at once I pulled five new ethernet cables and two fiber cores from the media center to the basement, out through the new conduit and pull box and back in through an existing 3/4" conduit that when I started contained a pair of rodent-chewed UV destroyed cat5 cables.

The new topology puts the house media closet at the center. My core switch is a Brocade ICX 6450-24 (non-PoE) and right next to it is a 24 port TP-Link Omada PoE switch. I put the modem, router, and homeprod servers here too. The office SME has a Ruckus (nee Brocade) ICX 7150-C12P switch along with a few pieces of IoT gear and the temporary home for the office access point. Also in the office are a small access switch in a cabinet and another on my desk.

One of the shed to office fiber pairs is spliced to a second MC to office pair so the shed doesn't depend on the office switch having power.

The basement switch just serves the TV and game consoles so there's no need for a UPS there anymore.

Problems

The most time consuming and costly problem on the whole project was mis-judging how long the custom fiber cables needed to be. Before I ordered I measured with Google Maps and added on a few meters just to be safe. It was not safe.

When I pulled the cable that was supposed to be between the office and the media closet it ended up being too short by about 15 feet. Thankfully I ordered the cable that I intended to go between the office and shed longer by about double that, so I was able to use it to get the office running.

Unfortunately that meant an extra three weeks and $250 while FS.com built and shipped a second, much longer cable. The distance is about 80 feet in a straight line so I ordered a 40 meter cable (130 feet) and ended up with about 15 feet extra. I guess I ordered right but my mind still has a hard time acknowledging that conduit adds that much to the straight line distance.

Results

So after all of the projects are done, what will I have?

A 26kW standby generator to keep us up and running through future ice storms and heat waves
A nice looking cabinet covering the electric meters and generator transfer switches
A 60A subpanel in the shed for future power tool expansion
A 12 core single mode fiber backbone that connects all three buildings on the property
10 additional Ethernet runs with six more to come
Centralized PoE and battery backup for access points and cameras
Several new potential locations for my homelab and other noisy or hot gear
All new cables and several old ones properly labeled
Several new tools
A network topology that doesn't make me itch

Arguably the most important result is the confidence I've gained in organizing and completing big projects like this, along with the confidence that cutting holes in the house isn't going to make it fall down.

Pictures

Here's some pictures of the interesting stuff:

Backyard Fiber Update 1

2023-04-19T00:00:00+00:00

Today is day one of three installation days. Yesterday they came and dug the trenches and today they're starting to put in the electrical and gas for the generator.

Tomorrow and possibly the next day are for actually hooking everything up.

When that's all taken care of I'll still be waiting on FS.com to build and ship the fiber assemblies that will go in the conduits.

Sigh.

At least I can finalize some other decisions while I wait for other people to do their thing.

Office Setup

Amongst all of the weird things that the previous owners of this house did, one of the very weirdest was running two ethernet drops from the tiny cabinet in the office (pic in the previous part of this series), all the way across the garage, and into a room that they used as a den/study area and we use as a playroom for the kids.

Why two? No idea. Why the tiny cabinet? I guess it made sense at the time.

I've been wanting to rectify this setup since we've owned the house and I finally have my chance with these conduits, so I'm taking it.

The new setup will be:

All of the fiber from the media cabinet in the house and the shed runs to a structured media panel in the office
Cat6 from this new panel to various points around the office: TV, guest room, my spouse's studio, my desk and work bench, and of course the stupid little cabinet
A Brocade ICX 7150-C12P managed switch in the panel

There are two candidates for where to put the panel. Candidate 1 is a closet in the guest bedroom that we currently just use for random storage.

Candidate 2 is the utility room.

The closet has the advantage of being an entirely empty stud cavity with lots of room to put other stuff if I choose. It's also approximately in the center of the building, which might make it better for wireless stuff.

The utility room is the more natural choice because the junction box on the outside will be right on the other side of that wall, so running things in from outside will be a snap, power is right there, etc. It's also already a noisy environment so there's no problem putting noisy gear in there if I need to.

The problems with the utility room are that there's little space for future expansion, the space that is there is currently occupied by an old manual generator transfer switch that should be removed anyway but which is not part of the spec for the generator job, and it would be directly above the water meter.

I'm not sure that last part matters, to be honest, but it seems like a worthy consideration for electrical equipment. The installer says it'll be fine and I trust them, so I'm sure it'll be fine. It's fine.

On the house side, we're negotiating around the generator transfer switches for where to put the outside low voltage junction box, and how I want the conduit run into the house itself. The installer hasn't ever done anything like this before, so it's new to both of us and I hope we get something useful without too many compromises.

Exciting stuff.

Backyard Fiber

2023-03-31T00:00:00+00:00

We moved into our current house in the middle of 2020. Between then and now (end of March 2023) we've experienced at least one power outage per year longer than 12 hours, the most recent of which was five days due to a historically bad ice storm. We were lucky to get a hotel room that let us stay the entire time, but we still had to discard a fridge and a freezer full of food.

Even at our old house we had been throwing around the idea of getting a whole home generator, but it was always just too expensive. This most recent event put us over the edge, though, and so now we have a generator in the yard waiting to be hooked up next week.

As part of this work, the contractors happen to be installing trenches exactly where I've wanted them since we moved in, so I'm having them put some low voltage conduit into the trenches and running it into junction boxes.

Check out this diagram:

The generator is going out by the shed, at the left end of the 80 foot conduit segment. Here's a gratuitous shot of what that looks like today, not hooked up:

Ok, but why?

Why the generator or why the conduit?

Generator

A couple of reasons why we wanted to get a generator:

We have an all-electric house so a small generator to run the furnace and fridge won't cut it.
We hate knowing that if we have to cheese it in the winter there will be a thundering herd trying to book the same hotel rooms.

Conduit

Three reasons for conduit:

I have a security camera and a few bits of equipment in the shed and they're connected today over a CAT5E copper link. I'm rolling the dice every thunderstorm to see what equipment is going to blow up from induced voltage on that cable.
The connection between the house and the office is a dicey CAT5E link that I have to run through several switches before it gets here. This is both aesthetically displeasing and also expensive if I want to have faster than gigabit speed in the office.
Fiber is cool!

The Plan

The plan is simple in concept but of course completely overkill complicated in execution.

Phase 0: Prep (we are here)

In this phase the generator is set but the trenches are not yet dug. I will be disconnecting the CAT5E link in favor of a wireless mesh link because of what happens in Phase 1.

I'll also be ordering a bunch of stuff from FS.com soon:

Two custom-built twelve strand armored direct-burial rated single mode fiber assemblies with LC ends and pulling eyes, one for each conduit
Some shorter indoor-rated single mode patch cables to run into each building
Several gigabit and 10gig optical transceivers
Assorted bits and bobs

Why twelve strands? That seems excessive.

Because, dear reader, I don't want to have to do this again. Running 12 strands of single mode fiber in a super tough armored cable in a buried conduit means that I likely won't ever have to. 12 strands gives me 6 links with conventional optics and up to 12 with bidirectional optics, should the need arise.

Phase 1: Trenching and Hookup

The contractors come back to dig the trenches, install the transfer switches (yes multiple), gas line, and low voltage conduits and get the thing running. The conduits will end at junction boxes at the shed, office, and house with small stub conduits into each building ending in either another junction box or a wall box. When the contractors dig the trenches I'm confident that they're going to cut the CAT5E link, but that's fine because I'm going to replace it with fiber.

Phase 2: Running Fiber and Shed Activation

My plan is to set up some kind of spool dispenser for the fiber assemblies and pull them through either solo or with my spouse helping to guide them in. After they're settled in the conduits, I'll couple one pair from each together at the office junction box to get the shed directly connected to the house core switch. This will be a gigabit link because it's really just one camera and an access point.

Phase 3: Office Activation

Here's where things get a little hazy.

Today, all of the connections from the house to the office to the shed terminate in a small cabinet under my desk. I've spent way too much money and time trying to get things to work in this cabinet and recently just gave up and moved everything over to a closet in the house.

Said cabinet when it was full:

I haven't settled on what exactly I'm going to do, but the leading contender right now is to install either a rack or a structured media enclosure in a closet in the office and re-run everything to that: a couple fiber strands from outside and a dozen or so ethernet drops around the office and garage.

This will involve some electrical work and crawling around in the dingy crawlspace under the office to pull more cable, so I might suffer with my dodgy ethernet connection for a little while longer.

I'll keep this page updated with photos from each phase as I complete it. I'm sure I'll also be keeping the #HomeLab tag on my Hachyderm account apprised, so follow there if you like!

Homeprod Management with Docker Compose

2022-09-21T00:00:00+00:00

Recently I decided to change how I manage my homeprod environment (home production, i.e. the things that other people in my household rely on and tell me if they're down). I moved everything over to docker-compose stacks managed with a small(ish) shell script. Skip down a bit if you don't want backstory.

Backstory

The situation before was a mish mash of various things that I've tried over the years. For a while I was deploying everything with dokku on one big VM running on a proxmox host. That worked for some things but the 12 factor architecture doesn't always fit.

For those things I used Portainer to launch docker-compose stacks. Again, works for a lot but sometimes it's really annoying. Portainer's docker-compose support is somewhat limited insofar as you can't really ship configs along with a docker-compose file to be mounted into a container, so for anything custom you need to build your own derived container. Once you have that there are a lot of pointy-clicky steps to actually refreshing and deploying an update.

The Fleet

I have a handful of systems running in homeprod:

hypnotoad is the VM host. It's an HP Elitedesk 800 G3 mini running Proxmox 7
netsvc1 is a Dell Wyse 3040 thin client running Technitium for DNS and DHCP with a nice web UI
There are 4 other Dell Wyse 3040 thin clients scattered around, one each in the house, office, garage, and shed. These have z-wave and zigbee USB sticks and are running zwavejs2mqtt and zigbee2mqtt.

All of the thin clients and most of the VMs are running Alpine Linux because the small runtime footprint meshes well with the 2 GB of memory and 8 GB of storage on the thin clients.

Every VM and physical node is connected to my Tailscale network as well.

Docker Compose Stack

The whole idea of this refactor is to centralize and simplify management without having to run SPOF orchestrators or heavy agents on the nodes (again: see the thin client specs above). I thought about a lot of things but nothing really clicked until one day while perusing a Hacker News thread I came across a tossed-off comment from someone who said they just used the official docker compose container, bundled their stack into the container, passed in the docker socket and everything Just Worked.

This sounded like magic, so of course I had to try it. Of course it worked, but it's very limiting. It implies a single host per derived image, for one thing, and that doesn't work with my fleet.

The kernel of the idea was really great, though, so I iterated on it and came up with docker-compose-stack. Docker compose stack (terrible naming, sue me) is basically the same idea taken to a higher extreme.

How it works

Here's the workflow that docker-compose-stack runs:

1. At startup, start.sh loads secrets from disk and, optionally, from a script named download_secrets.sh. It then runs run_compose.sh.
2. Runs anything declared in hosts.yml for the current host as a pre-start script within the context of the dockerstack-root container
3. Copies declared configs into /var/lib/docker/stack_configs
4. Creates a .env file from environment variables declared in hosts.yml
5. Runs sha256sum over the contents of /var/lib/docker/stack_configs and stuffs the result in CONFIGS_SHA
6. Composes a base docker-compose invocation from the list of stacks declared in hosts.yml
7. Drops some cron jobs
8. Runs docker-compose
9. execs into crond to run the crons set up in step 7
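
As a rough illustration of the "composes a base docker-compose invocation" step, here is a minimal sketch in POSIX sh. The function name compose_args, the STACK_DIR variable, and the one-file-per-stack layout are all assumptions made for the example, not a description of what run_compose.sh actually does.

```shell
#!/bin/sh
# Hypothetical helper: turn a list of stack names into docker-compose -f flags.
# STACK_DIR and the "<name>.yml per stack" layout are assumptions.
STACK_DIR="${STACK_DIR:-/stacks}"

compose_args() {
  args=""
  for stack in "$@"; do
    args="$args -f $STACK_DIR/$stack.yml"
  done
  # Strip the single leading space left over from the first iteration
  printf '%s\n' "${args# }"
}

# Example for a host that declares the "watchtower" and "zwave" stacks:
#   docker-compose $(compose_args watchtower zwave) up -d
```

Because the -f list is rebuilt from the host's declared stacks on every run, dropping a stack from the declaration drops its file from the command entirely.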

Even without an entry in hosts.yml, a host running docker-compose-stack will always run a watchtower container on a very short refresh cycle. Watchtower will check every 30 seconds to see if any container images have been updated, download the update, and re-create each updated container. That allows me to update my docker-compose-stack container with GitHub Actions and every host just updates itself accordingly.

That's it. That's the whole thing. I've been running my implementation of docker-compose-stack across 12 VMs and physical nodes for about a week and it's been ticking along nicely.

Configs and Secrets

There's some nuance around secrets and configs that probably deserves some explanation.

Configs are exactly what you expect. Nginx configs, whatever. These get dropped into a directory on the host to be mapped in as container bind mounts.

Secrets are loaded from a special file on disk. Secrets can also be loaded from a script. Loading works by running the script and capturing the output, evaling the output, and dropping the sha256sum of the output into a file in the config directory.
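
The script-based path could look something like the sketch below. The function name, its arguments, and the secrets.sha256 file name are hypothetical, and the script is assumed to print shell assignments such as export FOO=bar.

```shell
#!/bin/sh
# Sketch of script-based secret loading. Names here are illustrative only.
load_secrets_from_script() {
  script="$1"      # e.g. ./download_secrets.sh
  config_dir="$2"  # e.g. /var/lib/docker/stack_configs

  # Run the script and capture everything it prints
  output="$(sh "$script")" || return 1

  # Evaluate the output so the variables land in this shell.
  # eval trusts the script completely, so only point it at scripts you control.
  eval "$output"

  # Record a fingerprint of the secrets, not the secrets themselves
  printf '%s' "$output" | sha256sum | cut -d' ' -f1 > "$config_dir/secrets.sha256"
}
```

Writing only the hash into the config directory means a changed secret shows up as a changed file there without the secret value ever touching that directory.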

The nuance comes in when we want to update secrets or configs on running containers. By including CONFIGS_SHA in the list of environment variables for a service, it will automatically be recreated when that SHA changes. Otherwise changes aren't picked up very well.
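
One way to derive a single fingerprint for a whole directory is sketched here. The exact recipe (hash each file, sort, hash the list) is an assumption for illustration; the real script may compute CONFIGS_SHA differently.

```shell
#!/bin/sh
# Sketch: one SHA for a directory tree. Hashing the sorted list of per-file
# hashes keeps the result independent of directory listing order.
configs_sha() {
  dir="$1"
  (
    cd "$dir" || exit 1
    find . -type f -exec sha256sum {} + | LC_ALL=C sort -k 2 | sha256sum | cut -d' ' -f1
  )
}

# Example:
#   CONFIGS_SHA="$(configs_sha /var/lib/docker/stack_configs)"
#   export CONFIGS_SHA
```

Any edit to any file under the directory changes the result, so a service that lists CONFIGS_SHA in its environment sees a new value and gets recreated on the next docker-compose run.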

In my homeprod environment I'm managing secrets using tailscale-op-proxy which lets me tag a node with a Tailscale ACL tag and grab any secrets out of 1Password tagged with that same ACL tag. Nodes only get the secrets that they need and I get to manage secrets with the 1Password application rather than ssh'ing into each machine and managing them with vim.

Alternatives Considered

I looked at Kubernetes, but that was way too heavy. I also looked at Nomad which felt limiting, especially with regard to not wanting to run a SPOF orchestrator at all. I'm sure I could have implemented all of this with ansible or salt stack or puppet or whatever else.

I honestly just thought the docker-compose-within-docker idea was clever and decided to run with it.

Docker compose also has profiles built in, but those are confusing. If you use profiles and decide to remove a service from a node, that service won't actually be removed because, according to the compose authors, it's still part of the stack formation. That doesn't work for me, hence the machinations around building a compose command from a bunch of stack files.

You too?

I have esoteric tastes and don't really care about exploring production-grade infra systems like k8s in a home environment. My primary concern is that services stay up.

If that fits you too, and you run some stuff that other people rely on in your house, maybe take a look at docker-compose-stack.

Setting up an isolated work VLAN with VyOS

2022-08-12T00:00:00+00:00

I treat employer-provided hardware as hostile entities on my network. I have no control over them and I have no idea what invisible scanning software they've installed.

Previously I had set up a dedicated guest wi-fi network, which provides a sort of client isolation, but this wasn't quite enough because I want my work machine to also have an ethernet connection. Therefore, I went about setting up a VLAN for the first time.

Topology

My network is unfortunately somewhat complicated due to how my house is laid out. The path from my workstation to my WAN modem goes like this:

TP-Link Omada access point
TP-Link standalone L2+ switch
UniFi Flex switch
TP-Link Omada switch
VyOS running on an old HP SFF

I set up the VLAN on Omada, UniFi, and the standalone switch as a tagged VLAN on every port so that it could pass through unperturbed to VyOS. This was probably somewhat aggressive but I don't see any harm for my situation. Plus, I don't have my uplink ports written down anywhere and I was lazy.

The way this works in Omada and UniFi is identical: you just add a new LAN network of type VLAN, set the tag, and then ensure that every port is set to the automatically generated All port profile. That profile just sets the new VLAN as tagged (allowed to pass). On Omada I also set the work-specific guest network to have the correct VLAN tag.

On the standalone switch it was a little more manual, insofar as I defined the VLAN and then manually selected every port to be tagged, but overall it was similar.

VyOS

This is where it gets complicated. I'm running VyOS 1.4.x rolling releases and, as is the nature of a rolling release model where you just run the tip of the development branch, there are sometimes bugs.

For the longest time, though, I thought I was holding something wrong. Here's my bridge config:

bridge br0 {
    address 10.73.95.1/24
    enable-vlan
    member {
        interface eth0 {
            allowed-vlan 74
            native-vlan 1
        }
        interface eth2 {
            allowed-vlan 74
            native-vlan 1
        }
    }
    stp
    vif 74 {
        address 10.74.1.1/24
        firewall {
            in {
                name WORK-OUT
            }
        }
    }
}

My VyOS box, for purposes of this blog post, has three ethernet interfaces: one WAN and two LAN. I have the LAN interfaces set up with a Linux bridge, which is basically a network switch in software.

Both member interfaces of the bridge have native-vlan 1, which means that packets coming in without a VLAN tag are automatically assigned VLAN 1 and packets leaving with the VLAN 1 tag will have it stripped on the way out.

Both member interfaces also have allowed-vlan 74, which just means that packets with that tag are allowed to pass through those interfaces.

Then I have vif 74 which sets up a sub-interface on the bridge specific to VLAN 74. That sub-interface only deals with packets on VLAN 74 and ignores everything else. It has an address in a different subnet along with a firewall that I'll get to in a bit.

This all worked, to a point, but when I tried to do anything on a client machine attached to VLAN 74 I couldn't get anywhere. Every packet said "no route to host".

After a bunch of time with tcpdump I noticed something odd: ARP requests for 10.74.1.1 would get all the way to vif 74, which would dutifully issue an ARP reply, which would then be visible on br0 but not on either one of the ethernet interfaces and would never make it back to the client.

This is extremely weird behavior!

I asked for help on reddit, twitter, various Slacks, but then I remembered that VyOS has a slack and asked there. After a bit of debugging one of the VyOS developers noticed the problem: br0 wasn't getting VLAN 74 set as allowed by the bridge management code within VyOS:

vyos@vyos:~$ sudo bridge -c vlan show
port              vlan-id
eth2              1 PVID Egress Untagged
                  74
eth0              1 PVID Egress Untagged
                  74
br0               1 PVID Egress Untagged

The magic command to fix it is:

vyos@vyos:~$ sudo bridge vlan add vid 74 dev br0 self

and then when we run the show again:

vyos@vyos:~$ sudo bridge -c vlan show
port              vlan-id
eth2              1 PVID Egress Untagged
                  74
eth0              1 PVID Egress Untagged
                  74
br0               1 PVID Egress Untagged
                  74

As soon as I ran that add command everything started working great. I've added it to /config/scripts/vyos-postconfig-bootup.script and I trust that the VyOS development group will get around to fixing the bug at some point in the near future.
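
Since that add command runs on every boot, it might be worth guarding it so it only fires when the VLAN is actually missing. The sketch below parses the plain-text table shown above; has_vlan is a hypothetical helper, and it assumes the uncolored output of bridge vlan show (without -c, so no escape codes end up in the pipe).

```shell
#!/bin/sh
# has_vlan DEV VID: reads `bridge vlan show` output on stdin and succeeds
# if DEV carries VID. Assumes the tabular format shown in this post.
has_vlan() {
  awk -v dev="$1" -v vid="$2" '
    NR == 1   { next }                  # skip the "port vlan-id" header
    /^[^ \t]/ { port = $1; v = $2 }     # a line naming a port starts its block
    /^[ \t]/  { v = $1 }                # continuation line: just a VLAN id
    port == dev && v == vid { found = 1 }
    END { exit (found ? 0 : 1) }
  '
}

# Possible use in the post-config script:
#   sudo bridge vlan show | has_vlan br0 74 \
#     || sudo bridge vlan add vid 74 dev br0 self
```

If a future VyOS release fixes the bug, the guard simply finds VLAN 74 already present and does nothing.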

Firewall

Now that I had the VLAN set up I could access the internet from it, but also I could access the rest of my home network. VyOS assumes unless told otherwise that it should route packets as much as possible. You can use firewall rules to restrict that, however, so here's what that WORK-OUT rule looks like:

firewall {
    name WORK-OUT {
        default-action accept
        rule 10 {
            action drop
            destination {
                address 10.0.0.0/8
            }
        }
    }
}

It's really simple: drop everything that is destined for an address in the 10.0.0.0/8 subnet and accept everything else. This is maximally restrictive so that I can expand my network with other subnets and VLANs and not have to worry about updating this rule.
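
To make the rule's reach concrete, here is a toy model of the decision it makes for forwarded packets. work_out_action is purely illustrative and assumes a well-formed dotted-quad address as input; it is not how VyOS evaluates rules internally.

```shell
#!/bin/sh
# Toy model of WORK-OUT: rule 10 drops anything destined for 10.0.0.0/8,
# and the default action accepts the rest.
work_out_action() {
  case "$1" in
    10.*) echo drop ;;    # first octet is 10, so it falls inside 10.0.0.0/8
    *)    echo accept ;;  # default-action accept
  esac
}

work_out_action 10.73.95.20   # a host on the main LAN: drop
work_out_action 8.8.8.8       # the internet: accept
```

Note that only 10.0.0.0/8 is covered, so the "expand without updating the rule" property holds as long as new subnets are also carved out of that range.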

Ruby Service Objects with Sorbet

2022-05-27T00:00:00+00:00

I really enjoy working with Sorbet. Actually, I really like working with T::Struct; everything else that Sorbet provides is sort of just a bunch of nice bonus content. Today I'm writing about a small technique that I think illustrates the value of T::Struct and friends.

Recently I was working with a series of service classes that were all structured like this:

class SomeService
  class << self
    def call(arg1, arg2, arg3, arg4, arg5 = nil, ..., arg13 = nil)
      ...
    end

    private

    def some_private_method(arg1, arg3, arg5 = nil, ..., arg12 = nil)
      SomeOtherService.call(arg3, arg1, arg5, arg12, some_calculated_value)
    end
  end
end

That is to say, they all had a consistent callable interface with a very large number of nilable positional arguments. This isn't a bad pattern, per se, but it starts bordering on unreadable when you need to pass those arguments around to other methods within the service.

Being how I like to (ab)use Sorbet in fun ways, and that I really wanted all of those arguments to be typed, and that I wanted to convert this to a service object rather than a service class, here's what I ended up with:

class SomeService < BaseService
  private

  const :arg1, String
  const :arg2, T::Hash[Symbol, String]
  ...
  const :arg13, T.nilable(Integer)

  def call
    ...
  end

  def some_private_method
    ...
  end
end

Notice, right at the top, the private keyword. Everything in this class is private except the things that are exposed by BaseService. After that we define some const things, then no-argument call and some_private_method instance methods.

And what does BaseService look like, you ask?

class BaseService < T::InexactStruct
  def self.call(**kwargs)
    new(**kwargs).send(:call)
  end

  private

  def call
    raise NotImplementedError
  end
end

There's no real magic here. The interesting stuff happens in T::InexactStruct, which creates a nice constructor for you and handles all of the const and prop initialization. T::InexactStruct is exactly like a T::Struct except that you can subclass it. T::Struct prevents subclassing for, really, no good reason other than performance.

The only other weird thing happening is that .send(:call), and the only reason we're doing that is so that we can have a private instance-level call method. It's not absolutely required, but it considerably narrows the public interface of BaseService-derived classes.

I think this is a nice pattern that lets you use Sorbet's props to make a clean and minimal interface for your service objects.

What do you think? Is this something you'd use? Is it super gross and you hate it? Either way, lemme know by emailing pete@petekeen.net.

Z-Wave Controllers using Dell Wyse Thin Clients

2022-03-18T00:00:00+00:00

About two years ago my family and I moved into our (hopefully) forever home. This house has an unfortunately large number of exterior doors and is situated on a main road, so we wanted to make sure that most of the doors are locked without having to patrol the doors before bedtime every night.

To that end, I put z-wave locks on every door and built out a z-wave network so they could be reached from my Home Assistant instance that shows their status on a nice tidy dashboard.

The Beginning

Originally I had a z-wave USB stick just stuck into the server running Home Assistant. This is a problem for a couple reasons. First, the z-wave network was rebooting and taking forever to start up every time I restarted Home Assistant (which, at the beginning, was extremely frequently).

Second, this house is very long (about 140ft) and has a number of thick cinderblock walls between where I put the server and where the furthest locks are. Z-wave doesn't travel too well between cinderblock so I ended up having to use a couple repeaters, which worked ok but added so much latency that status updates just were not very reliable.

A New Plan

We suffered with the over-extended mesh for quite a long time before I cooked up a plan:

Each zone (defined as an area surrounded by cinderblock) will have a zwave gateway consisting of a raspberry pi and a z-wave USB stick
Each zone will have an independent z-wave network, rather than trying to bridge them with repeaters.
All the zones will integrate into one Home Assistant instance

This plan worked well, except that raspberry pi's are not very reliable for this application. Two of the zones are in unconditioned spaces and for whatever reason their connection would drop when it got cold. Also, those same two zones were using raspberry pi zero w's which are slow as heck trying to do anything.

Plan B: Replace It All

After reading a lot and trolling eBay for hardware I came across the Dell Wyse 3040 thin client. Ostensibly these are for running RDP sessions to remote Windows servers but they're quite overpowered. These little machines have four Intel Atom cores, 2GB of memory, and 8GB of onboard storage, plus a bunch of USB ports, two DisplayPorts and an ethernet jack. They can run whatever Linux you want, provided you can get into the BIOS settings and change the boot order. They're perfect for what I want to do. I just have to be careful with the built-in storage because it's exactly equivalent to an 8GB SD card soldered to the motherboard and can wear out after too many writes.

Software stack

I settled on this software stack for each gateway:

Alpine Linux as the base OS
zwavejs2mqtt running in a Docker container with the USB stick passed through
Managed with Portainer as an Edge Agent
zwavejs2mqtt ports only open to Tailscale

Alpine is a minimal linux that really tries hard to be small while also fully featured. It's widely used as a base for Docker containers but for this application I wanted to install it directly on the hardware. Alpine has a sys installation type that works well for this. I spent too much time trying to make a custom image that golfed down the installed size and just ended up using the stock ISO installer.

ZwaveJS2MQTT is a great UI on top of a pure-JS z-wave stack (zwave-js), which in turn has a first-party integration with Home Assistant. In my setup I actually have HA talking to the gateways via their websocket rather than using MQTT as an intermediary. I'm running their official Docker container and passing the USB z-wave stick through.

Portainer bills itself as "a centralized service delivery platform" which I'd say is fairly accurate. I'm using it to deploy a docker-compose file across my little fleet of thin clients without having to manually ssh into each machine and run a docker command. Each node is set up as an edge agent, which tells them to check in with the central server every few seconds, ask for work, and then go back to sleep. If the server needs their attention it'll tell them to open a tunnel on their next check-in so it can access their docker socket. All of this happens transparently; all I ever have to interact with is the central server's UI.

Tailscale is a managed mesh overlay network based on Wireguard. Every node in the mesh tries really hard to make direct Wireguard-encrypted connections to every other node. The Tailscale control plane handles public key distribution to all of your nodes as well as serving as an introduction point when nodes can't connect directly. It also has a tag-based ACL firewall system that lets me lock down the z-wave gateways so only Home Assistant can talk to them. This is important because zwavejs2mqtt doesn't ship with authentication built in for the web UI, so anyone can connect to the websocket if it's turned on and issue commands. I put them on tailscale to mitigate a situation where someone cracks my wifi password and starts scanning my network.

Setting it all up

Setting up the nodes was pretty standard once I got going. I created a generic USB bootable drive out of an external drive I had lying around using Ventoy. Once Ventoy is set up you just have to copy ISOs onto the drive and Ventoy presents a nice menu on boot to select one to boot. The way it works under the hood is fascinating: it uses iPXE to present the menu, then chainloads into the ISO you select. It can even do cloud-init/kickstart for a couple different Linux distributions (sadly not Alpine).

After getting Alpine installed I curl | bash'd a setup script. This doesn't do much of anything interesting, except for the last line:

docker network create -d bridge -o com.docker.network.bridge.host_binding_ipv4=$(tailscale ip | head -n1) tailnet

This is a docker network that bridges into tailscale, which means my docker-compose.yml file for the zwave stack can be nice and generic:

version: '3.7'
services:
  zwavejs2mqtt:
    container_name: zwavejs2mqtt
    image: zwavejs/zwavejs2mqtt:6.5.2
    restart: always
    tty: true
    stop_signal: SIGINT
    privileged: true
    networks:
      - tailnet
    ports:
      - '8091:8091'
      - '3000:3000'
    environment:
      - ZWAVEJS_EXTERNAL_CONFIG=/usr/src/app/store/.config-db
    devices:
      - ${ZWAVE_DEVICE_PATH:-/dev/serial/by-id/usb-0658_0200-if00}:/dev/zwave:rmw
    volumes:
      - zwave-config:/usr/src/app/store
volumes:
  zwave-config:
    name: zwave-config
networks:
  tailnet:
    external: true

Before creating that network I was trying to pass the tailscale IP in through an environment variable and the compose file was super ugly.
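
If the setup script ever needs to be re-run, the network creation could be made idempotent along these lines. This is a sketch: first_ipv4 and ensure_tailnet are hypothetical names, and it assumes tailscale ip prints one address per line.

```shell
#!/bin/sh
# Pick the first IPv4-looking line from `tailscale ip` output on stdin.
first_ipv4() {
  grep -E '^[0-9]+\.[0-9]+\.[0-9]+\.[0-9]+$' | head -n1
}

ensure_tailnet() {
  # Nothing to do if the network already exists
  docker network inspect tailnet >/dev/null 2>&1 && return 0

  ip="$(tailscale ip | first_ipv4)"
  [ -n "$ip" ] || { echo "no Tailscale IPv4 address yet" >&2; return 1; }

  docker network create -d bridge \
    -o com.docker.network.bridge.host_binding_ipv4="$ip" tailnet
}
```

Filtering for an IPv4 address rather than taking the first line avoids binding to the wrong thing if the output order ever changes.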

I started rolling out this system a couple months ago and finally put the last node in production earlier this week. It's been working very well, much better than the raspberry pi-based nodes it replaced. I think it also means I'm going to avoid raspberry pis unless I absolutely need the GPIO. They're just not very good machines.

My Virtualized Router

2021-11-24T00:00:00+00:00

This post is mostly for me to remember what I did, but feel free to follow along.

Original Post

Recently I decided to switch jobs, for a number of reasons that aren't germane to this post. I haven't had any proper time off for years so this time I decided to take a big chunk of time between leaving my old job and starting my new one. Two and a half months, to be specific.

I'm spending this time doing a few things. First, I'm being more present with my family. I haven't been the kind of dad or husband that I want to be lately and I'm trying my best to fix that. Second, I fired up my Xbox One and started playing Forza Horizon 5. It's ludicrous and mindless in the best possible way.

Third, the topic of this post: I'm building a virtualized router out of a Dell T20 server and a bunch of eBay'd networking gear.

But why?

Yeah, good question. There's a bunch of answers. Comcast is selling us 1.2Gbps service and I want to be able to use it all. I want a reliable failover WAN situation because Comcast goes out for about 5 minutes multiple times a day and that's really annoying in meetings. For example, I was in the middle of a job interview, deep in thought while on camera with two people from a company you've heard of, when Comcast fell over. I tethered to my phone quick but by the time I was reconnected I had lost my train of thought.

Beyond more and better, I want to get some more hands on experience with some tech that I only sort of tangentially know. Specifically, I've been running a Proxmox host for a couple years and it's been solid, but I don't know a lot about the guts. I've also been running a UniFi Security Gateway and then an Edgerouter, but I feel like I don't know how they actually do their job. I also want to play around with a thing called Open vSwitch within Proxmox and this seems like a good opportunity.

Also it's fun and my official job until early 2022 is to follow the dopamine.

Hardware Stack

The hardware is a mix of stuff I had on hand and a few things I've picked up:

Dell T20 minitower server with Xeon E3-1225v3 3.2GHz and 32GB of ECC memory
Samsung 500GB SSD (side note: SSDs have gotten ridiculously cheap since I last looked at them)
Intel X520-DA2 Dual SFP+ 10Gbps network card
Wiitek 10Gbase-T SFP+ interface
passive SFP+ DAC cable
Intel PRO/1000 VT quad port gigabit ethernet card
Motorola MB8611 DOCSIS 3.1 multi-gig cable modem
Netgear LB1120 LTE modem

The network card decisions deserve some explanation. Comcast gives us a 1.2Gbps cable connection handled by the MB8611 modem. That modem has a 2.5Gbase-T ethernet connection.

One way to handle this would have been a 2.5Gbase-T ethernet card in the router. This would have been a little cheaper but the fast connection would have ended at the router. I want to share the speed with my other server so I need another faster-than-gigabit port while also preserving the ability to, one day, maybe, upgrade to Comcast Gigabit Pro 3 Gbps fiber service. I'm also constrained by the T20's selection of three PCIe slots: 16x, 4x, and 1x. If I tried to do something with 2.5Gbase-T cards I probably would have run out of slots, but with the dual port SFP+ card in the 16x slot and the quad gigabit card in the 4x slot I'm fine.

I actually got two of those dual SFP+ cards, one for the router and another for my other server which will cross-connect with a passive DAC (directly attached copper) cable.

Software Stack

After putting the SFP+ card and SSD into the server I installed Proxmox VE 7.1 on the server and got started evaluating router distros.

The very first thing I installed was OPNSense, a FreeBSD-derived routing and firewall system. It came up wanting to be on the same IP as our current router (192.168.1.1) which was problematic for a bit until I figured out what was going on. After installing it I clicked around a bit and read some docs and decided that I wasn't really going to learn what I wanted to learn from it.

Next I installed VyOS, a routing and firewall package derived from Debian and Vyatta, which itself was strongly inspired by Juniper's router OS. This was sort of bewildering and overwhelming and after messing around a bit I moved onto the next thing.

Third, I installed NixOS and futzed around with the config. NixOS is a linux distribution that uses Nix to deploy and configure software. This is interesting but also weird and it doesn't get me multi-WAN failover out of the box. I'd have to build that myself, which is not super appealing.

I think what I'm going to do is reinstall VyOS and actually commit to learning how the CLI works. It gets me everything I want out of the box, it's just slightly more inscrutable.

I'm also planning on running a couple of ancillary "network service" type VMs on this machine:

pi-hole for DNS and network-wide ad blocking
Unifi controller for our existing Unifi gear (mostly APs, some switches)
Ingress nginx proxy (this probably deserves its own post)

Open vSwitch

One additional thing that I want to play with is Open vSwitch. This is a software network switch that lives inside Proxmox and ties everything together. It acts like an L3 hardware switch, just implemented entirely in software. It's optional within Proxmox but from what I've read it gives significantly better performance, which is aesthetically attractive if not strictly necessary. Nothing about this project is strictly necessary, though, so I feel justified.

What's next?

Set up a basic Open vSwitch configuration within Proxmox
Install VyOS and get it working as a basic router
When the quad port ethernet card arrives, install it and hook it up to the vSwitch and VyOS
Roll out to production!?

Progress Update 2021-11-25

I accomplished a couple of things last night and today:

I got a basic Open vSwitch config working within Proxmox! This was a bit of an ordeal because I installed a package that really shouldn't be installed, because apparently it breaks the entire network stack if you install it. So, protip, just do what the tutorial says and don't get fancy.

Here's the config, for posterity:

auto lo
iface lo inet loopback

# LAN interface, auto-tagged as VLAN-1
auto eno1
iface eno1 inet manual
    ovs_type OVSPort
    ovs_bridge vmbr0
    ovs_options vlan_mode=native-untagged tag=1

# WAN1 10GBase-T SFP+ Module, auto-tagged as VLAN-100
auto enp1s0f1
iface enp1s0f1 inet manual
    ovs_type OVSPort
    ovs_bridge vmbr0
    ovs_options vlan_mode=native-untagged tag=100

# Internal interface for the hypervisor itself attached to VLAN-1
auto vlan1
iface vlan1 inet static
    address 192.168.1.120/24
    gateway 192.168.1.1
    ovs_type OVSIntPort
    ovs_bridge vmbr0
    ovs_mtu 1500
    ovs_options tag=1

# Just one OVSBridge, the software equivalent of an L3 managed switch
auto vmbr0
iface vmbr0 inet manual
    ovs_type OVSBridge
    ovs_ports eno1 enp1s0f1 vlan1

OVS on Proxmox works like this:

Traffic comes into an OVSBridge through physical ports, represented by OVSPorts, as well as OVSIntPorts. These ports can have packets come in already tagged or tag them themselves (or both?)
Traffic to and from VMs transits via the virtual network adapters attached to each VM. These network adapters can be assigned to a VLAN or not. If not, they're assumed to be a trunk port (all VLANs).
The only things with IP addresses assigned are OVSIntPorts and the VM NICs.
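
Open vSwitch accepts only a fixed set of vlan_mode values (access, trunk, native-tagged, native-untagged, dot1q-tunnel), and a misspelling such as an underscore in place of a hyphen is easy to miss in an interfaces file. A small lint along these lines could catch it before a network restart; check_vlan_modes is a hypothetical helper, not part of Proxmox or OVS.

```shell
#!/bin/sh
# Reads an interfaces file on stdin, prints any unrecognized vlan_mode
# values with their line numbers, and exits 1 if it found one.
check_vlan_modes() {
  awk '
    {
      for (i = 1; i <= NF; i++) {
        if ($i ~ /^vlan_mode=/) {
          mode = substr($i, length("vlan_mode=") + 1)
          if (mode !~ /^(access|trunk|native-tagged|native-untagged|dot1q-tunnel)$/) {
            printf "line %d: unknown vlan_mode \"%s\"\n", NR, mode
            bad = 1
          }
        }
      }
    }
    END { exit (bad ? 1 : 0) }
  '
}

# Example:
#   check_vlan_modes < /etc/network/interfaces
```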

I set up two LXC containers, one for pi-hole and another for the Unifi console. After setting up pi-hole I went into the Edgerouter config and told it to send DNS traffic there, and boy howdy is it interesting how much trash the various things on the network are talking to.

The Unifi console was a bit of work. I installed it using the 5.14.23 script from here and then restored a backup from my remote console. After that I told the Edgerouter to broadcast the new console's IP as the inform URL, which mostly worked. I had to forget one AP from the old controller and re-adopt it on the new one, and one of the switches (which happens to be the basement switch that sits in the critical path between the house where I was sitting and the office where the servers are) needed some hand holding, i.e. SSH'ing into it and running set-inform manually.

Tonight I set up Proxmox to relay email through Postmark with this tutorial and set up a scheduled weekly snapshot of all the VMs on the machine.

Progress Update 2021-11-30

Over the past few days I've tackled a couple router-adjacent mini projects.

10G Connection

As I described above, I want to have a fast connection between my primary VM host and my network services host. My first attempt at this was to cross-connect the two machines with a pair of SFP+ network cards and a DAC cable, but unfortunately no matter what I did I couldn't make this work. I ended up buying a Mikrotik CRS305 4 SFP+ port, 1 gigabit port switch and another DAC, and then waiting for it to show up.

Setting up the CRS305 was a little bit fascinating. It can power itself in a couple of ways, including with an included 12V wall wart and PoE over the gigabit port. The PoE option would have been perfect for my network so that's what I tried first, but a ridiculous thing happened as soon as I tried to connect a DAC cable to it. Every time a powered DAC touched an SFP+ port on the switch OR if a DAC plugged into the switch touched an SFP+ port or any other metal port on the computer, the switch immediately reset itself. If I tried to power the switch up after both sides were plugged in it wouldn't power up.

After exploring this for a good 45 minutes I gave up on PoE and tried the wall wart which of course immediately worked. A friend who heard this story has speculated that it could be a grounding issue and recommended an outlet tester. When it arrives on Friday I'll test the outlet and hope that I don't have to go repair it in just about the most awkward spot in the house.

Anyway, after all of that I got both servers hooked up with DACs to the Mikrotik and back to the rest of the network through the gigabit port. Performance is... kind of weird? I would have expected ~10Gbps with iperf3 between the machines but I was only getting ~7.7Gbps. Not sure what's up with that but it's fast enough that I don't really care at the moment.

Backup Internet

One aspect of this project that I've been pretty excited about is having transparent LTE backup internet. Comcast here is pretty unreliable, with momentary outages throughout the day and occasional hours long outages throughout the year. An LTE modem set up as failover would mean fewer interruptions and less chance of an important Zoom meeting getting thrown off course.

My original plan was to purchase a cheap Android phone with a prepaid LTE plan on Verizon's network, transplant the SIM into my Netgear LB1120 LTE modem, and hook it up as failover. The backup plan was to USB tether the phone directly to the router.

Surprise! Neither of these plans actually work.

Transplanting the SIM initially seemed to work, but after some time on the network the modem started doing this awful thing where it would cycle the ethernet port every 30 seconds or so. My current theory is that the modem's IMEI has been hellbanned from the Verizon network for not being a compatible piece of equipment. The additional fun part of this plan is that I needed another ethernet port on the machine so I bought a quad port card and waited for it to show up, and now that's just kind of sitting useless.

The backup plan for backup internet also doesn't work for two similarly annoying reasons. First, VyOS 1.3 doesn't seem to have the right drivers for USB tethered Android phones. I'm not sure if I'm just holding it wrong or what, but when I set up an Ubuntu 20.04 VM it got a working connection right away.

That connection isn't usable for my purposes as it only has a working IPv6 address. The IPv4 address seems to not route at all.

The backup to the backup plan, at this point, is to buy a new Netgear LM1200 modem which is specifically advertised as being compatible with Verizon. That is going to wait a bit because at this point I've spent more money than I wanted to and backup internet is not a project completion requirement.

Current status:

DHCP and DNS moved from the ER-X-SFP to a pihole LXC container
UniFi controller moved from an offsite VM to an LXC container
Quad port card installed
Open vSwitch configured well enough to get the router up and running
VyOS VM set up
- trunked (all VLANs) connection to OVS and VLANs split out as virtual interfaces
- WAN-LAN and WAN-LOCAL firewalls set up
- SNAT from VLAN-1 (the rest of the network) to VLAN-1000 (the WAN connection)

Next steps:

Take a pre-announced downtime to move the cable modem and ER-X-SFP to the office
During that downtime, transfer the gateway IP from the ER-X-SFP to the VyOS VM
Do a bunch of testing to make sure it all works
Order a new LTE modem I guess

Progress Update 2021-12-10

I think this project is almost done. I've had it in production for about a week now without any major problems, but getting it there had a number of bumps.

LTE USB tethering does work, actually

I finally got USB tethering to work. Instead of passing the USB device through I created a USB ethernet device on the hypervisor and joined that device to the OVS bridge. After that it came right up in the VyOS VM and I was able to use it as a WAN device.

After I figured that out I scheduled a downtime for a morning where I knew no one else would be in the house.

OVS + Comcast = no DHCP!?

My downtime plan was pretty simple:

1. Move the modem and ER-X-SFP from where they were to where they needed to be, next to the new router machine in my office
2. Bring the ER-X-SFP back up connected to the Comcast modem, verify connectivity
3. Change the ER-X-SFP static IP to something other than 192.168.1.1
4. Change the VyOS VM static IP to 192.168.1.1
5. Power down the Comcast modem, move it to the router machine, and power it back up
6. Verify connectivity

I got all the way through to step 6 and then encountered an infuriating problem. The VyOS machine wasn't getting an IP address. I could see it making DHCP discover requests on the interface with tcpdump but the modem would just never respond.

After googling and futzing and sitting on the LTE connection for basically the entire day, way past the outage window, I finally figured it out. The problem was the OVS bridge. For whatever reason it wasn't allowing DHCP multicast packets through. I don't know if the problem is inherent to OVS or if there's a setting somewhere and at this point I don't care.

After creating a new non-OVS linux bridge, attaching the WAN interface, and attaching that to the VyOS VM, suddenly I had an IP and connectivity.

Failover works... kind of

My big reason for wanting failover is that when Comcast goes down VMSave goes down. This is aesthetically vexing. Someone finds out about VMSave, tries to use it, and it doesn't work or they can't even get to the page. It adds to people's grief, exactly the opposite of what I intend for it to do.

The LTE tether was working great, except that the nginx proxy running in Fly that sits in front of VMSave wasn't able to connect back to the server. Or rather, it could connect because I could ping both sides across the wireguard tunnel, but I couldn't pass any other kind of packets.

After a few minutes with tcpdump I discovered that something somewhere was corrupting checksums in the packets transiting the wireguard tunnel. I could see the request and VMSave would start responding but the connection would hang after a single packet.

Done for now

After seeing that and thinking through the implications of a triple NAT (CGNAT, Android kernel, VyOS) I decided to send the phone back and order a new Verizon-certified LTE modem. Amazon's troubles earlier this week have delayed this portion but I'm willing to call the project done enough for now. I have low ping times and hourly speed tests showing greater than 1Gbps the majority of the time so I'm happy.

Tiller, Ledger, and Sorbet

2020-05-21T00:00:00+00:00

Tiller + Ledger

Thirteen years ago I started tracking my finances using a tool named Ledger. Up until 2018 I hand-entered every penny into my ledger files, which absolutely had value but eventually I decided that I wanted to automate things as much as I could.

I happened upon Tiller which scrapes bank accounts and puts the data into a Google spreadsheet. Importantly, Tiller adds a unique ID to every transaction it sees, which means if I want to automate something I don't have to try to implement deduplication.

Back in 2018 the script I used was rough, but I've polished it over the years and, just the other day, published the guts as LedgerTillerExport.

The gem consumes a set of reconciliation rules and a Google spreadsheet ID and produces a set of ledger transactions. Rules are given a row from the spreadsheet and return the correct account name for that transaction. For example, I can create a rule like this:

rule = LedgerTillerExport::RegexpRule.new(
  match: /Kroger/i,
  account: 'Expenses:Food:Groceries',
)

This rule looks for /Kroger/ in a Tiller payee line and says that that is always the Expenses:Food:Groceries expense account, like this:

2020-05-21 * Kroger
   ; tiller_id: 5323ch323466234c3467
   Expenses:Food:Groceries             $150.00
   Liabilities:CreditCard

I can create custom rules that do more complicated things than just a regular expression match. There's a rule in the readme that shows how I reconcile checks, for example.

Where does this tiller_id thing come in, you ask? LedgerTillerExport generates a list of known tiller_ids by querying ledger like this:

ledger --register-format='%(tag("tiller_id"))\n' reg expr 'has_tag(/tiller_id/)'

This extracts the value of the tiller_id tag for every transaction that has one applied. In Ruby we then split the value on commas because I have a bunch of transactions where I've collapsed multiple Tiller rows into one Ledger transaction by hand.
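
The splitting step could look something like this in plain Ruby. This is a sketch, not the gem's actual code: known_tiller_ids is a name I made up for illustration, and the sample output is invented.

```ruby
require 'set'

# Hypothetical helper: turn the output of the ledger command above into a
# Set of known Tiller IDs. Each output line holds the tiller_id tag value for
# one transaction, which may be several comma-separated IDs.
def known_tiller_ids(ledger_output)
  ledger_output
    .each_line
    .flat_map { |line| line.split(',') }
    .map(&:strip)
    .reject(&:empty?)
    .to_set
end

# Made-up sample output: the second transaction collapses two Tiller rows.
sample = "5323ch323466234c3467\nabc123, def456\n\n"
ids = known_tiller_ids(sample)

# A spreadsheet row whose ID is already in the set can be skipped.
puts ids.include?('def456')
```

Using a Set keeps the "have I seen this row before?" check cheap no matter how many transactions are in the ledger file.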

Sorbet

Ok, so, that's interesting, but I also want to talk about Sorbet.

I started working at Stripe almost a year ago and met the Sorbet type checker on my first day. Despite a few warts I've come to adore this way of working with types in Ruby. Both LedgerTillerExport and LedgerGen, my library for building ledger transactions, are built using Sorbet.

My favorite thing in Sorbet is T::Struct. This lets you define a typed record that you can then pass around to functions and serialize to json. For example, here's a struct from LedgerTillerExport:

class Row < T::Struct
  extend T::Sig

  const :txn_date, Date
  const :txn_id, String
  const :account, String
  const :amount, Float
  const :description, String

  sig {params(row: T::Hash[String, T.nilable(String)]).returns(Row)}
  def self.from_csv_row(row)
    new(
      txn_date: Date.strptime(T.must(row["Date"]), "%m/%d/%Y"),
      txn_id: T.must(row['Transaction ID']),
      account: T.must(row['Account']),
      amount: T.must(row["Amount"]).gsub('$', '').gsub(',', '').to_f,
      description: T.must(row['Description']).gsub(/\+? /, '').capitalize,
    )
  end
end

We create Rows from the Tiller spreadsheet's CSV rows. Every row consists of five fields, all defined as const which guarantees that nothing can change those fields once we've called new.

We can then pass a Row instance around in our program and lean on the static typechecker and runtime to ensure that we're using it correctly everywhere.

My only problem with T::Struct is that you can't subclass one due to limitations in the typechecker. If you want the prop/const behavior but you don't necessarily care about the other guarantees that T::Struct gives you, you can either subclass T::InexactStruct or include a few modules:

class NotQuiteAStruct
  include T::Props
  include T::Props::Constructor

  prop :something, String
  const :something_else, String
end

NotQuiteAStruct.new(something: 'abc', something_else: 'def')

I use this in a couple places in LedgerTillerExport, namely for RegexpRule and Exporter to make them easily subclassable.

There are lots of other things to like about Sorbet: method signatures, a super fast typechecker, and so on.

If you follow the rules, Sorbet eliminates entire classes of tests that you would otherwise have to write to guarantee your program is correct.

Using Que instead of Sidekiq

2019-03-13T00:00:00+00:00

A project I've had on the back burner for quite a while is my own little marketing automation tool. Not that existing tools like Drip or ConvertKit aren't adequate, of course. They do the job and do it well.

I enjoy owning my own infrastructure, however, and after Drip changed direction and raised prices I found myself without a home for my mailing list. I thought, why not now?

One vital component of any broadcast email system is fanout, where you merge the message you want to send with the list of people that should receive it. The easiest way to fan out is to just loop over the list of recipients and enqueue a job for each:

Contact.not_opted_out.each do |contact|
  BroadcastMessageDeliver.perform_async(contact.id, the_message.id)
end

This is simple and works great. However, it's not super efficient. We can do better.

If we're using Sidekiq we can use push_bulk:

Contact.not_opted_out.find_in_batches do |batch|
  # push_bulk looks up string keys, so use 'class' and 'args' rather than symbols
  Sidekiq::Client.push_bulk(
    'class' => 'BroadcastMessageDeliver',
    'args' => batch.map { |c| [c.id, the_message.id] }
  )
end

The find_in_batches call is a built-in ActiveRecord method that gives you all of the records in the scope in batches, each of which is just an array of ActiveRecord objects. Sidekiq::Client.push_bulk eliminates the vast majority of the Redis round trips that the simple version makes because it pushes the whole batch in one Redis call.
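
To put a rough number on "the vast majority", here's a back-of-the-envelope sketch in plain Ruby. It doesn't touch Sidekiq or Redis at all. It just counts how many pushes each approach would make for 5000 contacts, assuming find_in_batches' default batch size of 1000. The message id is a placeholder.

```ruby
contact_ids = (1..5000).to_a
message_id = 42 # placeholder message id

# Looping: one push, and so one round trip, per contact.
loop_round_trips = contact_ids.length

# Batching: one push per batch. 1000 is find_in_batches' default batch size.
batches = contact_ids.each_slice(1000).map do |batch|
  batch.map { |id| [id, message_id] }
end
batch_round_trips = batches.length

puts "loop: #{loop_round_trips} round trips"
puts "batches: #{batch_round_trips} round trips"
```

Each batch still carries the same job arguments, so the amount of data sent is about the same. What goes away is the per-job network latency.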

We can still do better, though. Instead of using Sidekiq we can use Que. Que is a background processing system like Sidekiq that keeps jobs in a PostgreSQL table instead of in a Redis list. It uses PostgreSQL's native listen/notify system to make job starts basically instantaneous, rather than polling like DelayedJob does.

Using the database as the queue has a number of advantages over systems that use two data stores. In particular, ACID guarantees and atomic backups are important to me because I'm running this all myself. The fewer moving parts the better.

The other thing you can do is insert directly into the que_jobs table:

ActiveRecord::Base.connection.execute(%Q{
  INSERT INTO que_jobs (job_class, args)
  SELECT
    'BroadcastMessageDeliver' as job_class,
    -- same argument order as perform_async above: contact id, then message id
    jsonb_build_array(x.id, #{the_message.id}) as args
  FROM
    (#{Contact.not_opted_out.select(:id).to_sql}) x
})

The que_jobs table is just a database table, which means you can insert into it however you want. For example, Que::Job.enqueue just creates a record and saves it. It doesn't use any ActiveRecord hooks at all.

We can eliminate almost every round trip and application-level loop by letting the database do all the work.

Benchmarks (local Redis and local PostgreSQL, 5000 records):

Sidekiq loop: 1.9 seconds
Sidekiq batches: 0.3 seconds
Que direct insert: 0.7 seconds

Wait... that's... slower?

I'm as surprised as you are, but there turns out to be a pretty good reason. Que performs a bunch of check constraints on the incoming data to make sure it's coherent and ready to run. Here's all the things it checks:

Check constraints:
    "error_length"
    "job_class_length"
    "queue_length"
    "valid_args"
    "valid_data"

valid_data in particular does a handful of expensive-ish operations on the incoming json data.

So I guess the lesson here is to always validate your assumptions. I assumed that eliminating round trips would make things faster but because of other constraints and validations it's actually slower.

Still, it's considerably faster than the simple version (which is still no slouch, let's be honest), my marketing system gets all those in-database queue benefits, and I find it aesthetically pleasing. I think I'll keep it.

Automatic Investing

2019-02-19T00:00:00+00:00

The Setup

Fidelity Brokerage Account ("brokerage") with margin enabled
Fidelity Roth IRAs
Fidelity Solo 401K
Fidelity Four-In-One Index Fund (FFNOX)

How Money Flows

Every investable dollar is in FFNOX across all account types
Automatically purchase a fixed amount of FFNOX in the brokerage account every pay period
Automatically contribute to my employer's 401K plan every pay period
Once per year fund our Roth IRAs
Once per year fund our Solo 401K (if possible/advantageous)

Why

I don't want anyone to have to think about where to pull money from at any time. I want me or my wife to be able to login to Fidelity and sell enough to cover cash needs with a very small number of clicks.

FFNOX is a fund of funds consisting of four inexpensive Fidelity index funds. It invests in 60% US total stock market, 25% international developed total stock market, and 15% US total bond market. This fits our family's desired asset allocation.

The brokerage account has margin enabled. Margin allows you to borrow up to 50% of the value of your investable assets (everything but cash and CDs) from your broker for any purpose whatsoever. It kicks in when you run out of cash and will automatically pay itself back when you deposit cash in the account.

We have margin turned on so that we don't have to worry about selling investments to raise cash while something awful is happening. We can log in to Fidelity and sell some FFNOX when it's convenient rather than having to do it on some kind of schedule.

We have automatic investing turned on so that I don't have to make a decision to purchase twice a month. I'm frequently tempted to mess with the program but my own lived experience and that of countless others suggests that the more hands off I can be the better off I'll end up.

Contributing to my employer's 401K plan is an automatic tax break and also means I get my employer's matching contribution. I have it set up to max out my contribution space ($19,000 for 2019) by dividing that by the number of pay checks in the year and setting a fixed dollar contribution.

We fund the IRAs and Solo 401Ks once a year just for practicality.

We're right on the bubble where sometimes we have to do what's called a "backdoor" Roth IRA contribution and that's complex enough that I only want to deal with it once a year.

The amount we can contribute to the Solo 401K depends on how much self-employment income I earned during the year. It only makes sense to calculate at tax time.

We don't use robo-investors. FFNOX's total expenses are capped at 0.08%, or $80 per $100,000 annually. Wealthfront charges 0.25% (more than 3x) on top of the fees for the actual ETFs (typically ~0.1%). Robo-investors are never going to offset their fees when compared to FFNOX or similar funds (Target Retirement funds at Vanguard or Freedom Index funds at Fidelity).
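
Here's the fee arithmetic spelled out as a quick Ruby sketch. It uses the rates quoted above. The $100,000 balance is just a convenient round number, and annual_fee is a throwaway helper, not anything from a library.

```ruby
# Annual cost in dollars of holding `balance` at a given expense ratio,
# where the ratio is expressed as a percentage.
def annual_fee(balance, percent)
  (balance * percent / 100.0).round(2)
end

balance = 100_000
ffnox = annual_fee(balance, 0.08)
# Advisory fee plus the approximate fee of the underlying ETFs.
robo = annual_fee(balance, 0.25 + 0.1)

puts "FFNOX: $#{ffnox} per year"
puts "Robo-investor: $#{robo} per year"
puts "Difference: $#{(robo - ffnox).round(2)} per year"
```

The gap scales linearly with the balance, so it only gets bigger as the account grows.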

Automatic Insurance

2019-02-19T00:00:00+00:00

The Setup

$1.5 million of term life insurance on me
A smaller amount of term life insurance on my wife
A private disability insurance policy on me that pays $5,000/mo if I can't program computers anymore, for up to 2 years
Employer sponsored short and long term disability insurance
Employer sponsored health insurance
Auto, home, and umbrella insurance

Why

My future income is currently my family's largest asset.

Term life protects my family if I die before the kids go to college. Disability protects them if I'm incapacitated but don't die. Health protects our assets from medical bankruptcy. Auto, home and umbrella protect our assets from car crashes, fires, and slip and fall incidents.

We own term life insurance because it's the simplest, cheapest possible product. If you die they pay. If you die before 2 years have passed they'll dig into your application and make sure you didn't lie or omit anything, but that's the only wrinkle. Never buy whole, universal, or variable life insurance. It combines insurance with investing on very bad (for the customer) terms.

I have both employer sponsored and private policies because my health history makes it difficult and expensive to get the amount of insurance that we need. The combination, while a little more complicated, helps me sleep easier.

Automatic Finances

2019-02-19T00:00:00+00:00

My friend Amy Hoy recently tweeted about financial planning and personal finance. This particular tweet stuck out to me:

you'd probably think given my love of money that i'd be all over financial planning, but truly i only enjoy earning it and spending it. saving it feels like… freezing cake. i guess you could do it, but why, you have cake
— Amy Hoy ✨ (@amyhoy) February 19, 2019

Last year I radically simplified my family's personal financial system and made everything as automatic as possible. Amy and Joel Hooks asked me to write up how it works, so this is the start of a series of short posts about how and why I set everything up.

A Little Bit of Why

I prefer to think of myself as a realist. Due to my health history my wife is likely going to be around longer than me. Her family has some very long lived female members as well. Her grandmother is 103 and her great aunt just passed in 2018 at the ripe old age of 95.

I want to make our finances as simple as possible so she doesn't have to worry about them when the inevitable happens.

In 2016 my wife and I welcomed our first child into the world and in late 2018 we welcomed our second. They are two more very important reasons why I want things to be simple. If something happens to both my wife and me, I want our intentions with regards to our finances as plain as possible.

This system got a trial run in late 2018. My wife was admitted to the hospital at 29 weeks pregnant for preeclampsia, a very dangerous condition that needs close monitoring. My daughter was born at 34 weeks and spent the next five weeks in the NICU.

I didn't have to touch this system at all. Not once. I logged in a handful of times to check up on it, but everything just hummed along.

Automatic Cash Management

2019-02-19T00:00:00+00:00

The Setup

One Fidelity Brokerage Account ("brokerage")
One Fidelity Cash Management Account ("CMA")
One Fidelity 2% Cash Back Credit Card

How Money Flows

Payroll direct deposited into brokerage
All non-mortgage, non-Amazon expenses are paid with the Fidelity credit card
Bills autopay from brokerage (credit cards, insurance, billpay to household vendors)
Debit cards and live paper checks written against CMA

Why

We use a brokerage account because it lets us keep cash and investments in the same account. All of our cash, including working capital and reserves, sits in the brokerage's core position. Our core position is FZFXX, a Federal money market fund that pays ~2% interest.

We use the Fidelity credit card because it pays 2% cash back when it's set up to deposit rewards into a Fidelity account. It is also one of the only cards I've seen that can be set to automatically cash out deposits. Ours is set to deposit into the brokerage account.

We use the Amazon Prime credit card for all Amazon expenses. This pays 5% back on Amazon purchases which makes it worth it for us. Your mileage may vary.

We have the CMA so we don't expose the brokerage account number every time we write a paper check. This is probably overly paranoid and is the only significant complication in the entire system.

The CMA can optionally have "self-funded overdraft protection" turned on, which would automatically transfer from the brokerage account into the CMA to fund checks and debit card transactions. We don't have this turned on, again mostly for paranoia. We make so few transactions like this that it's no problem to top up the account every few months.

The brokerage core position is not FDIC insured. I don't care about FDIC insurance. FZFXX is composed of ultra short term US Treasury bills. If Treasuries are suddenly not liquid enough to withdraw our money our society has much bigger problems.

The CMA's core position is FDIC insured, and the CMA is almost a full brokerage account, but we don't use it as the centerpiece account for two reasons. First, the CMA core position pays shit for interest. Second, the CMA cannot have margin turned on. We'll talk about why that's important in the next post.

Using Let's Encrypt Without certbot

2018-09-13T00:00:00+00:00

In my last post I talked about what a CDN is and why you might want one. To recap, my goal is automatic, magical DNS/SSL/caching management. Today we're going to talk about one aspect of this project: HTTPS and SSL.

SSL, or Secure Sockets Layer, is the mechanism web browsers use to secure and encrypt the connection between your computer and the server that is serving up the content you're looking for.

A few years ago browser vendors started getting very serious about wanting every website to be encrypted. At the time, SSL was expensive to implement because you needed to buy or pay to renew certificates at least once a year.

Almost simultaneously with this increased need for encryption, organizations including the Electronic Frontier Foundation and the Mozilla Foundation started a new certificate authority (organization that issues certificates) named Let's Encrypt. Let's Encrypt is different because it issues certificates for free with an API.

Most people use a tool named certbot that automates the process of acquiring certificates for a given website. However, certbot doesn't really work for my purposes. I want to centrally manage my certificates and copy them out to my CDN nodes on a regular basis, which means I need to use the DNS challenge type. certbot's support for the DNS challenge isn't really adequate for my needs.

Challenge Types

Let's Encrypt uses challenges to verify that you own the domain that you're trying to acquire a certificate for. Currently there are two different challenge types, http-01 and dns-01.

For http-01, you simply create a file within a well-known directory structure within your website containing a challenge string that the API gives you. Then you tell Let's Encrypt to go look for it. If the file is there and contains the correct challenge string, Let's Encrypt will give you a certificate.

dns-01 works much the same way, except instead of creating a file you create a TXT record for your domain. Let's Encrypt will ask your domain's DNS servers for the value of the TXT record, and if it matches what it expects, you get a certificate.
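
To make the two challenge types a little more concrete, here's a sketch of where each challenge lives and what it contains, as I understand the ACME spec (RFC 8555). In practice the client library works all of this out for you. The token and thumbprint below are placeholders, and the helper names are mine rather than anything from a library.

```ruby
require 'digest'

# Key authorization: the challenge token joined to the account key's
# thumbprint with a dot.
def key_authorization(token, thumbprint)
  "#{token}.#{thumbprint}"
end

# http-01: the path the file is served from and what it contains.
def http01_challenge(token, thumbprint)
  {
    path: "/.well-known/acme-challenge/#{token}",
    content: key_authorization(token, thumbprint)
  }
end

# dns-01: the TXT record name and value for a domain.
def dns01_challenge(domain, token, thumbprint)
  digest = Digest::SHA256.digest(key_authorization(token, thumbprint))
  {
    name: "_acme-challenge.#{domain}",
    type: 'TXT',
    # URL-safe base64 of the SHA-256 digest, with the padding removed.
    value: [digest].pack('m0').tr('+/', '-_').delete('=')
  }
end

# Placeholder values. Real ones come from the API and your account key.
token = 'example-token'
thumbprint = 'example-thumbprint'

p http01_challenge(token, thumbprint)
p dns01_challenge('example.com', token, thumbprint)
```

Either way the verifier is checking the same thing: that whoever holds the account key can also publish content under the domain.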

http-01 has the advantage of being really simple and easy to use with the certbot tool and whatever web server you happen to have. However, with multiple servers in the mix it can get tricky to make sure that every server has a certificate without hitting Let's Encrypt's rate limits.

That's why I'm using dns-01. I can easily drive the API from the central management node and copy the certificates out to all of the CDN nodes simultaneously.

How ACME Works

I use a gem called acme-client to drive Let's Encrypt's ACMEv2 API. Once you know ACME's terminology it's easy to use.

An order is the initial request to generate a certificate for one or more domain names
An authorization is LetsEncrypt's response to the order. It contains one or more challenges for each domain name in the order.
After setting up the challenges with either http-01 or dns-01, you then request_validation. LetsEncrypt tries to verify that you were able to successfully install the challenges.
Finally, after LetsEncrypt has seen the validations in the wild, you send a Certificate Request (csr). LetsEncrypt responds with a properly signed certificate, valid for all of the domain names that you verified and sent with your csr.

Getting a Certificate, End to End

Step 1: Sign up for an account

The first thing we need to do is sign up for a LetsEncrypt account. Accounts are identified with a private key and an email address.

require 'acme-client'
require 'openssl'

key = OpenSSL::PKey::RSA.new(4096)
client = Acme::Client.new(
  private_key: key, 
  directory: 'https://acme-staging-v02.api.letsencrypt.org/directory'
)

account = client.new_account(
  contact: "mailto:you@example.com",
  terms_of_service_agreed: true
)

Step 2: Generate an Order

Next, let's start the process of getting a certificate. The first thing we do is build an order from a set of domain names.

order = client.new_order(identifiers: ['example.com'])

The order contains one authorization per identifier, and each authorization offers one challenge per challenge type. We only care about the dns challenge type.

authorization = order.authorizations.first
label         = '_acme-challenge.example.com'
record_type   = authorization.dns.record_type
value         = authorization.dns.record_content

Step 3: Set the value in Route53

I use AWS' Route53 service to host my DNS records for a variety of reasons. That means we now have to set a record in Route53.

First, we need to set up a client and find the zone we want to update:

require 'aws-sdk'
route53 = Aws::Route53::Client.new(region: 'us-east-1')
zone = route53.list_hosted_zones(max_items: 100)
              .hosted_zones
              .detect { |z| z.name == 'example.com.' }

Next, we generate an UPSERT to create or update the record:

change = {
  action: 'UPSERT',
  resource_record_set: {
    name: label,
    type: record_type,
    ttl: 1,
    resource_records: [
      { value: value }
    ]
  }
}

options = {
  hosted_zone_id: zone.id,
  change_batch: {
    changes: [change]
  }
}

route53.change_resource_record_sets(options)

Step 4: Wait for DNS to populate

Route53 takes some time to push your changes out so now we have to wait. We also have to wait for all of the DNS servers that service the zone to return with the correct value because LetsEncrypt will pick one randomly to ask for the challenge.

Let's write a loop to wait for us. First we need to get the list of nameservers for the zone:

require 'resolv'

nameservers = []

Resolv::DNS.open(nameserver: '8.8.8.8') do |dns|
  while nameservers.length == 0
    nameservers = dns.getresources(
      'example.com', 
      Resolv::DNS::Resource::IN::NS
    ).map(&:name).map(&:to_s)
  end
end

This uses Ruby's built-in DNS resolver library named Resolv to ask Google's public DNS server what nameservers are set up for example.com.

Next, we have a function that asks those nameservers for the challenge value:

# label and value are passed in because a method can't see the local
# variables we set up in step 2.
def check_dns(nameservers, label, value)
  valid = true

  nameservers.each do |nameserver|
    begin
      records = Resolv::DNS.open(nameserver: nameserver) do |dns|
        # Ask for the challenge label, not the bare domain
        dns.getresources(
          label,
          Resolv::DNS::Resource::IN::TXT
        )
      end
      records = records.map(&:strings).flatten
      valid = records.include?(value)
    rescue Resolv::ResolvError
      return false
    end
    return false if !valid
  end

  valid
end

while !check_dns(nameservers, label, value)
  sleep 1
end

This again uses Ruby's built-in Resolv library to get a list of values. In this case we're asking for all of the TXT values that we set up with the Route53 upsert earlier.

We loop over each nameserver and ask if the value is what we're looking for. If it isn't we bail out early because we need all of the nameservers to have the correct value.
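
The "every nameserver has to agree" rule is easy to try out on its own if you separate it from the actual DNS lookup. This is a sketch of that idea, not part of the real script. The lookup argument is something I'm introducing here so the logic can run without a network, and the nameserver names and challenge value are made up.

```ruby
# Returns true only if every nameserver reports the expected TXT value.
# `lookup` is any callable that takes a nameserver and returns the TXT
# strings it currently serves.
def all_nameservers_ready?(nameservers, expected, lookup)
  # all? stops at the first nameserver that fails, which is the early bail out.
  nameservers.all? do |nameserver|
    lookup.call(nameserver).include?(expected)
  end
end

# Fake DNS answers: ns2 has not picked up the change yet.
answers = {
  'ns1.example.net' => ['challenge-value'],
  'ns2.example.net' => []
}
lookup = ->(nameserver) { answers.fetch(nameserver, []) }

puts all_nameservers_ready?(answers.keys, 'challenge-value', lookup)

# Once ns2 catches up, the check passes.
answers['ns2.example.net'] = ['challenge-value']
puts all_nameservers_ready?(answers.keys, 'challenge-value', lookup)
```

In the real thing the lookup would be the Resolv call, with a rescue that treats a resolver error as "not ready yet".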

Step 5: Request Validation

Finally, after verifying that DNS has the correct values set, we tell LetsEncrypt to validate our challenges. If we had just asked for verification immediately after the upsert LetsEncrypt would have failed the order and there's no way to restart it or ask for them to check again. You get one validation per order and if you fail you have to start all over.

authorization.dns.request_validation

while true
  authorization.dns.reload
  if authorization.dns.status == 'pending'
    sleep(2)
  else
    break
  end
end
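
One thing to be aware of: the loop above never gives up if the status stays pending. Here's one way to put a ceiling on it. This is a generic sketch, and wait_while_pending is a helper I made up for illustration. The example run uses a canned list of statuses instead of talking to the API.

```ruby
# Calls the block until it returns something other than 'pending' or the
# attempt budget runs out. Returns the final status, or nil if we gave up.
def wait_while_pending(max_attempts: 30, interval: 2)
  max_attempts.times do
    status = yield
    return status unless status == 'pending'
    sleep(interval)
  end
  nil
end

# Simulated challenge that is pending twice and then valid.
statuses = %w[pending pending valid]
result = wait_while_pending(interval: 0) { statuses.shift }
puts result
```

With the real client the block would reload the challenge and return its status. A nil result means it's time to give up on this order and start a fresh one.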

Step 6: Send a CSR and receive the certificate

Finally, after validation completes we can actually request a certificate.

cert_key = OpenSSL::PKey::RSA.new(4096)
csr = Acme::Client::CertificateRequest.new(
  private_key: cert_key, 
  names: ['example.com']
)

order.finalize(csr: csr)

# Reload the order on each pass so we actually see the status change
while order.status == 'processing'
  sleep(1)
  order.reload
end

puts cert_key.to_pem.to_s
puts order.certificate

The acme-client library comes with a handy Acme::Client::CertificateRequest wrapper that takes care of building a CSR exactly how LetsEncrypt wants to see them, so all we have to fill in is the list of domain names we want the certificate to apply to. After a short wait LetsEncrypt will return the bright shiny new certificate in order.certificate.

Wildcard Wrinkle

The above is great if you want to list out every domain name that you want the certificate to apply to. LetsEncrypt recently added support for wildcard certificates, though, which are very useful but have one additional wrinkle.

Wildcard certificates apply to all of the subdomains at a single level for a given pattern. Let's say you want your certificate to apply to these domain names:

example.com
www.example.com
mx.example.com
foobar.example.com
blah.foobar.example.com

Instead of listing all of these domains in the certificate request you can ask for a wildcard, like this:

example.com
*.example.com

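The wildcard applies to exactly one level of subdomain. mx.example.com will match but blah.mx.example.com will not. For the same reason blah.foobar.example.com from the list above isn't covered, so it needs its own entry or a *.foobar.example.com wildcard.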
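
If the single-level rule is hard to keep straight, here's a small sketch of the matching logic. It's my own illustration of the rule, not how Let's Encrypt or any browser implements it, and covered_by? is a made-up name.

```ruby
# True if `hostname` is covered by `pattern`, where a leading "*." stands in
# for exactly one label. Any other pattern has to match exactly.
def covered_by?(pattern, hostname)
  return pattern.casecmp?(hostname) unless pattern.start_with?('*.')

  suffix = pattern[1..-1].downcase # "*.example.com" becomes ".example.com"
  host = hostname.downcase
  return false unless host.end_with?(suffix)

  # Whatever is left in front of the suffix has to be a single label.
  label = host[0...-suffix.length]
  !label.empty? && !label.include?('.')
end

names = ['example.com', '*.example.com']
hosts = %w[example.com www.example.com mx.example.com blah.foobar.example.com]

hosts.each do |host|
  covered = names.any? { |name| covered_by?(name, host) }
  puts "#{host}: #{covered ? 'covered' : 'not covered'}"
end
```

This also shows why the root has to be listed separately. The wildcard on its own doesn't match the bare example.com.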

The wrinkle here is that LetsEncrypt will give you two challenges for the same domain name because it wants you to verify both the root and the wildcard. You can't easily set multiple TXT records for a given label in Route53, though, so you have to collapse them into one upsert:

change = {
  action: 'UPSERT',
  resource_record_set: {
    name: label,
    type: record_type,
    ttl: 1,
    resource_records: [
      { value: value_for_root },
      { value: value_for_wildcard }
    ]
  }
}

options = {
  hosted_zone_id: zone.id,
  change_batch: {
    changes: [change]
  }
}

route53.change_resource_record_sets(options)

This seems simple because it is. That didn't stop it from taking me about four hours to figure out, however :)

What is a CDN and why do I need one?

2018-08-30T00:00:00+00:00

In my earlier post I talked about how I'm building my own content delivery network (CDN) but I didn't really go into what a content delivery network even is or why someone would want such a thing. A little back story is probably in order.

What is a content delivery network?

A CDN is a set of computers distributed around the globe that all point back at the server where your website is actually hosted. The CDN computers (or nodes) run a piece of software called a proxy, which just grabs the content from your server and gives it to someone's web browser as if it were its own. Usually, but not always, the proxy caches the content (saves it locally to its own disk) so that the next web browser to come along doesn't have to wait for the origin server to respond. It just gets the saved content.

That's pretty much all there is to it. There's a standard way of telling a caching proxy (and a web browser, for that matter) exactly how long you want something cached and under what terms. Some commercial CDN offerings also let you write some code that executes at the cache itself, so you can do fancy things like modify requests on the fly before they hit your server.

That's great, but do I need one?

It depends! A CDN buys you nothing if your audience is all in one spot and that spot is close to your server. On the other hand, if your website has a global (or even semi-regional) audience it will benefit by having caches scattered around the globe that can respond to requests. Ultimately response time is limited by the speed of light. A web browser on one side of the planet will have to wait 130ms under the most ideal conditions to hear back from a server on the other side, just due to how long it takes light to travel there and back. That doesn't include any kind of processing or disk access or buffering or anything. If you have a cache nearby the viewer's experience will be that much better because your site will seem that much more responsive.
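
The speed of light number is easy to check with a few lines of Ruby. This sketch assumes the signal follows the surface of the planet to the exact opposite side and travels at the speed of light in a vacuum. Both constants are rounded textbook values.

```ruby
speed_of_light_km_s = 299_792.458   # in a vacuum
earth_circumference_km = 40_075.0   # around the equator, approximately

# The far side of the planet is half the circumference away, so a round trip
# covers the full circumference.
one_way_km = earth_circumference_km / 2
round_trip_ms = (2 * one_way_km / speed_of_light_km_s) * 1000

puts "Best-case round trip: #{round_trip_ms.round} ms"
```

Real networks do worse than this. Light in fiber travels noticeably slower than in a vacuum, and cables don't follow the shortest path.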

Sites that serve the same files to lots of people are also prime candidates for CDNs. Take Netflix for example. Netflix, at one time, accounted for over 20% of worldwide bandwidth usage. Their content consists of millions of tiny video files (each episode or movie is broken down into 15-30 second segments which are all encoded at a bunch of different bitrates to suit different speed connections). Netflix uses a huge fleet of servers that are all as close as possible to you, the customer, both to reduce the cost of transferring all those bits around and to make it faster for you to get to watching the latest episode of The Crown. Typically when you're accessing Netflix you're actually talking to a Netflix-owned server in your ISP's wiring closet, at least for the video content itself.

This site and the other sites I run don't get nearly that much traffic. I use a CDN because it takes care of boring, error-prone stuff for me. Right now most of my apps and sites live behind Amazon's CloudFront service because, for example, it automatically generates a free SSL certificate. Amazon renews it when it expires and generally lets me ignore SSL completely.

The CDN I'm building for myself is also going to take care of SSL via LetsEncrypt, which is 100% free. It's also going to automatically manage the IPs of each site (which AWS does not do) so I don't have to care about DNS. It'll just automagically happen.

That's really the goal. Automatic, magical DNS/SSL/caching management.

What's next?

My next post in this series will talk about the specific technologies that I'm using to power my CDN. Stay tuned!

Want more stuff like this? Sign up for my mailing list. I post everything there a week before I post it here.

My Own Private CDN

2018-08-20T00:00:00+00:00

Hosting my own CDN has long been a completely irrational goal of mine. Wouldn't it be neat, I'd think, if I could tweak every knob instead of relying on CloudFront to do the right thing? Recently I read this article by Janos Pasztor about how he built a tiny CDN for his website. This just proves to me that at least it's not an uncommon irrational thought.

Yesterday I decided to actually start building something. Even if it doesn't make it into production, I'll at least have learned something.

Technical Goals

Centrally manage all of the dozen or so sites that I run
Automatically generate and renew LetsEncrypt certificates, both for publicly-facing sites and my own private sites. This means using the dns-01 challenge instead of the easier-to-understand http-01 challenge.
Easily add new cache nodes with authenticated curl | sudo bash
Automatically reconfigure nginx on the cache nodes when certificates roll or sites change
Easily host sites anywhere, including the internet-inaccessible server in my basement
Stop paying so much for bandwidth. Transfer is $5/TB/mo from DigitalOcean vs $$$$ for CloudFront.

Additionally, I really want to learn how LetsEncrypt works. certbot is great but it is very much a black box to me. Command-line arguments in, certificates out. If I write my own management system I can actually learn how the guts work.
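As a taste of those guts, here's a minimal sketch of the one calculation at the heart of the dns-01 challenge as I understand it from the ACME spec (RFC 8555): the TXT record value is the unpadded base64url encoding of the SHA-256 digest of "token.thumbprint". This is not code from my project (which is a Rails app); the function name and the sample inputs are made up for illustration.

```python
import base64
import hashlib
from typing import Tuple


def dns01_txt_record(domain: str, token: str, account_thumbprint: str) -> Tuple[str, str]:
    """Return the (record name, record value) for a dns-01 challenge."""
    # Wildcard names are validated at the base domain.
    base_domain = domain[2:] if domain.startswith("*.") else domain
    name = "_acme-challenge." + base_domain
    # The key authorization joins the challenge token and the account key thumbprint.
    key_authorization = f"{token}.{account_thumbprint}"
    digest = hashlib.sha256(key_authorization.encode("ascii")).digest()
    # base64url without the trailing "=" padding.
    value = base64.urlsafe_b64encode(digest).rstrip(b"=").decode("ascii")
    return name, value


# Placeholder token and thumbprint; real ones come from the ACME server and your account key.
name, value = dns01_txt_record("*.example.com", "sample-token", "sample-thumbprint")
print(name, value)
```

Everything else in the dance (creating the order, publishing the record, telling the server to check, downloading the certificate) is API calls and waiting.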

Current Status

basic Rails app that knows about sites and proxies
creating or updating a site (re)generates a LetsEncrypt certificate for all of the domains that point at that site
wildcard domains are fully supported
authenticated endpoint that generates a zip file of all of the certificates and private keys

Next Steps

Automatic certificate refresh using something like Sidekiq Cron
Deploy onto the server in my basement on my ZeroTier network
Move all of my existing LetsEncrypt certbot crons into this system
Provision a POP by hand and then automate the steps to provision another one

If you'd like to follow along I put the project up on GitHub. I'll also be posting updates here as I go.

VMSave has made over 10,000 calls

2018-08-15T00:00:00+00:00

VMSave has made over 10,000 total calls as of yesterday. That's an astounding number to me. Originally VMSave was a tiny Ruby script that I made to save my wife's mom's outgoing voicemail message. With her encouragement I made it into a full-fledged service. I never thought it would get this big, and yet here we are.

Here are some stats:

8,690 completed recordings
5,363 emails sent
41,432 minutes of call time
$3,616 in donations to keep the service running
Dozens of emails and handwritten letters

I built VMSave to help people process their grief. Based on the feedback I’ve received and the numbers above, I think it’s worked.

An Open Licensing Organization for Open Source Software

2018-06-05T00:00:00+00:00

Mike Perham tweeted earlier today:

Could OSS use an ASCAP royalty model for funding developers? https://t.co/X2yKO8jYRQ
— Mike Perham (@mperham) June 5, 2018

ASCAP is a voluntary organization in the US (there are others in the US and multiple other organizations internationally) that sublicenses music to radio stations and other public performance places. Music creators register their works with ASCAP, music users pay an annual license fee to ASCAP and report back the music they play, and then ASCAP pays out the majority of that license fee to the creators whose music gets played.

The more I think about it, the more I'm convinced that this is the route we should take as an industry.

Here's how I think it would work. I'm going to refer to this theoretical copyright collective as OSCC (for Open Source Copyright Collective) throughout this document. It's a terrible name, I know, but we'll think of something better.

Membership

Membership would be open to anyone who has contributed to a piece of open source software that uses the OSCC license. Project maintainers would set up a CONTRIBUTORS file listing each contributor and how big of a royalty share they should get. Candidate membership automatically happens by being listed in a CONTRIBUTORS file, but you'd have to complete membership by setting up an account at the OSCC website.

Package maintainers would need to register their packages with OSCC as well, but that would be a once-per-package thing.

License and Fees

The OSCC license would be a variant of Apache 2.0, BSD, or MIT, with the additional clause that the rights granted by the license are also predicated on paying for annual OSCC license fees. Fees would be calculated based on some sort of sliding scale based on company size, company type, etc. Fees would be "all you can eat". I.e., you pay one set fee and you can use as much OSCC-licensed software as you want. We would need to carefully set things up such that companies don't just license through a penniless subsidiary, but ultimately this is based on good faith backed by a good legal team, just like ASCAP.

Monitoring

Monitoring would be pretty simple. Every licensee would upload their Gemfile/package.json/Pipfile/whatever to OSCC at least once a month. OSCC would comb through the uploaded files looking for registered packages and assign those packages usage credits weighted by how big the licensee's fee is (the licensee size).

Royalties

Packages would earn usage credits. Each month a package would earn one credit per licensee, multiplied by a size multiplier. So, a tiny company like egghead would have a size multiplier of 1, whereas a big company like Google might have a multiplier of 10,000.

We would then calculate the value of a credit by adding up all the credits earned that month and dividing the net monthly OSCC revenue by that total.

Example: OSCC earns $10,000 for the month after expenses. There are 1,000 total usage credits among all the packages, so one credit is worth $10. Sidekiq earned 100 credits for the month. Its total royalty is therefore $1,000 for the month.
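The same arithmetic as a small sketch. The function names are made up for illustration, and the "everything-else" entry just stands in for all the other registered packages so the totals match the example.

```python
from decimal import Decimal
from typing import Dict


def credit_value(net_revenue: Decimal, credits_by_package: Dict[str, int]) -> Decimal:
    """Value of one usage credit: net monthly revenue divided by total credits."""
    total_credits = sum(credits_by_package.values())
    if total_credits == 0:
        raise ValueError("no usage credits were earned this month")
    return net_revenue / total_credits


def monthly_royalties(net_revenue: Decimal,
                      credits_by_package: Dict[str, int]) -> Dict[str, Decimal]:
    """Each package's royalty is its credit count times the credit value."""
    value = credit_value(net_revenue, credits_by_package)
    return {name: value * credits for name, credits in credits_by_package.items()}


# 1,000 credits in total, 100 of them earned by Sidekiq.
credits = {"sidekiq": 100, "everything-else": 900}
royalties = monthly_royalties(Decimal("10000"), credits)
print(royalties["sidekiq"])
```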

Payout

Package maintainers could split their royalties however they want via the CONTRIBUTORS file. OSCC would deposit each contributor's share into an internal account at OSCC, and then once the contributor's total account value is greater than $100 they would get a check/direct deposit/wire/PayPal.
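Here's a rough sketch of how that split and the payout threshold could work. The share weights, the names, and the choice to hand leftover cents to the first listed contributor are all assumptions on my part; the only things taken from the proposal are the CONTRIBUTORS-based split and the "greater than $100" rule. Amounts are in whole cents to avoid floating point surprises.

```python
from typing import Dict

PAYOUT_THRESHOLD_CENTS = 100_00  # $100


def split_royalty(royalty_cents: int, shares: Dict[str, int]) -> Dict[str, int]:
    """Split a package's royalty by the share weights from its CONTRIBUTORS file."""
    total_shares = sum(shares.values())
    split = {name: royalty_cents * share // total_shares
             for name, share in shares.items()}
    # Rounding down can strand a few cents; give them to the first listed contributor.
    leftover = royalty_cents - sum(split.values())
    first = next(iter(split))
    split[first] += leftover
    return split


def settle(balances: Dict[str, int], deposits: Dict[str, int]) -> Dict[str, int]:
    """Add deposits to balances in place; return payouts for accounts over the threshold."""
    payouts = {}
    for name, cents in deposits.items():
        balances[name] = balances.get(name, 0) + cents
        if balances[name] > PAYOUT_THRESHOLD_CENTS:
            payouts[name] = balances[name]
            balances[name] = 0
    return payouts


balances: Dict[str, int] = {}
deposits = split_royalty(1000_00, {"maintainer": 90, "contributor": 10})
print(settle(balances, deposits))  # who gets paid this month
print(balances)                    # what stays in the internal accounts
```

In this run the maintainer's $900 goes out right away, while the contributor's $100 sits in their account until the next deposit pushes it over the line.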

This is just a really rough proposal. If anyone wants to actually work on this for real send me an email or tweet at me. I'd love to try to get something like this going. It'd be a massive undertaking, but I think ultimately it would be hugely beneficial for the open source community.

Notes on Shutting Down an S-Corp

2018-06-04T00:00:00+00:00

A little over a year ago I described how I have my businesses set up in My Miniature Corporate Empire. Between then and now quite a few things have changed, the biggest being that I'm now employed by egghead, which had previously been my primary consulting client.

Over the past few months it's become very clear to me that this W-2 gig is better in every practical way than how I had been operating. There's no management overhead and far less worry about whether the next paycheck is going to happen. The management overhead is still there, however, because the businesses still exist. They're still sucking down energy and occupying mindshare that could better be used for, oh I dunno, chilling with my family.

Therefore, I'm shutting them down. My remaining consulting client will switch me to 1099 status and the publishing business just keeps going with less overhead.

This post is a checklist of things that I have to do to shut down my two Michigan LLC S-corps. It may not entirely apply to yours, or there may be other steps that you need to take. Consult with your accountant if this is something you're thinking of doing.

Contract Assignment

First, I have to assign my consulting contract from Okapi LLC to myself personally. This is pretty simple, since there's no legal requirement for a form or anything, and my contract specifically allows assignment with notice and consent. I emailed my client, said "hey this is happening on this date" and asked for acknowledgement. Done.

I also need to figure out what happens with the copyrights for my books, websites, and code, but ultimately it should all come back to me. It's just a question of whether I need to do anything formal.

Redirecting Income and Expenses

Next, I need to redirect income and expenses. I opened a new bank account for my sole proprietorship and told PayPal, Stripe, and Dwolla to push payments there. The account comes with a debit card which I'll be using for all of my business expenses, which are pretty limited. Twilio, GitHub, AWS, and Heroku are the biggest offenders there.

Forms

Then there comes the glorious paperwork. Here are all the forms that have to get filled out and mailed in:

Articles of Dissolution for both LLCs. Fileable online with the state.
Forms 940 and 941. These tell the IRS to close my payroll accounts.
Forms W-2 and W-3. These tell the Social Security Administration how much I paid myself this year (not much).
Form 966. This tells the IRS about the dissolution of the S-corp.

In addition, I'll need to do a final tax return for the parent company. This should hopefully be straightforward since there wasn't much income and nothing tricky like health insurance or retirement contributions.

Closing Accounts

Once all of the above is settled I'll close each company's bank and credit card accounts. Not hard, but it'll involve a few phone calls and a visit to the bank. I expect this won't be happening until late summer because I don't want to do it until everything is settled.

Operating as a Sole Proprietorship Again

Our 2018 taxes will still be a tiny bit complicated because we'll have a simple K-1 from the parent business, but from 2019 forward it'll be smooth sailing. If for some reason I decide that consulting is something I want to do again full time, spinning up a new S-corp is simple now that I've done it a few times. I might even use Stripe Atlas next time, since it comes with a bunch of perks and they do almost everything for you.

There's a certain amount of mourning that goes along with this whole thing. I'm slowly acknowledging that things didn't turn out how I had planned, but also recognizing that that's ok. Change is good.

Why your SaaS application should support SAML

2018-04-29T00:00:00+00:00

Your SaaS application should support SAML (Security Assertion Markup Language) if you're at all interested in big fat contracts from large enterprise customers. And why is that?

One word: money. Large enterprise customers pay quite a lot of money for services that help them do their work with a minimum of fuss. They want to do as little management of your service as they can possibly get away with, preferably zero. If you can't make that happen, but your competitor can, guess who's not getting that big fat contract.

SAML is the technology that makes that happen. SAML came out in the early 2000s, long before OpenID and OAuth and JWT and all those other, more modern, hipper authentication protocols. SAML is a stodgy old goat based on XML and X.509 certificates, which you may be familiar with because that's what SSL uses as well. It's supported by everyone that matters in the enterprise space.

When you set up SAML for your customer you're offloading all of the user management to their centralized system. They cryptographically vouch for users that they send your way, which means all you have to do is find or create a user account for them and sign them in. No passwords, no email verification, no nothing. It's great for your customer because they get to manage everything on their end. It's great for you because you don't have to deal with any support requests related to passwords or usernames.
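Here's roughly what the "find or create a user" half looks like on your side, as a sketch. It assumes a SAML library has already checked the signature on the customer's response and handed you the result; VerifiedAssertion, User, the displayName attribute, and the dict standing in for your users table are all hypothetical, and it assumes the customer sends the user's email address as the NameID.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class User:
    email: str
    display_name: str


@dataclass
class VerifiedAssertion:
    """Hypothetical output of a SAML library after it validates the signed response."""
    name_id: str
    attributes: Dict[str, str] = field(default_factory=dict)


def find_or_create_user(users: Dict[str, User], assertion: VerifiedAssertion) -> User:
    """Look the user up by the identity the customer vouched for, creating them if new."""
    email = assertion.name_id.strip().lower()
    user = users.get(email)
    if user is None:
        display_name = assertion.attributes.get("displayName", email)
        user = User(email=email, display_name=display_name)
        users[email] = user
    return user  # the caller starts a session for this user; no password involved


users: Dict[str, User] = {}
first = find_or_create_user(users, VerifiedAssertion("Pat@Example.com", {"displayName": "Pat"}))
again = find_or_create_user(users, VerifiedAssertion("pat@example.com"))
print(first is again, len(users))
```

The hard part (parsing the XML and verifying the signature against the customer's certificate) belongs in a well-tested library, not in your own code.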

In summary: SAML == more money in your pocket.

Adventures in Stock Picking

2017-06-07T00:00:00+00:00

About a year and a half ago I started buying shares in individual public companies rather than buying shares of index funds. I did this because I thought it was fascinating and because I thought I could do a good job, at least matching the performance of the market as a whole. This week I decided to end the experiment and go back to index funds.

This is part chronicle of my brief journey and part explanation of my reasoning for both starting and ending the experiment.

Once upon a time

Way back in middle school my Scoutmaster was a stock broker. He ran his brokerage out of a small office in town, just a few blocks from my house. One day on the way back from a camping trip I paged through a mutual fund brochure I found on the floor of the van and was utterly transfixed. I hadn't ever heard of a mutual fund before, didn't know what they were or how they worked, but I knew that I was very interested in finding out. Later he explained it all to me and gave me a few issues of the Wall Street Journal to page through, where I found pages and pages of pithy lines filled with jargon and numbers.

Fast forward to high school. I took some of the money I earned working a crappy job at a consignment shop and put it into the market. Finally! I was participating! I started buying $20 every month of Johnson Controls, a company in Wisconsin that builds things like the climate control system in your car and other stuff integrated into bigger products. I had set my account up with JCI's dividend reinvestment plan, meaning they would automatically buy me more shares as they paid dividends.

I sorta forgot about the whole thing when I graduated high school, but my sophomore year of college I sold it all at a substantial loss because I needed to pay for ... something. I don't even remember what it was, maybe tuition, maybe a credit card bill. In any case I lost a bunch of money and decided that investing in the stock market was not for me.

A few years later I entered the professional world and started putting money into a Roth IRA and my work-provided 401(k) plans. I set it on autopilot, just picking either a total market index fund or an S&P 500 index fund depending on what was available.

One day

In October 2015 I was perusing various forums related to the financial independence and early retirement (FIRE) movement and came across a link to a guy named Joshua Kennon. I don't remember the post I first landed on but I do know that I proceeded to read his entire blog from start to finish over the course of three weeks. At the time he had a multitude of posts about specific investment strategies, deep dives into fascinating companies, and broad overviews of entire industries.

He's since purged a lot of those things for a variety of reasons, but one of my favorite posts of his that's still around is this dive into an investment in the Hershey Company. I discovered, through Kennon's writing, that a lot of companies have fascinating histories and origin stories. I also rediscovered the interest in learning about companies and the stock market that had been buried by eight years of professional software development.

Because of that

I decided that I wanted to try investing in individual companies, deeply investigating each one and fully understanding how they make and spend their money. I figured I could do better than the S&P 500 because I just wouldn't buy the giant tech companies, nor the losers that bring down returns and yet hang in the index because of their size.

I started futzing around with spreadsheets and Google Finance, trying to figure out what I wanted my ideal portfolio to look like. I definitely didn't want any tech companies. My reasoning was pretty simple: my entire income stream is predicated on those big tech companies in a variety of ways, and if they were to go bankrupt I didn't want my retirement savings to be blown away at the same time.

In November of 2015 I started buying individual stocks, focusing on consumer staples like Colgate, Procter & Gamble, Coca-Cola, and of course Hershey. Eventually I ended up with a portfolio of over 20 companies spread across a half dozen industries.

Because of that

Then this happened in July 2016:

(Don't worry! She had a bit of jaundice when she was born but the disco lamp treatment worked wonders and now she's happily crawling around and getting into stuff).

After my daughter came into my life things in the stock picking department got difficult. New babies cause sleep deprivation. It's just a thing. I made some bad sleep-deprived calls and invested in some things that were actually terrible. My one saving grace was that I was smart enough to not risk much in these little jaunts but it still stung.

After getting burned I froze up and didn't know what to do. I stopped investing new money at all. Cash just built up in my brokerage account not earning anything.

Until finally

I firmly believe that I could do well for myself, my daughter, and other eventual heirs by investing in individual stocks. You just have to stick with what you know and have a head for simple math.

However.

Doing this takes a lot of time and energy. Reading quarterly and annual reports for 23 companies is just too hard to keep up with, when each one is 30-40 pages of columns of numbers and dense text. I would rather spend that time playing with my daughter on the living room floor.

Last week I decided to radically simplify. I unloaded almost my entire portfolio of individual positions and put the money into two exchange traded funds: one is a total market fund that invests in every single public company in the United States, and the other is a fund that contains about 300 companies in the consumer staples sector. Both of these are "market cap" weighted, which means you own more of the bigger companies and less of the smaller ones. This weighting makes little sense when you look at it as an individual investor but it makes the funds incredibly cheap to run, which means ultimately I keep more of the money I invest instead of paying brokerage fees.

I'm definitely not against investing in individual stocks. I kept my Hershey shares because I love how the company is structured. Almost all of the voting control of the company is vested in a trust for the benefit of a boarding school, originally for orphans but now for lower income families in general. This structure means the company makes decisions that benefit the school in the long run instead of for short run gains for outside shareholders. You can read more about the company in the link above and this article also by Joshua Kennon.

I did pretty well. When I sold off my individual holdings I realized about 13% profit, including several positions that did over 40%. For a period of time I was beating the S&P 500 by several percentage points but I ended up only about a half point better. Definitely not worth the effort and anxiety, at least to me. My time is better spent with my daughter or working on or in my business. Selling your own products will almost always have a better return than the market anyway.

If you're looking for something to help you track your investments, check out Personal Capital. It lets you see all of your accounts in one place and plan for your retirement.

My Miniature Corporate Empire

2017-05-15T00:00:00+00:00

This post is mostly for posterity so I don't forget why and when I did this stuff. If you find small-scale business stuff interesting you'll like it.

When I initially published my first book Mastering Modern Payments in 2013 I just did it myself, with no business backing it. I had a separate personal checking account where the Stripe deposits ended up, I deducted business expenses, and everything was peachy. Then I got a not-insignificant consulting gig and I decided it was time to get a bit more professional.

The Birth of a Giraffid

In March of 2014 I set up Okapi LLC (because okapi have stripes, get it? ha ha). Okapi became my central clearinghouse of everything business-y. Consulting payments and Stripe deposits came into its business checking account and cash came out. Later my accountant advised me to elect S-corp taxation, which ended up saving us a boatload of taxes even though it cost a bit more to run. Consulting became my full time thing in September 2014 with steady payments from book sales supplementing our household income.

When my wife and I found out we were having a baby in late 2015 I got really into planning our family's future, including things like estate planning. Super-nerd levels of obsession.

One of the things that I was very concerned about was what would happen to the IP that I create when I die. Our estate planning attorney suggested that I sign it all over to Okapi but that made me uncomfortable. What would happen if a client sued Okapi and won a judgement over my insurance limits? They'd be able to get access to all of the IP that I worked so hard to create!

Okapi Spawns a Sibling

After talking things over with a handful of lawyers and my accountant I decided that the best solution to this (extremely paranoid) worry was to split publishing and IP holding off into a separate business. We named it Cora Street Press LLC, after the first street that my wife and I lived on together.

Cora Street Press is a bit different. Instead of just me owning Okapi, my wife and I own Cora as "tenants by the entireties", which is similar to owning a checking account jointly with someone else, with some additional state-specific benefits. It's also a manager-managed LLC, where I'm the manager and my wife is the backup.

Cora functioned well for half of 2016 and into 2017 when book sales started to finally slow down and overhead started becoming a big deal. It doesn't cost a lot of money to run an S-corp but it's definitely not free. Those costs quickly became a big portion of Cora's revenue.

Cora Adopts a Child

Cora and Okapi are both S-corps so my initial thought was to just drop Cora's S-corp status to eliminate some overhead, but there are hard deadlines for that set by the IRS. You can't just elect and drop S-corp taxation willy-nilly because that would be a nifty way to cheat on taxes. It also wouldn't drop much overhead because we'd still have to have Cora's taxes prepared as a partnership.

Here's where it gets weird.

It turns out that the easiest way to cut down on overhead while preserving the huge benefits of keeping consulting and publishing separate would be to make Okapi a subsidiary of Cora. That way I only have to run one payroll each month instead of two, and the tax preparation costs are much less because we're only filing one S-corp return instead of two.

The process was actually really easy. I filled out a little form that assigned my personal Okapi LLC interest to Cora and then filed another form with the IRS that told them about the subsidiary relationship. Technically Okapi is now a Qualified S-Corp Subsidiary.

Day to Day

The day to day operations of these two businesses didn't change a whole lot. We do the bookkeeping separately but now most of the money flows from Okapi's bank account up into Cora's. We run payroll, pay for health insurance, and make 401(k) deposits all out of Cora's account now.

The biggest change is that I worry less. Future publishing will just happen through the top-level business, consulting and other "hot" liability things happen through Okapi, and future weird things can happen as additional disregarded LLC subsidiaries. Cora is now the lynchpin holding company of my little business empire.

We (Probably) Have Two Years of ACA Left

2016-11-10T00:00:00+00:00

Like many of you, my family relies on the health care insurance marketplace for our health insurance needs. The ACA has been a boon to me and my wife because we happen to have pre-existing conditions. Without the Affordable Care Act we would be uninsurable outside of a group plan.

Due to the recent election, the ACA is probably going to have some major changes in the next few years. Here are two reasons why we probably don't need to panic just yet.

1. Regulatory Timing

Insurance companies submit their finalized plans for the year to most states by April or May of the prior year. That means 2018 plans will be in process by May 2017. This leaves only a precious few months for the new Congress to get a fresh bill started and completed in both the House and the Senate. They're going to spend at least a month finalizing their rules and dickering about the filibuster which leaves even less time.

2. The Filibuster

In order to get rid of the pre-existing condition clause the Senate has to either beat a filibuster or eliminate it entirely. The alternative is to pass a bill through the budget reconciliation process, which can only deal with monetary issues. If they pass the budget reconciliation version without eliminating the pre-existing condition clause the health care system in the US will experience a death spiral and the majority will get blamed. If, on the other hand, the new Senate eliminates the filibuster they're screwing their future selves. The Republicans will again be the minority party at some point in the future and the filibuster is one of the most powerful tools for the minority.

Of course, I very well could be wrong. Nobody really knows yet what the new president and the new Congress have planned. It's best to have a backup plan, which for consultants and freelancers probably means getting friendly with some companies where you'd be comfortable working full time.

Archiving Websites with Wget

2016-05-22T00:00:00+00:00

Let's say you want to archive a website. Maybe they're closing down or changing focus, or maybe you just want to view it offline. You want to capture the whole site at a single point in time. How would you do that?

You could just use your browser to save the page, but you probably won't get the HTML or images. You could print the page to a PDF, but then it's in a weird format and might be stuck in the print stylesheet forever.

You could use a service like Pinboard but they only archive one page, whereas you want to capture the whole site.

So, what do you do?

If you've used the internet for a while you probably know of a little website called the Wayback Machine. The Wayback Machine, as its name implies, lets you travel back in time and view websites as they existed at various points in the past.

This fantastic machine is run by an organization called the Internet Archive, a non-profit that has the noble mission of preserving the entire Internet, along with things like movies, old video games, music, etc.

When IA first started doing their thing, they came across a problem: how do you actually save all of the information related to a website as it existed at a point in time? IA wanted to capture it all, including headers, images, stylesheets, etc.

After a lot of revision the smart folks there built a specification for a file format named WARC, for Web ARChive. The details aren't super important, but the gist is that it will preserve everything, including headers, in a verifiable, indexed, checksummed format.

Capturing Archives

What does this have to do with our problem? It turns out that you can produce your own WARC files using a tool you already have on your Mac OS X and/or Linux machine! Just open up a terminal and type something like this:

$ wget \
    --mirror \
    --warc-file=YOUR_FILENAME \
    --warc-cdx \
    --page-requisites \
    --html-extension \
    --convert-links \
    --execute robots=off \
    --directory-prefix=. \
    --span-hosts \
    --domains=example.com,www.example.com,cdn.example.com \
    --user-agent="Mozilla (mailto:archiver@petekeen.net)" \
    --wait=10 \
    --random-wait \
    http://www.example.com

Let's go through those options:

wget is the tool we're using
--mirror turns on a bunch of options appropriate for mirroring a whole website
--warc-file turns on WARC output to the specified file
--warc-cdx tells wget to dump out an index file for our new WARC file
--page-requisites will grab all of the linked resources necessary to render the page (images, css, javascript, etc)
--html-extension appends .html to the files when appropriate
--convert-links will turn links into local links as appropriate
--execute robots=off turns off wget's automatic robots.txt checking
--span-hosts allows it to follow links to other domain names
--domains includes a comma-separated list of domains that wget should include in the archive
--user-agent overrides wget's default user agent
--wait tells wget to wait ten seconds between each request
--random-wait will randomize that wait to between 5 and 15 seconds
http://www.example.com is the website we want to archive

Three of the options need some explanation. First, disabling robots checking is not normally something you should do because it's not very polite, but I'm assuming you're going to be grabbing these archives for personal use only so turning this off is acceptable.

Second, we override wget's default user agent to include Mozilla so servers don't reject us outright. More importantly we add an email address so site owners can contact us if we're causing problems.

Third, adding a wait time between requests reduces load on the server you're accessing. You should be polite to the people that own the website you're archiving.
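If you'd rather drive wget from a script, here's a small sketch that assembles the same options as a list of arguments. The build_wget_command function and its parameters are made up for this example; the flags are the same ones from the command above. Passing a list means the user agent's spaces and parentheses never go through the shell, so there's no quoting to get wrong.

```python
import shlex
from typing import List


def build_wget_command(url: str, domains: List[str], warc_name: str,
                       contact_email: str, wait_seconds: int = 10) -> List[str]:
    """Assemble the archiving command as an argument list (nothing is executed here)."""
    return [
        "wget",
        "--mirror",
        f"--warc-file={warc_name}",
        "--warc-cdx",
        "--page-requisites",
        "--html-extension",
        "--convert-links",
        "--execute", "robots=off",
        "--directory-prefix=.",
        "--span-hosts",
        "--domains=" + ",".join(domains),
        # Identify yourself so site owners can reach you if you cause problems.
        f"--user-agent=Mozilla (mailto:{contact_email})",
        f"--wait={wait_seconds}",
        "--random-wait",
        url,
    ]


command = build_wget_command(
    "http://www.example.com",
    ["example.com", "www.example.com", "cdn.example.com"],
    "YOUR_FILENAME",
    "you@example.com",
)
# Print a copy-pasteable, correctly quoted version of the command.
print(shlex.join(command))
```

To actually run it, hand the list to subprocess.run(command, check=True).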

Browsing Archives

If you just grab HTML pages and stick them in a folder somewhere, you can just double click on them and view them in your browser. Not so much with a WARC file.

I use a simple tool named Web Archive Player to view the archives I've created. Just download the tool and run the application. It will prompt you for a WARC file to open, and when you pick one it will open up your browser automatically so you can navigate your archive.

A few notes before you start archiving everything:

This is for personal use only! Don't start infringing copyright by publishing archives against publishers' wishes.
Always be polite! Use the wait option. Use a user agent that identifies who you are.
This works for sites that are mostly static HTML. It's not going to work for sites like YouTube, Twitter, or Facebook that have videos or use JavaScript to load things.

Does an LLC protect me from a personal injury lawsuit?

2015-11-23T00:00:00+00:00

Let's say you run a small photography business, taking pictures of babies for excited, proud parents. You've heard that you should incorporate or form an LLC to limit your liability.

But what does that actually mean?

Not what you think it does.

What an LLC does

It turns out an LLC protects you, the owner, from your business's financial liabilities. Going back to our photography business example, let's say you want to buy some better lenses for your camera that will let you take even more adorable photos. You convince a bank to give your business a loan without a personal guarantee from you (see note below), and you buy your equipment.

Two years later, your business runs out of cash and has to shut down. You can't pay the bank anymore, so they foreclose on the loan and force you to sell your camera equipment to settle the debt. Because your limited liability company is the only party to the loan, the bank can't make you sell your house or your car or dip into your retirement savings.

As another example, let's say a family sues you because they feel you overcharged and delivered a shoddy product. If they were to win, your business would need to pay the judgment, but you would not personally have to sell your car. You could let the business burn down instead.

Note: no bank in the world will give your small business a loan without a personal guarantee from you. This means that you and your business are co-signing the loan, which puts you on the hook.

What an LLC does not do

Let's imagine the worst case scenario. You're shooting pictures of a new baby and their family in your house when suddenly a freak accident happens: a light falls over on top of the baby, killing them instantly.

Does your LLC protect you, your family, and your business from the inevitable fallout?

NO, it does not.

The family and their insurance company can and will come after you, personally, as well as your business, for every single cent you have. If you don't have the right amount of insurance you will be devastated not only emotionally but financially.

So what's the right arrangement?

In every business you need to assess your risk, both personally and professionally, and find solutions that protect you. For the above situation, you should likely have a professional liability policy and a personal umbrella policy on top of your homeowner's policy. You should talk to an insurance broker so they can guide you to the coverage you need.

Want to know more?

My book Handle Your Business will teach you everything you need to know about running your small professional business. From insurance to incorporation to taxes to contracts, this short guide helps you run your business better.

One Thousand Days

2015-09-22T00:00:00+00:00

Today in my calendar there's a little entry that just says "1000 days."

One thousand days ago I received my first infusion of life saving deadly poison.

Between that day and today, I received twenty infusions of poison and then the best news: I was cancer free.

I also lost a parent. Wrote a book. Got married to a beautiful, amazing, tolerant, hilarious woman.

I lost another parent. Moved across the country. Bought a house.

I got married again, to that same beautiful, amazing, tolerant, hilarious woman. We honeymooned in Niagara Falls.

I quit my job. Made myself another one. Lost a pet. Wrote another book.

What's the common thread?

I'm still here.

And that's something to celebrate.

New Book: Handle Your Business

2015-09-17T00:00:00+00:00

Let's say you do some consulting work once in a while. You do some work, people pay you, everyone's happy.

Now, let's say you want to make a living with consulting, like I did a year ago. What do you need to know? How do you set up an effective, legitimate business around your work?

I did over two years of research into how to set up my business properly, both to protect me and to save money on taxes every year. There's an awful lot you need to know, and a whole lot of noise you don't.

My new book Handle Your Business gives you all of the good stuff and none of the noise in a succinct, easy to read guide.

Learn everything you need:

What business entity to pick
How to structure your contracts
Where to get a business bank account
The insurance coverage you need, and what you don't
...and more!

Handle Your Business is available today.

Organizing Your Consulting Business

2015-08-31T00:00:00+00:00

This is an excerpt from my upcoming book Handle Your Business - the Succinct Guide to Money and Finance for the Self-Employed. Enter your email at the bottom for updates, early access, and a coupon when the book launches.

A business is a legal fiction. It only exists insofar as we and the courts believe it to. It's an entity made of pure thought, even more so than a computer program. When you write a computer program you're causing the computer to take physical actions. When you form a business, you're literally willing a new thing into existence.

This may seem like a trivial point, but you only get the benefits of a business as long as other people believe it exists. What are those benefits, you ask? There are two big things you get by forming a separate business entity:

limited personal liability for the financial liabilities of the company
tax incentives

You could just start doing business as yourself. You may have already. You could, in fact, just ignore this whole chapter, and there's nothing really wrong with that. But, you don't get those two things above.

Limited Liability

For a freelancer or consultant, there's honestly not a lot of benefit to the limited liability portion. Let's walk through a situation that might happen to you as a software developer. You write a system for a client that processes credit cards, using the latest in PCI-compliant systems like Stripe or Braintree so you're not storing credit card data on the client's servers. You do the best you can do to make it as secure as possible, but let's say there's a bug in the underlying framework. An attacker gets into the client's machines and modifies your code such that it starts skimming credit card numbers.

The credit card company identifies your client's system as the source of the leak and shuts down their merchant account. Your client sues you AND your business for negligence.

The liability shield built into the business might protect you, assuming you included the right language in your contract, but you're still on the hook for paying a defense lawyer. In the chapters on Insurance and Contracts we'll talk about ways to protect yourself, but just know that the business, by default, isn't really there to protect against this kind of liability.

"Limited liability" is much more narrow than that. It actually means you're not liable for debts the business owes by itself. For example, if your business took out a loan to purchase something, and then fell behind on payments and ended up in bankruptcy, your liability for that loan ends at the amount of money you have invested in the business. Unless the bank demanded a personal guarantee from you, of course. In that case you're still on the hook.

Tax Incentives

The biggest boon when you have a separate business entity is the tax deductions available when you're operating for profit. Businesses are taxed on their profits, not their total revenue (except in some states that have a gross revenue tax). Here are some examples of things you can write off as a business owner that you can't when you're an employee:

full cost of health care insurance premiums
full cost of business insurance premiums
mileage on your car
new computer equipment
phone and internet service, including web hosting
meals with clients
home office

Any expense the business incurs in the normal course of operations counts as something you can deduct in some way. There are rules surrounding some deductions because they've been abused in the past, but for purposes of this discussion just know that almost every business expense is deductible.

Types of Business Entities

Broadly speaking, there are three different types of entities you can start.

Sole Proprietor and Partnership

First you have the defaults. If you just start charging people money for goods or services as yourself, you're by default a Sole Proprietor. The buck starts and stops with you. You reap all the profits from your business, and you're liable for everything your business does, because you are your business. A partnership is the same thing except you have two or more people involved instead of just yourself.

Corporations

Sole proprietors and partnerships are the original ways to do business, and they're the simplest to form (do nothing). That said, they have a lot of drawbacks, especially around liabilities. Thus, the corporation. Story time: a long time ago people would get together and fund trade expeditions. They would pool their money, take out loans, hire a ship and a crew, and send them out to find riches and trade routes. Exploration is a dangerous game. Sometimes (i.e. all the time) a ship would sink, the crew would disappear, and the banks that gave the group loans would demand their money back. They would find the richest investor and demand compensation, and the courts would give it to them, sometimes to the point of sending investors to debtors' prison.

Investors were naturally hesitant to invest in new expeditions, because the risk of catastrophic loss and subsequent personal loss was so high. So, they got together with their friends in the government and wrote down a way to limit their liability to just the money they put into the business.

A corporation is a separate legal entity from the owners. It doesn't die until it's killed. It can have bank accounts, buy things, sell things, and generally go about conducting business as if it were a sole proprietor, all while protecting the owners from its creditors. To this day, corporations are the most common form of formal business entity.

Modern corporations are great if you want your business to have lots of little shareholders or you want to retain significant money in the business. They also come with all kinds of required formalities, like annual meetings, stock certificates, and other paperwork. Corporations have to file their own tax return and have their own tax brackets. This means you would probably end up paying taxes twice on some portion of your company's revenue, which is not ideal. There are ways to minimize it, but consultants aren't looking for this kind of thing, so a corporation is probably not the best idea for them.

LLCs

An LLC (Limited Liability Company) is a hybrid between a partnership and a corporation. The owners enjoy limited liability without all of the paperwork that a corporation requires. In trade, they give up the ability to sell shares to the public, among other things.

You can conjure up an LLC in 10 minutes by filling out a form and sending it into your state's Secretary of State along with the registration fee. Bam. New company, born in less time than it takes to buy a cup of coffee.

The rules for the internal workings of an LLC vary by state, but the common ones are:

File with the state periodically (some states are annual, some are every other year)
Usually pay some sort of franchise fee or tax
Don't commingle personal and business assets (i.e. have a separate bank account)
Don't commit fraud

Every state's LLC law sets out default rules and then allows you to write an Operating Agreement to override them. For all practical purposes, single-member LLCs like your baby consulting company can generally get by with the default rules or a simple agreement like the following:

http://www.northwestregisteredagent.com/single-member-llc-operating-agreement.html

Yes, you'll be signing a contract by yourself. The point is that you have written processes in place for your business, which helps to enforce the notion in your mind and other people's minds that the business is in fact a separate entity. Remember, a business only exists if people believe it does.

Taxes and S-Corp Elections

By default, an LLC is a "pass through" entity. The IRS doesn't acknowledge its existence, calling it a "disregarded entity", which means all of the revenues and expenses from the business flow onto your personal tax return and are taxed at the personal rates. Most states don't tax LLCs individually either, except for yearly registration fees, but some states like California have a tax on your gross receipts with a minimum of $800.

When you work for a business as a normal employee, the business reports what they paid you and how much they withheld for taxes to you and the IRS on form W-2. There are three Federal-level taxes on a W-2: Federal income tax, Medicare, and Social Security. As an employee, you only see half of the Medicare and Social Security taxes withheld from your paycheck. Your employer pays the other half and gets to deduct it on their income taxes. Each half of Social Security is 6.2% up to $118,500 in wages, and each half of Medicare is 1.45% on all wages.

As a business owner you are your own employer. This means you get the privilege of paying both halves of Medicare and Social Security, which comes to a total of 15.3% on the first $118,500 in income and 2.9% of every dollar after. You do get to deduct half of that when figuring how much is subject to income tax, but it's still a hefty bite.

Long ago the IRS decided to allow a special type of corporation, called a Sub-chapter S corporation, to reduce how much they pay for Social Security and Medicare. When a corporation elects Sub-chapter S status they agree to certain rules, including pass-through taxation like a partnership or sole proprietor, limited number of shareholders, and rules about the types of stock they can issue and who can own it. In return, they get to decide how much each owner gets paid as wage vs dividend and thus how much self-employment tax they pay.

The IRS allows LLCs to make this same election. Here's an illustration, assuming $100,000 of taxable income and a reasonable wage of $60,000.

	Disregarded Entity	S-Corp	Diff
Taxable Income	$100,000	$100,000	$0
Wage	$0	$60,000	$60,000
Social Security	$12,400	$7,440	$4,960
Medicare	$2,900	$1,740	$1,160
Total SE Tax	$15,300	$9,180	$6,120

By electing S-corp taxation you save yourself $6,120 in taxes. Here's another example, this time assuming $200,000 in taxable income and a $140,000 reasonable wage:

	Disregarded Entity	S-Corp	Diff
Taxable Income	$200,000	$200,000	$0
Wage	$0	$140,000	$140,000
Social Security	$14,694	$14,694	$0
Medicare	$5,800	$4,060	$1,740
Total SE Tax	$20,494	$18,754	$1,740

In this case you only save $1,740 because of the wage cap on Social Security.
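If you'd like to try other income and wage combinations, here's a minimal Ruby sketch of the same arithmetic. It assumes the 2015 rates and wage cap quoted above and, like the tables, ignores income tax and the deduction for half of the self-employment tax. The helper names payroll_tax and s_corp_savings are made up for illustration.

```ruby
# 2015 figures quoted in the text above.
SS_RATE       = 0.124    # both halves of Social Security (6.2% x 2)
MEDICARE_RATE = 0.029    # both halves of Medicare (1.45% x 2)
SS_WAGE_CAP   = 118_500  # Social Security stops applying above this

# Combined Social Security and Medicare tax on a given amount of pay.
def payroll_tax(amount)
  social_security = [amount, SS_WAGE_CAP].min * SS_RATE
  medicare = amount * MEDICARE_RATE
  (social_security + medicare).round
end

# Hypothetical helper: compares the two tax treatments for one owner.
def s_corp_savings(taxable_income, reasonable_wage)
  disregarded = payroll_tax(taxable_income)   # tax applies to all income
  s_corp      = payroll_tax(reasonable_wage)  # tax applies to the wage only
  { disregarded: disregarded, s_corp: s_corp, savings: disregarded - s_corp }
end

puts s_corp_savings(100_000, 60_000)
puts s_corp_savings(200_000, 140_000)
```

Treat the output as a rough estimate only. What counts as a "reasonable wage" is a judgment call to make with your accountant.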

As you can see, the benefits of S-corp taxation are massive when you have below $200,000 in taxable income per member for the year. They start to phase out at the Social Security wage cap, but there are some big deductions you can take to keep your taxable income near that level.

Save for Taxes!

As you can see, as a successful business owner you are going to be paying taxes. Avoid a nasty surprise when you see your bill next April by making estimated payments each quarter. The IRS's due dates for quarterly payments are:

April 15th
June 15th
September 15th
January 15th

No, these are not calendar quarters, but I have yet to see a good explanation as to why. Stick them in your calendar with reminders so you don't forget.

How do you figure out how much to pay? For your first year, don't stress about it too much. The IRS doesn't really care that you make uneven payments, just that you pay at least 100% of what you paid last year or 90% of what you need to pay this year by April 15th. Also remember that if you or your spouse had a job at any point in the year you paid at least something in already, so you can subtract that out when figuring out an estimate.

For subsequent years you and your accountant can figure out how much your quarterly payments should be.
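As a rough illustration of that rule of thumb, here's a small Ruby sketch. It assumes only the two thresholds mentioned above (100% of last year's tax or 90% of this year's) and equal installments. The quarterly_estimate helper and the sample numbers are hypothetical, and your accountant may apply additional rules.

```ruby
# Hypothetical helper: the smaller of the two thresholds described above,
# minus anything already withheld from a paycheck, split into four payments.
def quarterly_estimate(last_year_tax:, projected_tax:, already_withheld: 0)
  required  = [last_year_tax * 1.0, projected_tax * 0.9].min
  remaining = [required - already_withheld, 0].max
  (remaining / 4.0).ceil  # round up to whole dollars
end

# Made-up numbers: last year's tax was $12,000, this year looks like $20,000,
# and $2,000 was already withheld from a spouse's paycheck.
puts quarterly_estimate(last_year_tax: 12_000, projected_tax: 20_000, already_withheld: 2_000)
```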

Summary

In sum, here's how you should organize your business:

Form an LLC in your state
Elect S-corp taxation by filing form 2553 with the IRS
Save for taxes and pay quarterly

In later chapters we're going to talk about the other things you should do to maintain the separation between you and your business, including contracts, banking, and insurance.

Program Your Finances: Algorithmic Savings

2015-06-17T00:00:00+00:00

When I started my first full time job in 2007 I started putting away a little bit of my paycheck every two weeks into savings. For the past two years I haven't been doing that manually. Instead, I've been using Ledger's fantastic automated transactions to put money away without having to think about it, both for long term goals and envelope budgeting.

Automated saving transactions have been great, except that they never really captured the whole picture, nor did they fit a few constraints I wanted:

When a fund is below a minimum threshold it should get priority
When a fund is above a maximum it should not receive any more savings
I don't want to save any more than I actually have available in a given month

For example, I have an emergency fund that I keep at about $15k. If it falls below, say, $13k, I want to boost it up as fast as possible. But, if I only have $10 left at the end of the month I don't want to try to save more than that.

Algorithms

Ledger's automated transactions can't reach that kind of flexibility because they don't have access to arbitrary account balances (at least as far as I can tell). Also, because they're evaluated at parse time, the first 300 lines of my ledger file are automated transaction rules.

Instead of using automated transactions, I wrote a little program that generates a transaction to be pasted into my ledger. It takes the three constraints above and turns the cash left over at the end of the month into savings without me having to put numbers into a spreadsheet and manually construct the Ledger transaction.

The algorithm happens in two stages and acts on a set of rules, something like this:

RULES = [
  { 'Emergency'      => { min: 13000, max: 15000, weight: 10 } },
  { 'Medical'        => { min:  1500, max:  4000, weight:  8 } },
  { 'House'          => { min:  3000, max: 15000, weight:  8 } },
  { 'Furniture'      => { min:   200, max:  4000, weight:  4 } },
  { 'Travel'         => { min:  2000, max: 20000, weight:  4 } },
]

It also depends on having a few numbers available, namely the balance of each fund in the set of rules as well as how much excess cash there was at the end of the month.

The algorithm then takes two passes over the rules.

Sum up the weights in all of the rules. If the account balance is greater than or equal to the max, set the weight to zero. If it's below the min, multiply the weight by 4. Keep track of the total weight in the set and the calculated weight for each rule.
For each rule, calculate the percentage "share" by dividing the account weight by the total weight. Then calculate the amount of this share by multiplying it by the remaining income, up to the max for that fund. Subtract that amount from the remaining income, subtract that rule's weight from the total weight, and continue down the rules until you're out of money.

Each rule is evaluated in terms of two shrinking pies: the total weight and the remaining income. When no funds hit their max value this is strictly equivalent to a straight percentage savings, but elegantly deals with both the min and max situations.

Here's what that looks like in code:

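# Inputs (computed elsewhere in the script):
#   fund_balances    - hash of fund name => current balance
#   remaining_income - excess cash left at the end of the month

# Pass 1: work out the effective weight of every fund.
account_weights = {}
total_weight = 0

RULES.each do |rule|
  account = rule.keys.first
  rules = rule.values.first
  balance = fund_balances[account] || 0
  weight = rules[:weight]

  if balance < rules[:min]
    weight = weight * 4   # underfunded: boost its priority
  elsif balance >= rules[:max]
    weight = 0            # full: drop out of the split
  end

  total_weight += weight
  account_weights[account] = weight
end

# Pass 2: hand out the remaining income in rule order.
xtns = {}
RULES.each do |rule|
  account = rule.keys.first
  rules = rule.values.first
  weight = account_weights[account]
  next if weight == 0     # also avoids dividing by a zero total weight

  balance = fund_balances[account] || 0
  share = weight.to_f / total_weight.to_f

  # Never deposit more than it takes to reach the fund's max.
  deposit_amount = [
    remaining_income * share,
    rules[:max] - balance
  ].min

  next if deposit_amount.round == 0

  total_weight -= weight
  remaining_income -= deposit_amount
  xtns[account] = deposit_amount
end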

This algorithm has some great properties:

The priority of a fund is determined by its placement in the rules. Earlier funds get funded before later funds.
The amount a fund gets is determined by its weight. Higher weight gets a bigger share.
Funds below their minimum get plumped up with the weight multiplier, while full funds automatically drop out.
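To see those properties in action, here's a self-contained sketch that wraps the two passes in a method and runs them against made-up balances. The allocate method name, the trimmed-down rule set, and the sample numbers are assumptions for illustration, not the exact script from the gist.

```ruby
SAMPLE_RULES = [
  { 'Emergency' => { min: 13_000, max: 15_000, weight: 10 } },
  { 'Medical'   => { min:  1_500, max:  4_000, weight:  8 } },
  { 'Travel'    => { min:  2_000, max: 20_000, weight:  4 } },
]

# Hypothetical wrapper around the two passes described above.
def allocate(rules, fund_balances, remaining_income)
  weights = {}
  total_weight = 0

  # Pass 1: boost underfunded funds, zero out full ones.
  rules.each do |rule|
    account, limits = rule.first
    balance = fund_balances.fetch(account, 0)
    weight = limits[:weight]
    if balance < limits[:min]
      weight *= 4
    elsif balance >= limits[:max]
      weight = 0
    end
    weights[account] = weight
    total_weight += weight
  end

  # Pass 2: split the remaining income, capping each fund at its max.
  deposits = {}
  rules.each do |rule|
    account, limits = rule.first
    weight = weights[account]
    next if weight.zero?

    balance = fund_balances.fetch(account, 0)
    share = weight.to_f / total_weight
    amount = [remaining_income * share, limits[:max] - balance].min
    next if amount.round.zero?

    total_weight -= weight
    remaining_income -= amount
    deposits[account] = amount.round(2)
  end
  deposits
end

# Emergency is below its min, Medical is nearly full, Travel is at its max.
balances = { 'Emergency' => 12_000, 'Medical' => 3_900, 'Travel' => 20_000 }
puts allocate(SAMPLE_RULES, balances, 1_200)
```

With these inputs the emergency fund's boosted weight earns it most of the $1,200, Medical is capped at the $100 it needs to reach its max, and Travel gets nothing because it's already full. The last $100 stays unallocated.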

The only drawback is that I have to manually run this script every month, but I feel like that's a small price to pay for the flexibility this gives me. If you're interested in the gory details of the script I put the whole thing in a gist. I'd love to hear your thoughts, even if you just want to tell me I'm crazy.

Pay Your Taxes!

2015-05-08T00:00:00+00:00

"I'm in deep with the IRS."

"We ended up owing $15,000 this year."

"I don't have that kind of money just laying around. How do I file an extension?"

Sound familiar? Maybe you're a freelancer. A consultant. An independent business person. Somehow, some way, you've got money coming in that isn't from a normal everyday W2 job.

One thing's for sure: you have to pay taxes on that.

What? Taxes?

Yep. Taxes. That thing you don't want to think about because other things are way more important, like actually running your business and bringing money in and buying groceries and cutting your toenails.

Still, you have to pay them or the IRS gets cranky. Crankier than your two year old after a six hour car ride. Crankier than your cat is at the vet. Suffice it to say, probably something you'd rather avoid.

How much?

It's your honor and duty as a citizen of the United States to pay as little as you can possibly get away with, but no less. As a business owner (because that's what you are, you lucky dog you) there are all kinds of tricks and deductions and things you can do to reduce what you owe, but that's a lot to think about and really that's why accountants exist and can charge so much.

The simplest thing you can do to avoid the IRS's ire is to pay what you paid last year.

Yep. It's that simple.

Look at your last Form 1040, find the line where it says "Total Tax" (line 63 on Form 1040, line 12 on 1040EZ), and pay that. Same with your state taxes, if you have state income tax.

When?

Quarterly. Except not really.

Specifically, you'll divide up that amount from last year into four equal payments and send the IRS a check and a Form 1040ES (or pay online with EFTPS) on these dates:

April 15
June 15
September 15
January 15

If you notice, those are not equal time periods. The IRS likes to keep things interesting.

What if I'll make more this year than last year?

Awesome! High five! That's how you run a successful business.

Here's what I do:

Set aside 30-40% of every invoice payment into a separate money market account.
Every quarter, I pay the quarterly payment we figured out above from that money market account.
At the end of the year, I send the IRS a check for whatever we owe on top of the quarterlies.

If you know how much you're going to be making you can do some math to figure it out and send in the extra on the quarterlies, but usually it's not worth it.

The percentage you set aside is going to depend a lot on your situation and location. For your first year it's safer to set aside 40% and then dial in the next year.
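Here's a quick Ruby sketch of how that routine plays out over a year. The tax_year_summary helper and every number in the example are made up for illustration. It assumes a flat set-aside rate and quarterlies based on last year's total tax, as described above.

```ruby
# Hypothetical helper: tallies the set-aside account for one tax year.
def tax_year_summary(invoices:, last_year_tax:, actual_tax:, set_aside_rate: 0.40)
  set_aside   = (invoices.sum * set_aside_rate).round  # saved from invoice payments
  quarterly   = (last_year_tax / 4.0).round            # four equal payments
  paid        = quarterly * 4
  balance_due = [actual_tax - paid, 0].max             # year-end check to the IRS
  {
    set_aside: set_aside,
    quarterly_payment: quarterly,
    balance_due: balance_due,
    left_over: set_aside - paid - balance_due
  }
end

puts tax_year_summary(
  invoices: [10_000, 15_000, 25_000],
  last_year_tax: 8_000,
  actual_tax: 14_000
)
```

If left_over comes out negative, the set-aside rate was too low for your situation and is worth raising for the following year.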

What if I'll make less?

No problem. If you know you're going to be making less, you can just reduce your quarterlies. Alternatively, you can just send the IRS some money every quarter. As long as you pay at least 90% of what you'll owe by the end of the year the IRS is happy.

But what if...

There there. It's ok. Taxes are complicated.

You still have to pay them.

Program Your Finances: Envelope Budgeting

2015-04-08T00:00:00+00:00

Note: you can find much more information about ledger on ledger-cli.org, including links to official documentation and other implementations. Also, check out my intro to accounting with Ledger.

A few years ago I heard about YNAB, or You Need A Budget. YNAB is a set of rules and associated software that help people to dig themselves out of financial holes and prosper with a budget. The rules are:

Give Every Dollar a Job
Save for a Rainy Day
Roll With the Punches
Live on Last Month's Income

YNAB embraces both traditional budgeting, where you have a fixed amount of money every month for a category, as well as "envelope budgeting", where you put a fixed amount every month into a category, but if you don't spend all of that it rolls to the next month.

In this blog post I'm going to talk about how to smoothly implement envelope budgeting in Ledger land.

Envelope Budgeting: A Primer

Envelope budgeting is a pretty simple concept. When you receive a paycheck, you separate out a certain amount of money for each category and put it in an envelope. When the money in the envelope is gone, you can't spend any more for that category. Some financial systems actually have you draw out your entire paycheck in cash and put it into physical envelopes, but we're not going to go that far.

Chart of Accounts

If you've ever taken an accounting class you're probably familiar with the concept of a "chart of accounts". In an accounting system, your accounts make a tree, starting from five root accounts:

Assets
Liabilities
Income
Expenses
Equity

For example, if you have a checking account, that's an asset, like this: Assets:Checking. A credit card would be a liability: Liabilities:Credit Card. Your paycheck would be income: Income:Salary, and getting groceries would be an expense: Expenses:Food:Groceries. Equity is out of scope for this discussion, but in a personal finance system it's typically used when you're declaring opening balances in accounts.

Parallel Accounts

The best way to implement envelope budgeting in Ledger is using a parallel chart of accounts. That is to say, a set of accounts that's outside of your normal real-money assets, income, expenses, or liabilities. I've chosen to use Assets:Funds and Liabilities:Funds ("fund" as in "slush fund") in the examples that follow, but you can use whatever you want as long as it doesn't mix with your real money accounts.

Let's say our water bill comes every other month and averages $100. In a traditional monthly budgeting system this would be hard to account for, since some months will be zero and some will have a charge. With our parallel accounts, though, this is easy:

2015/04/02 * Salary
    Assets:Checking              $1,000.00
    Income:Salary

2015/04/02 * Water Bill Accrual
    Assets:Funds:Water              $50.00
    Liabilities:Funds:Water

2015/05/02 * Salary
    Assets:Checking              $1,000.00
    Income:Salary

2015/05/02 * Water Bill Accrual
    Assets:Funds:Water              $50.00
    Liabilities:Funds:Water

At the beginning of April and May, we receive our salary deposit and set aside $50 each time for our water bill. Notice how, in the accrual transaction, we're depositing into our Assets:Funds:Water account and balancing it out from a companion liability. This reflects the fact that in double entry accounting every transaction has to balance, and dedicated balancing liabilities make things easier later on. Here are our balances:

           $2,100.00  Assets
           $2,000.00    Checking
             $100.00    Funds:Water
          $-2,000.00  Income:Salary
            $-100.00  Liabilities:Funds:Water
--------------------
                   0

Now let's look at what happens when our water bill comes due:

2015/05/03 * Water Bill
    Expenses:Water                  $95.00
    Assets:Checking                $-95.00
    Liabilities:Funds:Water         $95.00
    Assets:Funds:Water             $-95.00

Notice how we pull $95 out of our checking account and also pull $95 out of our Assets:Funds:Water account, with a matching $95 reduction in Liabilities:Funds:Water to keep things balanced.

Here's what the balances look like now:

           $1,910.00  Assets
           $1,905.00    Checking
               $5.00    Funds:Water
              $95.00  Expenses:Water
          $-2,000.00  Income:Salary
              $-5.00  Liabilities:Funds:Water
--------------------
                   0

$95 went from the checking account into the water expense and the water fund still has $5 in it.

Automated Envelopes

This system would be a pain in the butt if we had to manually track it for every transaction. Thankfully, Ledger has us covered with automated transactions.

An automated transaction looks a lot like a normal transaction, except it starts with an = and has an expression instead of a payee and date. Let's see what our water accrual rule looks like:

= /Income:Salary/
    * Assets:Funds:Water         $50.00
    * Liabilities:Funds:Water   $-50.00

In this example the expression is a regular expression surrounded by /s. /Income:Salary/ will match any posting with that as the account name.

After the expression we have two lines. They start with a * to indicate that they're cleared transactions. Next is the account name and an amount, just like in a normal ledger transaction.

Now, let's set up a matching rule for spending out of the envelope:

= /Expenses:Water/
    * Liabilities:Funds:Water      1.0
    * Assets:Funds:Water          -1.0

This one is very similar to the first, except for those amounts. Notice how they don't have a commodity attached to them? In automated transactions, ledger will treat an amount without a commodity as a percentage, where 1.0 = 100%. This rule means that we want to match every water expense and pull 100% of it out of our water envelope.

Putting it all together, here's what the automatic version looks like:

= /Income:Salary/
    * Assets:Funds:Water            $50.00
    * Liabilities:Funds:Water      $-50.00

= /Expenses:Water/
    * Liabilities:Funds:Water          1.0
    * Assets:Funds:Water              -1.0

2015/04/02 * Salary
    Assets:Checking              $1,000.00
    Income:Salary

2015/05/02 * Salary
    Assets:Checking              $1,000.00
    Income:Salary

2015/05/03 * Water Bill
    Expenses:Water                  $95.00
    Assets:Checking                $-95.00

Here's the resulting register report:

15-Apr-02 Salary     Assets:Checking          $1,000.00 $1,000.00
                     Income:Salary           $-1,000.00         0
                     Assets:Funds:Water          $50.00    $50.00
                     Liabilities:Funds:Water    $-50.00         0
15-May-02 Salary     Assets:Checking          $1,000.00 $1,000.00
                     Income:Salary           $-1,000.00         0
                     Assets:Funds:Water          $50.00    $50.00
                     Liabilities:Funds:Water    $-50.00         0
15-May-03 Water Bill Expenses:Water              $95.00    $95.00
                     Assets:Checking            $-95.00         0
                     Liabilities:Funds:Water     $95.00    $95.00
                     Assets:Funds:Water         $-95.00         0

For every paycheck, $50 went into our fund. When we paid the water bill, $95 came out of the fund. To set this up for more envelopes, just create a corresponding pair of rules for each one.
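If you have a lot of envelopes, writing each pair of rules by hand gets tedious. Here's a minimal Ruby sketch that prints the rule pairs for you to paste into your ledger file. The envelope_rules helper and the Electric envelope are hypothetical, and it assumes every envelope is funded from Income:Salary and spent via Expenses:<name>, like the water example.

```ruby
# Hypothetical helper: builds the accrual and spending rules for one envelope.
def envelope_rules(name, per_paycheck)
  amount = format('%.2f', per_paycheck)
  <<~RULES
    = /Income:Salary/
        * Assets:Funds:#{name}         $#{amount}
        * Liabilities:Funds:#{name}   $-#{amount}

    = /Expenses:#{name}/
        * Liabilities:Funds:#{name}      1.0
        * Assets:Funds:#{name}          -1.0
  RULES
end

# Envelope name => amount to set aside from each paycheck (made-up values).
envelopes = { 'Water' => 50, 'Electric' => 80 }
puts envelopes.map { |name, amount| envelope_rules(name, amount) }.join("\n")
```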

One last thing. What if we want to change how much we're setting aside in the water envelope? Let's say our rates go up and we now need to save $55 from each paycheck instead of $50. Here's how we do that:

= /Income:Salary/ and expr date >= [2015/04/01] && date < [2015/06/01]
    * Assets:Funds:Water         $50.00
    * Liabilities:Funds:Water   $-50.00

= /Income:Salary/ and expr date >= [2015/06/01]
    * Assets:Funds:Water         $55.00
    * Liabilities:Funds:Water   $-55.00

We can't just delete the old rule because then the transactions from before would be off. Instead, we add date expressions to our rules. Ledger's expression grammar is pretty complicated and not very well documented, but this should be sufficient for the rules you'll be writing for automatic envelopes. Ledger's manual has more documentation on automatic transactions.

I put the examples in this gist if you'd like to play with them. You'll need Ledger 3 installed.

DKIM Deep Dive

2015-01-07T00:00:00+00:00

DKIM (DomainKeys Identified Mail) is another type of email deliverability record that helps recipient servers be confident that you authorized any given email. DKIM uses public-key cryptography to mathematically sign important parts of your messages. This post is a deep dive into how it works and what it's good for.

Why do we need DKIM?

Like SPF, DKIM protects your domain against spammers and phishers by validating your legitimate mail. Mail that purports to come from you and doesn't have a DKIM signature is more suspicious and more likely to be put into a recipient's spam folder.

DKIM uses the DNS in a similar way as SPF. When you deploy DKIM, you insert keys into your DNS records at specifc URLs, thus proving that you control your DNS records.

An Example

DKIM has more moving parts than SPF. Specifically, there are:

Records in your DNS settings that contain public keys
Private keys on your mail server
Cryptographic signatures embedded in your messages

We're going to explore each one of these in turn.

DNS Records

DKIM public keys are stored as TXT records on your domain, under the subdomain _domainkey. As an example, here's the DKIM key that Mandrill uses when sending as petekeen.net:

$ dig +short mandrill._domainkey.petekeen.net txt
"v=DKIM1\; k=rsa\; p=RSA_PUBLIC_KEY_DATA\;"

DKIM records are always key-value pairs separated by semicolons (the backslashes come from the dig tool; they don't actually need to be there). In this case there are three parts. v=DKIM1 says that this is a DKIM version 1 key. k=rsa says that this is an RSA public key. p=RSA_PUBLIC_KEY_DATA stands in for the actual public key (I removed the data just because it's so big. Run the dig command to see the data).
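To make the key-value structure concrete, here's a minimal Ruby sketch that splits a record like the one above into a hash. The method name parse_dkim_record is my own invention for illustration, and it assumes the dig backslashes have already been stripped.

```ruby
# Parse a DKIM TXT record into a Hash of tag => value.
# Assumes the record is already unescaped (no backslashes from dig).
def parse_dkim_record(txt)
  txt.split(';').each_with_object({}) do |pair, tags|
    # Limit the split to 2 parts: base64 key data can end in "=" padding.
    key, value = pair.strip.split('=', 2)
    next if key.nil? || key.empty?
    tags[key] = value.to_s
  end
end

record = parse_dkim_record('v=DKIM1; k=rsa; p=RSA_PUBLIC_KEY_DATA;')
puts record['v'] # the version tag
puts record['k'] # the key type
```

Splitting on the first equals sign only is the one detail worth remembering, since real key data is base64 and often ends in one or two of them.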

Unless you're running your own mail server, you'll almost always get the value for this record from your email provider. It'll be buried in your account settings, usually under a header like "Verified Domains".

Private Keys

The private key that corresponds to the public key in your DNS lives on the mail server. If you're using an email service provider like Mandrill, Postmark, or Mailgun they handle this for you. If you're running your own mail server you'll need to handle it yourself, which is out of scope for this post.

Embedded Message Signature

When a mail server wants to send a DKIM-signed message, it first calculates a cryptographic signature for the body and certain headers. Here's an example from a message I sent a few days ago:

DKIM-Signature:
  v=1;
  a=rsa-sha1;
  c=relaxed/relaxed;
  s=mandrill;
  d=petekeen.net;
  h=From:Subject:To:Message-Id:Date:MIME-Version:Content-Type;
  i=pete@petekeen.net;
  bh=82iZmY7kCbFDunaEckImLSxqHv8=;
  b=IQK/KMfy9xVjTU2TEIkWVaajqjmwdc9xnc3yByC6dZQjeFmYD3Rvaeu6lct44vBLymxkdT5Po7G6
   b5Li5KWjcBZJ95L6ur1DaBZDTN2E6aVwd+5cQ4zFm4MXhMC6uAssS3+eUK+ZFteDLgkmns+q/Gbt
   5bqJZuixpEhqgM4exLI=

This is again a set of key-value pairs, just like the DNS record. It says that the headers listed in h= (plus the DKIM-Signature header itself), combined with the hash of the email body in bh=, when signed by the private key that matches the public key at mandrill._domainkey.petekeen.net, produce the cryptographic signature in the b= field.

(You may be wondering how DKIM can sign a header that includes its own signature. The answer is that the b= field is treated as if it were an empty string when calculating the signature.)

The sending mail server embeds this DKIM-Signature header into the message. When a DKIM-compatible mail server receives a message with a signature, it downloads the public key specified in the header and processes the given signature against the message. If they match, the message is authentic. If they don't match, something's wrong. Either way, a new header is attached to the message named Authentication-Results that tells servers and spam filters further along how to handle the message.
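The "downloads the public key specified in the header" step boils down to combining the s= (selector) and d= (domain) tags into a DNS name. Here's a rough Ruby sketch of just that step. It does not verify anything, the method names are hypothetical, and the header is a shortened copy of the example above without the b= field.

```ruby
# Split a DKIM-Signature header value into a Hash of tag => value.
def parse_dkim_signature(value)
  value.split(';').each_with_object({}) do |pair, tags|
    key, val = pair.split('=', 2)
    next if val.nil?
    # Folded headers contain spaces and line breaks; drop them from values.
    tags[key.strip] = val.gsub(/\s+/, '')
  end
end

# Build the DNS name where a verifier would look up the public key.
def dkim_key_name(tags)
  "#{tags['s']}._domainkey.#{tags['d']}"
end

header = <<~HEADER
  v=1; a=rsa-sha1; c=relaxed/relaxed; s=mandrill; d=petekeen.net;
  h=From:Subject:To:Message-Id:Date:MIME-Version:Content-Type;
  bh=82iZmY7kCbFDunaEckImLSxqHv8=;
HEADER

tags = parse_dkim_signature(header)
puts dkim_key_name(tags)         # the TXT record to query
puts tags['h'].split(':').length # how many headers are covered by the signature
```

A real verifier would go on to fetch that TXT record, canonicalize the signed headers and body, and check the b= value against the public key, all of which I've left out here.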

Limits

Unlike SPF, DKIM doesn't have a built-in specification for how to handle failing signatures. It's up to each receiving server how to handle it. Some send a bounce message, some just attach the Authentication-Results header and let other servers handle it.

DKIM also explicitly doesn't handle what to do when a message has no signature at all. For that, we need another email deliverability record named DMARC. See my article Fix Your Email Deliverability with DMARC for more details on how to handle it.

SPF: Sunscreen for your Email

2015-01-05T00:00:00+00:00

Sender Policy Framework (SPF) is a type of email deliverability record that helps servers that receive email verify that the sender is allowed to send. It's been around for a few years and has been taken up by every large email provider. This post is a deep dive into how it works and what it's good for.

Why do we need SPF?

The protocol that we use to send email from server to server, SMTP, allows anyone to set anything as their From or MAIL FROM (the envelope-from) address. Effectively, if I know your email address I can send messages as if I were you, and nobody would be the wiser unless they looked very closely at the message headers and compared them with previous messages from you.

SPF is a standard for declaring what servers can actually send on your behalf. An SPF record is attached to your domain name as a DNS TXT record. Domains that have SPF are less likely to be used by spammers and phishers, which means genuine messages coming from those domains will be given higher reputation with receiving servers.

An Illustrative Example

Here is bugsplat.info's SPF record:

$ dig +short bugsplat.info txt
v=spf1
  ip4:104.131.72.15
  ip4:192.241.250.244
  include:servers.mcsv.net
  include:_spf.google.com
  include:spf.messagingengine.com
  ~all

An SPF record consists of a version tag (v=spf1) followed by one or more mechanisms. There's a bunch of different mechanisms, along with a whole macro system that you can use to make things really complicated for yourself, but we're going to stick to the basics for now.

Mechanisms are matched from left to right, until something returns true. At a high level, this says that if the email is coming from one of those two IPs, it's good. In addition, any of the IPs listed in those included policies are also good. The google.com one is pretty obvious: it means that Google's Gmail servers can send as bugsplat.info. servers.mcsv.net is Mailchimp's address, and spf.messagingengine.com is for Fastmail.

The last mechanism, ~all, will match everything that hits it. The tilde is a "qualifier" that means to fall through with a "soft fail" status. A soft fail basically says that the message came from an unknown source, but don't take action on it. The default qualifier is +, which means "accept the message". For the ip4 and include mechanisms above, if they match the sending server then the message will be accepted.

include is actually sort of a misnomer, because the contents of the referenced record don't actually get included anywhere. Including another record means: evaluate the email against this entire other SPF declaration. If it matches, then return the qualifier on the include mechanism (by default, +). If it doesn't, continue on with the rest of the mechanisms. The only thing that causes a match when evaluating an include is a +. Soft fail, normal fail, and neutral all cause the include to not match.
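Here's a small Ruby sketch of the left-to-right matching and the include behavior described above. It's deliberately incomplete: it only understands ip4, include, and all, and it reads records from a plain hash instead of doing DNS lookups. The domains and the included record are made up for the example.

```ruby
require 'ipaddr'

QUALIFIERS = { '+' => :pass, '-' => :fail, '~' => :softfail, '?' => :neutral }.freeze

# Evaluate a sender IP against a domain's SPF record.
# `records` maps domain => SPF string and stands in for DNS (an assumption).
def spf_check(ip, domain, records)
  sender = IPAddr.new(ip)
  terms = records.fetch(domain).split.drop(1) # drop the "v=spf1" version tag
  terms.each do |term|
    qualifier = QUALIFIERS.key?(term[0]) ? term[0] : '+' # default qualifier is +
    name, arg = term.sub(/\A[+\-~?]/, '').split(':', 2)
    matched =
      case name
      when 'all' then true
      when 'ip4' then IPAddr.new(arg).include?(sender)
      # An include only matches when the other record evaluates to a pass.
      when 'include' then spf_check(ip, arg, records) == :pass
      else false # every other mechanism is ignored in this sketch
      end
    return QUALIFIERS[qualifier] if matched
  end
  :neutral
end

records = {
  'example.com' => 'v=spf1 ip4:104.131.72.15 include:_spf.mailer.example ~all',
  '_spf.mailer.example' => 'v=spf1 ip4:198.51.100.0/24 -all'
}

puts spf_check('104.131.72.15', 'example.com', records) # listed directly
puts spf_check('198.51.100.7', 'example.com', records)  # covered by the include
puts spf_check('203.0.113.9', 'example.com', records)   # falls through to ~all
```

Note the third case: the included record ends in -all, but that fail doesn't leak out. The include simply doesn't match, and evaluation carries on to the outer record's ~all.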

Limits

SPF records are limited to 10 total DNS lookups. Every include generates a DNS lookup, as do many other mechanisms including ptr and a. The only common mechanisms that don't are all, ip4, and ip6. This limit, along with a practical limit to DNS TXT records of 450 characters, means that adding a new mail provider to your setup is actually a big deal. You have to be careful to not exceed either of those limits, lest receiving servers start throwing permanent errors.
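If you want a quick sanity check before adding another provider, something like this Ruby sketch will count the lookup-generating terms in a single record. The method name is hypothetical, and it only counts the top-level record. Lookups made inside included records count toward the limit too, so treat the result as a lower bound.

```ruby
# Mechanisms and modifiers that cost a DNS lookup during SPF evaluation.
LOOKUP_TERMS = %w[include a mx ptr exists redirect].freeze

# Count lookup-generating terms in one SPF record (top level only).
def spf_lookup_count(record)
  record.split.drop(1).count do |term|
    # Strip any qualifier, then keep the name before ":", "=", or "/".
    name = term.sub(/\A[+\-~?]/, '').split(%r{[:=/]}, 2).first
    LOOKUP_TERMS.include?(name)
  end
end

record = 'v=spf1 ip4:104.131.72.15 ip4:192.241.250.244 ' \
         'include:servers.mcsv.net include:_spf.google.com ' \
         'include:spf.messagingengine.com ~all'

puts spf_lookup_count(record) # counts the three include mechanisms
puts record.length            # compare against the practical TXT size limit
```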

SPF also doesn't give much direction on what to actually do with a message that fails the checks. Generally your records should be set up to soft-fail, which means receiving servers shouldn't actually take any action other than maybe giving the message a higher spam rating. There's a newer, related standard named DMARC which you can use to actually declare a comprehensive policy. See my article Fix Your Email Deliverability with DMARC for more details on how to work it.

If you're interested in more SPF mechanisms the official SPF webpage has all the details. This HOWTO goes into even more depth, along with an explanation of the SPF macro system, if you're so inclined.

P.S. Could your email deliverability use a boost? Mail Rep will help your business improve your deliverability in just two weeks. Click here to read about Mail Rep.

Email: The Good Parts

2014-12-31T00:00:00+00:00

Everybody knows what email is. You click "Compose", fill in the recipient's email address, write your message, maybe give it a pithy subject, and hit "Send". Some time later your recipient opens their email and reads your message. Simple, right?

There are an awful lot of interesting things happening between "Send" and "Some time later". This post is an overview of what happens when you hit "send", and how your message makes its way to your recipient.

Fundamentals

Under the hood, an email is just a text file. When you send an email, your email client (like Gmail, Mail.app, or Outlook) takes your text, injects some information into "headers" (formatted text before the start of your text), and hands it off to a mail server. Here's an example of a simple composed email:

From: Pete Keen <pete@petekeen.net>
To: Joe Example <joe@example.com>
Date: Sun, 28 Dec 2014 09:52:03 -0600
Subject: This is a simple message

Here's some text in the message. Lorem ipsum dolor sit amet, consectetur adipiscing elit. Duis interdum in neque sed tincidunt. Nullam sed auctor libero, sed facilisis ligula. Pellentesque habitant morbi tristique senectus et netus et malesuada fames ac turpis egestas. Aliquam sit amet dui a ligula ultrices porta quis vel mi. Nulla ac urna augue. Donec euismod tristique odio eget convallis. Cras ac quam vel sapien pharetra luctus. Aliquam sem eros, auctor non fringilla sed, viverra non turpis. Sed suscipit egestas posuere. In hac habitasse platea dictumst. Nulla facilisi.

As you can see, there's really not much to a simple email. Messages can get far more complex, with multi-part HTML messages and attachments, but fundamentally they're just formatted text files with headers.

Mail Transmission

After your client (mail user agent) composes your message it hands it off to a mail server (mail transfer agent), which will hand it to at least one other server at some point before it ultimately ends up in your recipient's inbox. Mail transmission happens via the Simple Mail Transfer Protocol (SMTP), a simple text-based protocol. Here's a simple SMTP transaction (lines with right arrows are what we send to the server, lines with left arrows are what we receive from the server):

← 220 web01.bugsplat.info ESMTP Postfix
→ HELO client.bugsplat.info
← 250 web01.bugsplat.info
→ MAIL FROM: test@bugsplat.info
← 250 2.1.0 Ok
→ RCPT TO: pete@bugsplat.info
← 250 2.1.5 Ok
→ DATA
← 354 End data with <CR><LF>.<CR><LF>
→ From: test@bugsplat.info
→ To: pete@bugsplat.info
→ Subject: this is a test
→ 
→ Example test.
→ .
← 250 2.0.0 Ok: queued as CD76D606BB
→ QUIT
← 221 2.0.0 Bye

The transaction starts with an announcement, which we respond to with a HELO command specifying who we are. The server replies with its domain name.

Next, we tell the server who the message is from and who it should ultimately be delivered to, then we actually send the message. An important concept here is that the MAIL FROM and RCPT TO commands are separate from the actual contents of the message and the From and To headers. They can be completely different. This is actually how the "BCC" function in email is implemented: the addresses that are BCC'd are sent the message with their address in RCPT TO, but only the primary address appears in the To header.
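To illustrate the split between the envelope and the message, here's a minimal Ruby sketch that builds both from the same inputs. Nothing is sent anywhere. The build_message method and the addresses are hypothetical, and a real client would hand these three values to an SMTP library.

```ruby
# Build the SMTP envelope and the message text as separate things.
def build_message(from:, to:, bcc:, subject:, body:)
  # BCC addresses never appear in the headers.
  headers = ["From: #{from}", "To: #{to.join(', ')}", "Subject: #{subject}"]
  {
    mail_from: from,     # sent with the MAIL FROM command
    rcpt_to: to + bcc,   # one RCPT TO command per address, BCC included
    data: (headers + ['', body]).join("\r\n") # sent after the DATA command
  }
end

message = build_message(
  from: 'test@example.com',
  to: ['pete@example.com'],
  bcc: ['hidden@example.com'],
  subject: 'this is a test',
  body: 'Example test.'
)

puts message[:rcpt_to].inspect
puts message[:data]
```

The BCC'd address shows up in the envelope recipients but nowhere in the text that follows DATA, which is exactly why the other recipients can't see it.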

Each time a mail server receives a message it adds a Received header with its hostname, the hostname of the machine that sent it, the exact time, and some other information, generally to the top of the stack of headers. Here are the Received headers from the message we just sent above:

Received: by 10.140.147.132 with SMTP id 126csp2632885qht;
        Sun, 28 Dec 2014 09:44:19 -0800 (PST)
Received: from web01.bugsplat.info (web01.bugsplat.info. [192.241.250.244])
        by mx.google.com with ESMTPS id kg1si49786991pad.162.2014.12.28.09.44.17
        for <peter.keen@gmail.com>
        (version=TLSv1.1 cipher=ECDHE-RSA-RC4-SHA bits=128/128);
        Sun, 28 Dec 2014 09:44:18 -0800 (PST)
Received: from client.bugsplat.info (sales02.bugsplat.info [104.131.72.15])
    by web01.bugsplat.info (Postfix) with SMTP id CD76D606BB
    for <pete@bugsplat.info>; Sun, 28 Dec 2014 17:43:19 +0000 (UTC)

In diagram form, this is what the flow looks like:

Reading the headers from the bottom (and the diagram from the top), sales02.bugsplat.info connected to web01.bugsplat.info (lying about who it actually was) and sent a message destined for pete@bugsplat.info. web01 is configured to forward that address to my Gmail address, so it connects to Google's mail server and passes along the message using the exact same protocol as shown above. Within Gmail there's one more hop to the server that handles my account. Each time an email message is sent from server to server, SMTP is involved.

Lots of other information lives in the headers, including a unique message ID, timestamps, email addresses where you can send complaints, and cryptographic signatures.

You're probably wondering how web01.bugsplat.info found the Gmail server in the first place. How did it know where to send the message? It looked at the DNS, where Google has set up MX (mail exchange) records for the gmail.com domain. Here's what those look like:

$ dig +short gmail.com mx
5 gmail-smtp-in.l.google.com.
10 alt1.gmail-smtp-in.l.google.com.
20 alt2.gmail-smtp-in.l.google.com.
30 alt3.gmail-smtp-in.l.google.com.
40 alt4.gmail-smtp-in.l.google.com.

Mail servers are tried in priority order, where the lower number has higher priority. If a receiving mail server doesn't respond the sender will try the next in the list, and if it runs out of servers to try it will hold onto the message for a while before trying again. You can look up your company's MX records just as easily with the dig tool:

$ dig +short yourdomain.com mx
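The priority ordering is easy to sketch in Ruby. This takes the text that dig prints and returns the hosts in the order a sender would try them. The method name mx_try_order is made up, and the sample input is the gmail.com output from above, shuffled to show that the sort does the work.

```ruby
# Parse `dig +short <domain> mx` output and return the hosts in the
# order a sending server would try them (lowest number first).
def mx_try_order(dig_output)
  dig_output.lines
            .map(&:split)
            .reject(&:empty?) # skip blank lines
            .map { |preference, host| [Integer(preference), host] }
            .sort_by(&:first)
            .map(&:last)
end

output = <<~DIG
  30 alt3.gmail-smtp-in.l.google.com.
  5 gmail-smtp-in.l.google.com.
  40 alt4.gmail-smtp-in.l.google.com.
  10 alt1.gmail-smtp-in.l.google.com.
  20 alt2.gmail-smtp-in.l.google.com.
DIG

puts mx_try_order(output)
```

This sketch doesn't say anything about how ties are broken when two servers share a number, so don't read too much into the ordering in that case.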

For more information on DNS, check out DNS: The Good Parts.

Spam

Spam is sort of a nebulous term, but it boils down to unwanted messages. Things you didn't sign up for, don't want, and/or are actively harmful.

Spam has been around for a long time. In fact, the first recognized unwanted message on the internet was sent in 1978, before SMTP was even around (it was an advertisement for a presentation by Digital Equipment Corporation). Since then it's morphed into an actual business model. Drugs, merchandise, insurance offers, and everything else under the sun are advertised via spam messages, sent out by the billions by rogue mail servers. In addition to real (but unwanted) products, the same techniques get used to send out malicious messages like phishing attacks.

The protocol underlying email, SMTP (Simple Mail Transfer Protocol), is from a much more civilized era. There's very little sanity checking or verification in the base protocol, which means it's trivial to forge various parts of a message, including headers. If I know your address I can trivially craft a message that, by all outward appearances, looks like it came from you.

Transactional vs Newsletters

There are two broad swaths of email that businesses send today, other than personal correspondence. Transactional email is generally generated in response to events within your application and goes to one person, one at a time.

On the other hand, newsletters are considered "bulk" email. They go from you to a list of people, all at the same time. This doesn't mean they're spam, just that you send a large number of almost-identical messages at the same time.

Email service providers distinguish their handling of these different types of messages. For example, spam rules are applied more stringently to messages that they detect as bulk. Depending on the keywords and general format of your messages, Gmail might automatically decide to put your bulk email into their "Promotions" tab.

Trust

Over the years, technologies have come around that help to prevent forgery and help the big providers like Gmail and Yahoo trust your messages, both transactional and newsletters. Those are:

SPF, Sender Policy Framework. Lets the world know what servers are authorized to send mail as your domain.
DKIM, DomainKeys Identified Mail. Allows sending email servers to cryptographically sign messages, verifying that you authorized them to send email as you.
DMARC, Domain-based Message Authentication, Reporting, and Conformance. Specifies a policy for recipient servers, telling them what to do if a message fails SPF or DKIM checks.

I went into these somewhat in Fix Your Email Deliverability with DMARC. In future posts I'll be taking a deep dive into each one, talking about its history, what it's good for, and how to best deploy it.

Your DNS Provider Should Not Be Your Registrar

2014-12-01T00:00:00+00:00

Note: This article references events that happened in December 2014.

Hopefully, by the time you're reading this DNSimple will have recovered from their DDoS-powered outage. Today has probably been a terrible day for everybody over there and I'm sure they're ready for a break. While you can't do much to directly defend against DDoS attacks, you can insure yourself against DNS outages.

If you're a DNSimple customer right now or a NameCheap customer several times earlier this year, you know what happens when your DNS service goes out. Your website is inaccessible, emails are probably bouncing, and so are customers and their wallets. It's all around bad news.

The cheapest insurance you can buy is to host your nameservers and your registrar at different companies. That way, if your registrar gets attacked it's no big deal because they're not involved with your day-to-day name resolution, and if your nameservers are attacked you can easily change them. You can't do that if the web interface you need to use is down at the same time as your nameservers.

Splitting your DNS services between two or more companies adds a tiny bit of one-time overhead to setting up a new domain name, but the peace of mind this strategy buys is worth it. You can be back up and servicing customers at a new DNS provider in as little as five minutes, depending on your registrar, while your previous/primary DNS provider is struggling with an attack for hours.

Personally, I use Amazon's Route53 service as my nameservers and either Gandi (for .io) or Namecheap (for everything else) as my registrars, but you can use whoever you want. You could even use DNSimple as your registrar and Route53 as your nameserver if you want. The point is that you should have at least two wholly separate companies involved.

What is the best modern payment provider?

2014-11-25T00:00:00+00:00

Your business has to get paid, but how that happens is a complicated question, and the modern payment landscape is vast. How do you pick?

What payment service is the best fit for your business?

This list highlights the biggest modern payment providers in the market, where "modern" includes features like integrated merchant account and gateway services, RESTful APIs, and well maintained SDKs.

You can use this list to help you narrow down the choices for your business.

Stripe

Credit card payments
Alipay in China and Bitcoin (in beta)
2.9% + $0.30, volume breaks after $80k/mo
$0.25 for API-driven payouts (automatic payouts are free)

Pros

Good overall fee structure
Great API and documentation
Built-in subscription services
Excellent 3rd party ecosystem
Offers Alipay and Bitcoin
Many countries
Flexible statement descriptors

Cons

Does not offer the money flow flexibility that some businesses require
Not available worldwide (yet)
No phone support (still)

Balanced

Credit cards and US bank accounts
2.9% + $0.30 credit cards, 1% + $0.30 bank account ($5 fee cap)
$0.25 to non-merchant-owned bank accounts

Pros

Extremely flexible (escrow account, multiple ways to get money in and out)
Decent service
Very nice API
Flexible statement descriptors

Cons

Poor documentation
No built-in subscription services
No 3rd party ecosystem
US only

Braintree

Credit cards
2.9% + $0.30 after first $50k account lifetime gross volume

Pros

Excellent support
Easy PayPal integration
Easy drop-in form (v.zero)
Flexible statement descriptors
Braintree Ignition (no fees on first $50k)

Cons

Owned by PayPal
Dashboard interface not very powerful
No 3rd party ecosystem
US, Canada, Australia, Europe

WePay

Credit cards and bank transfers
2.9% + $0.30 cards, 1% + $0.30 bank transfers

Pros

Handles chargebacks transparently and without any fees
Easy iframe integration
Simple built-in subscriptions

Cons

US and Canada only
Fixed statement descriptor "WEPAY, INC"

PayPal

Credit card and bank transfers
2.9% + $0.30 with price breaks at various volumes

Pros

Very much international
Easy to get started
Comfortable for customers

Cons

Most APIs are old and bad
Documentation is confusing
IPNs are brittle and easy to mess up
Can be expensive

Dwolla

Bank transfers
$0.25 per transfer, in or out

Pros

Inexpensive
Easy to use

Cons

No credit card processing
Users have to have accounts
Slow, since it's just ACH
API is limited
US only

BitPay

Bitcoin
Transactions are free
Free with limited email support, $300/mo for phone support

Pros

Easy to use API
No transaction fees
Offers payment diversity for your customers
Prices in hard currency, customer pays with BTC
Can settle to your bank account in a variety of currencies

Cons

Customers have to have Bitcoin already
Bitcoin is still a new, speculative, fast-moving universe
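As a rough worked example of the fee structures listed above, here's a Ruby sketch that prices a single charge under three of them. The rates come straight from the lists. The rounding, and the assumption that Balanced's $5 cap applies to the whole bank account fee, are my guesses rather than anything the providers document here, and the method names are made up.

```ruby
# All amounts are integer cents to avoid floating-point drift.

# 2.9% + $0.30, the card rate listed for Stripe, Balanced, Braintree,
# WePay, and PayPal.
def card_fee(amount_cents)
  (amount_cents * Rational(29, 1000)).round + 30
end

# 1% + $0.30 with a $5 cap, the bank account rate listed for Balanced.
# Capping the whole fee is an assumption.
def balanced_bank_fee(amount_cents)
  [(amount_cents * Rational(1, 100)).round + 30, 500].min
end

# Flat $0.25 per transfer, the rate listed for Dwolla.
def dwolla_fee(_amount_cents)
  25
end

[10_000, 100_000].each do |amount|
  puts format('$%.2f charge: card $%.2f, bank $%.2f, Dwolla $%.2f',
              amount / 100.0,
              card_fee(amount) / 100.0,
              balanced_bank_fee(amount) / 100.0,
              dwolla_fee(amount) / 100.0)
end
```

Volume discounts, payout fees, and chargebacks aren't modeled, so use this for ballpark comparisons only.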

P.S. Looking for help navigating this list and choosing the provider, or maybe mix of providers, that best fit your business? Contact me and let's chat.

Stripe removed SSLv3 support. Here's how to fix the HTTP 401 errors.

2014-11-20T00:00:00+00:00

On November 15th Stripe deprecated SSLv3 because of the POODLE vulnerability. On the whole, this has been a good and welcome change, because SSLv3 has been terrible for a very long time.

The problem is that on some systems this causes backend API requests to start failing with an error message from Stripe because they're unable to auto-negotiate TLSv1.2.

Here are three ways to help fix this:

1. Upgrade Ruby

This is the cleanest solution. Upgrade your Ruby to 2.1.4, 2.0.0-p594, or 1.9.3-p550. In those versions, SSLv3 is disabled, which forces auto-negotiation to pick TLSv1.2.

You can also try upgrading your rest-client gem to the latest version in your Gemfile:

gem 'rest-client', '>= 1.7.2'

This has worked for some people but it depends on your situation. You might already be running the latest, in which case carry on to the next option.

Also, if you're using the rest_client gem (notice the underscore instead of dash), be aware that this re-enables SSLv3. See this commit from late October 2014.

2. Patch OpenSSL

At the bottom of the link in #1 there's a monkeypatch you can apply that changes OpenSSL to remove SSLv3.

3. Patch Stripe

If you can't or won't upgrade your Ruby and changing OpenSSL is too scary, you're left with the option of monkeypatching Stripe's library directly. Drop this code in an initializer:

module Stripe
  def self.execute_request(opts)
    RestClient::Request.execute(opts.merge(ssl_version: :TLSv1))
  end
end

This is basically the solution that was proposed to Stripe but they rejected it because when new versions of TLS come out it'll break. So, don't be surprised when it breaks two years down the line, but for now it works.

Of these three options, if you can go with the first one please do. It's the cleanest and least brittle solution.

P.S. Want me to solve these kinds of problems for your business, before they become problems? Contact me about Payola Pro. You'll be glad you did.

Payola v1.2: Now with Subscriptions

2014-11-17T00:00:00+00:00

Today is release day for Payola v1.2.0 and the big watchword is subscriptions. So now that they're here, how do you use subscriptions with Payola? It's easy:

Install the gem
Configure a model
Set up a form
Profit!

Let's go through these in a bit more detail.

1. Installation

Add payola-payments to your Gemfile:

gem 'payola-payments', '>= 1.2.0'

Now run bundler and install the migrations:

$ bundle install
$ rails g payola:install
$ rake db:migrate

Payola assumes you have your Stripe keys in environment variables named STRIPE_SECRET_KEY and STRIPE_PUBLISHABLE_KEY. Make sure to set those up or configure them in Payola's initializer.

2. Model

Payola tracks everything about a subscription for you, but you have to tell it about your plans. For that, create a Plan model and include the appropriate Payola module:

$ rails g model Plan \
    stripe_id:string \
    name:string \
    amount:integer \
    interval:string \
    interval_count:integer
$ rake db:migrate

Now open up app/models/plan.rb and add the concern:

class Plan < ActiveRecord::Base
  include Payola::Plan
end

At this point you should be able to open up a console and add a plan:

$ rails console
irb(main):001:0> Plan.create(name: 'Test Plan', stripe_id: 'test-plan', amount: 100, interval: 'month', interval_count: 1)

This will create a Plan object as well as create a plan within Stripe.

3. Form

Payola currently only supports custom forms for subscriptions but it makes it as easy as possible to do. Let's create a simple controller first at app/controllers/subscriptions_controller.rb:

class SubscriptionsController < ApplicationController
  def new
    @plan = Plan.first
  end
end

This is what our form is going to look like:

Here's what the view looks like, using Bootstrap 3 for layout:

<div class="row">
  <div class="col-xs-8 col-xs-offset-2">
    <div class="well">
    <%= form_tag('/subscribe',
      role: 'form',
      class: 'payola-subscription-form',
      'data-payola-base-path' => '/payola',
      'data-payola-error-selector' => '.payola-error',
      'data-payola-plan-type' => @plan.plan_class,
      'data-payola-plan-id' => @plan.id) do %>
      <div class="form-group">
        <label>Email Address</label>
        <input type="email"
               name="email"
               data-payola="email"
               placeholder="you@example.com"
               class="form-control"></input>
      </div>
      <div class="form-group">
        <label>Card number</label>
        <input type="text"
               size="20"
               data-stripe="number"
               class="card-number form-control"
               placeholder="**** **** **** ****"></input>
      </div>
      <div class="row">
        <div class="col-md-6">
          <div class="form-group">
            <label>Exp</label>
            <input type="text"
                   size="8"
                   class="exp-date form-control"
                   placeholder="MM / YY"></input>
            <input type="hidden" data-stripe="exp_month"></input>
            <input type="hidden" data-stripe="exp_year"></input>
          </div>
        </div>
        <div class="col-md-6">
          <div class="form-group">
            <label>CVC</label>
            <input type="text"
                   size="4"
                   data-stripe="cvc"
                   class="form-control"
                   placeholder="***"></input>
          </div>
        </div>
      </div>
      <div class="text-center">
        <input type="submit" value="Subscribe" class="btn btn-info btn-lg"></input>
      </div>
      <div class="alert alert-warning payola-error" style="display: none"></div>
      <% end %>
    </div>
  </div>
</div>

Payola's subscription form behavior is triggered by your form having the class payola-subscription-form. After that, you just mark up the data destined for Stripe with data-stripe attributes. Any field with a name attribute will get submitted along with the form once Payola is done doing its thing.

This particular form has a few additional niceties provided by the jquery.payment library from Stripe. Here's the javascript for the form, in app/assets/javascripts/form.js:

$(function() {
  $('.exp-date').payment('formatCardExpiry');
  $('input[data-stripe="number"]').payment('formatCardNumber');
  $('input[data-stripe="cvc"]').payment('formatCardCVC');

  $('.exp-date').on('keyup', function() {
    var e = $('.exp-date').first();
    var out = $.payment.cardExpiryVal(e.val());
    $('input[data-stripe="exp_month"]').val(out.month);
    $('input[data-stripe="exp_year"]').val(out.year);
  });

  $('.card-number').on('keyup', function() {
    var e = $('.card-number').first();
    var type = $.payment.cardType(e.val());
    var img = "card.png";
    switch(type) {
        case "visa":
          img = "visa.png";
          break;
        case "mastercard":
          img = "mastercard.png";
          break;
        case "discover":
          img = "discover.png";
          break;
        case "amex":
          img = "amex.png";
          break;
    }
    e.css('background-image', 'url(/images/' + img + ')');
  });

});

This does three separate things. First, it sets up formatters on the form's expiration date, card, and cvc number fields. Next, the JS sets up another event handler that uses jquery.payment's cardExpiryVal function to parse the card expiration date into a month and a year, and sets the hidden fields to that value.

Finally, it sets up an event handler that changes the credit card icon based on what type of card the customer is entering. The particular images that I'm using are from a very nice set of flat icons off of Creative Market. Shopify put out a free set a while back as well.

In order to make the icon actually show up in the right place, you need to add this small CSS snippet:

.card-number {
  background-image: url(/images/card.png);
  background-repeat: no-repeat;
  background-size: 30px;
  background-position: right 10px center;
}

(card.png is the default green card image.)

Ok, so now we have a form, but what exactly happens here? Payola Subscriptions works like this:

When the customer hits the submit button, Payola intercepts that and sends the fields tagged with data-stripe to Stripe to get a card token.
When Stripe returns a token, Payola POSTs that along with the email address field to /payola/subscribe/:plan_class/:plan_id, which creates a Payola::Subscription and attempts to create a Stripe::Customer. All of this happens in the background, so the user's browser polls your application every 500 milliseconds until the background job is done.
When Payola is finished, the user's browser will submit the original form to your application, with an additional payola_subscription_guid param tacked on. Your controller should associate that Payola::Subscription object with your customer's record.

How your application handles step 3 is up to you. Some applications may want to attach the subscription to an organization account while others may want to attach it to a user directly. In any case, Payola::Subscription has a polymorphic owner attribute that you should use. For example:

sub = Payola::Subscription.find_by(guid: params[:payola_subscription_guid])
sub.owner = current_user
sub.save!

4. Profit!

At this point you have a functional subscription system. Payola provides all sorts of hooks and notifications that you can use to trigger additional application-specific behavior, which you can read all about in the README.

P.S.: Payola Pro is an add-on to Payola that provides priority email support, pre-built integrations with several 3rd party services, Stripe Connect marketplace support, along with a lawyer-friendly commercial license. You can read all about it at payola.io/pro, and all of the modules have documentation on the Payola wiki.

P.P.S.: I want to thank Jeremy Green who has been instrumental in driving this forward. Subscriptions would probably be another month away without Jeremy's help.

Building Payola Extensions

2014-11-11T00:00:00+00:00

A few weeks ago I introduced Payola, a drop-in Rails engine for setting up Stripe billing. Since that time, it's gained over 400 stars on GitHub and the gem has been downloaded almost 2000 times. The most requested feature, subscription payments, is well on its way to being completed.

Payola is more than just a checkout button. It has hooks at various points in the payment flow that let you take action and tie Payola into your application to do things like manipulate the sale object before the charge happens or override the low-level arguments that Payola sends to Stripe. It also has a rich set of notifications when payments complete, fail, or are refunded. In this post, we're going to build a simple extension that sends push notifications when someone buys a product.

There are various third party services that provide push notifications but today we're going to use Pushover, an inexpensive cross-platform personal-use notification system. It's not for big broadcast groups or marketing like Urban Airship or Parse. Instead, Pushover is specifically for our use case: letting your application talk to the developer via push notifications.

Note: This article assumes that you've set up Payola in your application already. If you haven't, check out the Payola docs for getting started instructions.

Install Pushover

We're going to use Rushover to integrate with Pushover. It's simple to use and has no big external requirements. Add it to your Gemfile:

gem 'rushover'

and run bundle install.

You'll need to register for an account and install the app on your device. It's free to try for five days, and then you'll need to purchase a licence for $4.99.

The Hook

The specific Payola hook we're going to use is an event named payola.sale.finished. Payola fires this event when the sale is complete at Stripe. Internally Payola listens to this event to do things like send automatic emails.

Let's set up the listener in config/initializers/payola.rb:

Payola.configure do |config|
  # Queue the push notification whenever a sale finishes.
  config.subscribe 'payola.sale.finished' do |sale|
    Payola.queue!(PushoverCallback, sale.guid)
  end
end

This takes advantage of Payola's built-in job queuing system which integrates with whatever system you have on hand, defaulting to ActiveJob's inline system if it's available. If you want, you could use your queueing system directly.

The Callback

The code to talk to Pushover is pretty simple. Let's create a file at app/services/pushover_callback.rb:

class PushoverCallback
  def self.call(guid)
    sale = Payola::Sale.find_by(guid: guid)
    price = sprintf("%0.2f", sale.amount / 100.0)

    client = Rushover::Client.new(ENV['PUSHOVER_API_TOKEN'])

    client.notify(
      ENV['PUSHOVER_USER_TOKEN'],
      "#{sale.email} just bought #{sale.product.name} for #{price}",
      device: ENV['PUSHOVER_DEVICE'],
      sound: 'cashregister'
    )
  end
end

From the top, we look up the sale in the database and format the price as something useful. Then, we create a client and call the notify method on it. This tells Pushover to send the given message to the user identified by the token. The device argument tells Pushover to only send the notification to that specific device and is optional. The other argument, sound, lets you pick from a range of pre-defined options. I think the sound an old-school cash register makes is perfectly appropriate, but maybe an alien alarm makes more sense to you.
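The message formatting is the easiest piece to check in isolation. Here is a minimal sketch, assuming a hypothetical helper named sale_message that is not part of Payola or Rushover. It only mirrors the cents-to-dollars conversion used in the callback above.

```ruby
# Hypothetical helper (not part of Payola or Rushover): builds the same
# notification text as PushoverCallback, but from plain values.
def sale_message(email, product_name, amount_cents)
  # USD amounts are stored as integer cents, so divide by 100.0 for dollars.
  price = format("%0.2f", amount_cents / 100.0)
  "#{email} just bought #{product_name} for #{price}"
end

puts sale_message("buyer@example.com", "Example Book", 4900)
```

Pulling the string building out like this lets you tweak the wording without sending yourself a push notification every time.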

P.S.: Payola Pro has pre-built integrations like the example above for Mailchimp and Mixpanel, with many more on the way. It also brings support for Stripe Connect marketplaces, a commercial-friendly license, and priority email support. Check it out!

Introducing Payola

2014-10-20T00:00:00+00:00

I released an open source Rails engine named Payola that you can drop into any application to have robust, reliable self-hosted Stripe payments up and running with just a little bit of fuss.

When you're setting up Stripe in a Rails application there are a lot of choices you have to make. What should you use for webhooks? Do you even need webhooks? How much information should you keep in your database? Should you use Checkout or do you need to design your own form? Amongst all of these choices, you also have to decide what libraries you want to use, and boy howdy are there even more options here. Koudoku, StripeEvent, Stripe::Rails, not to mention commercial options like Gumroad, Plasso, and Cargo.

One of the reasons why I wrote my book Mastering Modern Payments: Using Stripe with Rails is to help you narrow down that set of choices to something reasonable, and I think it does a very good job of it. That said, even if you're using my book you still have to actually write the code to implement Stripe. Not that it's a lot of code, but it's basically always the same.

For the recent relaunch of MMP I decided to actually sit down and formalize the "hows" laid out in the book into a Rails engine. Anyone can drop Payola into an application and have payments going without too much drama, and notably none of the choices outlined above.

What Payola Does

Payola provides a complete solution for accepting Stripe payments within a Rails application. It is focused on selling items one at a time and includes a drop-in partial for setting up a Stripe Checkout button, along with a complete server-side asynchronous processing system for completing payments with Stripe.

To see a demo, click on or inspect one of the buttons in the Packages section on the MMP website.

Design

I designed Payola to be robust in the face of failure, whether that means network failure, bugs, Stripe API slowness, or anything in between. It consists of a few moving pieces:

User-facing Javascript that gets a token from Stripe and sends it, along with other Payment-related information, to the backend.
An async backend that uses a state machine to track a payment through every stage.
Integration with your application via a model concern and ActiveSupport notifications when interesting things happen.

Payola has built-in support for Sidekiq and Sucker Punch, but it's easy to add new backend worker systems which makes it even easier to adapt to your current system.

Payola should also be transparent to your customers. There should never be a time when they actually see a Payola URL in their address bar, nor should they ever see something Payola branded. From a buyer's perspective it should be your site selling the product, not Payola.

How A Click Becomes A Charge

Here are all of the steps in a successful charge:

Buyer clicks a checkout button, which spawns a Stripe Checkout lightbox.
Buyer enters their card information and clicks the Pay button.
Stripe validates their card information and creates a token.
Token is passed to Payola's javascript, which in turn POSTs it to the backend.
Payola creates a Payola::Sale object with the token and sets it to pending state.
Payola queues a background job to create a Stripe charge for the corresponding sale and passes the sale's guid attribute back to the JS.
The buyer's browser disables the button and polls Payola every 500ms asking for the state of their charge.
The background job calls the Payola.charge_verifier callback, then creates the charge with Stripe, then sends the payola.<product>.sale.finished notification to your application.
The background job finally sets the sale's state to finished, which is picked up by the JS.
The user's browser redirects to /payola/confirm/<guid> and then is immediately redirected to whatever the product's redirect_path returns, defaulting to /.

A charge will typically fail in the background job (step 8), either because the charge_verifier rejects it or Stripe rejects it. In that case, the sale is set to errored, the error message is set in the error column, and the Payola JS shows it in a (customizable) div after re-enabling the button. Your application will also receive a payola.<product>.sale.errored notification.
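To make the pending, finished, and errored states concrete, here is a toy model of that lifecycle. SketchSale and its method names are hypothetical and exist only for this illustration. Payola's real state machine lives inside the engine and talks to Stripe, which this sketch replaces with plain callables.

```ruby
# A toy model of the sale lifecycle described above. SketchSale is
# hypothetical; it is not Payola's implementation.
class SketchSale
  attr_reader :state, :error

  def initialize
    @state = :pending
  end

  # Stand-in for the background job. The verifier and the charge are
  # passed in as callables so the sketch runs without Stripe.
  def process!(verifier:, charge:)
    verifier.call(self)
    charge.call(self)
    @state = :finished
  rescue StandardError => e
    # Either the verifier or the charge raised: record why and mark errored.
    @state = :errored
    @error = e.message
  end
end

ok = SketchSale.new
ok.process!(verifier: ->(_sale) { true }, charge: ->(_sale) { true })
puts ok.state

declined = SketchSale.new
declined.process!(
  verifier: ->(_sale) { true },
  charge: ->(_sale) { raise "Your card was declined." }
)
puts declined.state
puts declined.error
```

The polling javascript only ever needs to read the state and, on failure, the error message, which is why both are exposed as simple readers here.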

Installation

Installing Payola in your app is just a few steps. First, add the gem:

gem 'payola-payments'

Then, run the installer and install the migrations:

$ rails g payola:install
$ rake db:migrate

Next add the Payola::Sellable concern to the models you want to sell:

class SomeProduct < ActiveRecord::Base
  include Payola::Sellable
end

Your model needs three attributes:

permalink: a unique, human readable name
name: a short description
price: the price for the sellable in whatever format Stripe expects. For USD this is cents, for other currencies it could be different.

By default Payola will use USD but you can change that by adding an optional currency method to your sellable model. This can either be a fixed method if you're only using one currency, or it can be a column in the database if your products come in multiple currencies.

Optionally, you can provide a method named redirect_path. This method takes a Payola::Sale instance and returns a path where Payola should redirect the browser after a successful purchase. If you don't provide this Payola will redirect to '/'.
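As a rough sketch of those two optional methods, here is a plain-Ruby stand-in for a sellable. In a real application this would be an ActiveRecord model that includes Payola::Sellable. The class names, the lowercase currency code, and the /downloads path are all assumptions made up for illustration.

```ruby
# Plain-Ruby stand-in for a sellable model, so the sketch runs without Rails.
class SketchProduct
  attr_reader :permalink, :name, :price

  def initialize(permalink:, name:, price:)
    @permalink = permalink
    @name = name
    @price = price # in the unit Stripe expects (pence for this example)
  end

  # Fixed currency for a shop that only sells in one currency.
  def currency
    "gbp"
  end

  # Receives the sale and returns the path the buyer should land on.
  def redirect_path(sale)
    "/downloads/#{sale.guid}"
  end
end

# Minimal stand-in for Payola::Sale; only the guid is needed here.
FakeSale = Struct.new(:guid)

product = SketchProduct.new(permalink: "example-book", name: "Example Book", price: 4900)
puts product.currency
puts product.redirect_path(FakeSale.new("abc123"))
```

If your products come in several currencies, the currency method would simply become a database column instead.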

Finally, use the checkout partial to render a checkout button:

<%= render 'payola/transactions/checkout',
    sellable: SomeProduct.first %>

While the checkout partial has reasonable defaults for getting off the ground, you can customize basically every aspect of it. See the documentation for details.

Event Handling

Stripe has excellent support for webhook events and the StripeEvent gem does an excellent job handling them. Payola thinly wraps StripeEvent and adds a bit of behavior. To receive events, just set up a webhook url in your Stripe account settings that points at https://www.example.com/payola/events. Then, configure an event listener in config/initializers/payola.rb:

Payola.configure do |config|
  config.subscribe 'charge.succeeded' do |event|
    puts "whoohoo!"
  end
end

Payola adds deduplication to StripeEvent. It records every event_id that comes in and will only ever process an event once. If you'd like to further filter events, you can set event_filter, which should either return a Stripe::Event or nil if you'd like to stop processing.
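The process-once behavior can be sketched in a few lines. This is only an illustration: the class and method names are hypothetical, events are plain hashes rather than Stripe::Event objects, and the seen ids live in an in-memory Set instead of wherever Payola actually records them.

```ruby
require "set"

# Toy illustration of process-once semantics keyed on event id.
class SketchEventDeduper
  # event_filter returns the event to keep processing, or nil to stop.
  def initialize(event_filter: ->(event) { event })
    @seen = Set.new
    @event_filter = event_filter
  end

  # Yields the event at most once per id; returns true if it was processed.
  def handle(event)
    return false if @seen.include?(event[:id])

    @seen << event[:id]
    filtered = @event_filter.call(event)
    return false if filtered.nil?

    yield filtered
    true
  end
end

deduper = SketchEventDeduper.new
processed = []
2.times do
  deduper.handle({ id: "evt_1", type: "charge.succeeded" }) { |e| processed << e[:id] }
end
puts processed.inspect # ["evt_1"]
```

Webhooks can be delivered more than once, so keying on the event id is what keeps a retried delivery from triggering your listener twice.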

In addition to Stripe's webhooks, you can listen for three special events:

payola.<underscored product class>.payment.finished
payola.<underscored product class>.payment.failed
payola.<underscored product class>.payment.refunded

These are invoked with the corresponding Payola::Sale, not a Stripe::Event and are executed in-line with the async processing chain, which means you can do things like create a user or send an email before the user-facing javascript returns.

What's Next

Currently Payola does not handle subscriptions or marketplaces, so those will be next on the list. Along with those I'll be adding support for custom forms instead of the Checkout button. I'm also planning on building out a Pro version that will include priority support and a bunch of pre-built integrations for external systems like Mailchimp, Mixpanel, Infusionsoft, and more.

Here's some more links to Payola stuff:

Send me an email if you'd like to talk about Payola or Payola Pro.

Building a Private Backplane Network for your VPSs with ZeroTier

2014-09-14T00:00:00+00:00

Almost all of my applications, both public and personal, run on a collection of virtual private servers (VPSs) hosted in various places including DigitalOcean and my own data center (i.e. the Mac mini in my basement). For a long time I've wanted to set up certain things, like centralized logging or metrics collection, but I've always been stopped by this idea that I can't run that stuff across the public network.

A few months ago I ran across a product named ZeroTier that, among other things, allowed me to set up this network without having to invest the time in attempting to build (or purchase) a traditional VPN. This post is going to talk about why, and how, you can replicate this setup.

What's a Backplane Network?

A backplane network is a separate, private network interface that you can use to send privileged traffic between hosts within your network. Sometimes this is called dual-homing. You can use this backplane network for anything you don't want to send over the public network. For example, monitoring or log streams, database servers, or internal admin dashboards. I personally use mine for all of the above.

Managing this kind of thing if you control your hardware and physical network is easy. You just put everything into appropriate VLANs and DMZs and make sure your router knows what to do. However, if you don't control your physical installation because your hosts are spread across providers, cities, or continents, it becomes a lot harder. Basically your options are to set up a VPN or try to send traffic over SSH tunnels.

ZeroTier is sort of like a VPN, in that it sets up an encrypted overlay network on top of the public network. The twist is that, instead of having a central control point like in a traditional VPN, ZT goes to great lengths to send as much of your traffic as possible point-to-point between hosts. It has extensive NAT-busting capabilities to let hosts on home networks participate.

Setting up ZeroTier

There are two steps to setting up your network. First, you sign up for a free account on ZeroTier's website. The free account lets you create as many networks as you want with up to 10 hosts per network. You can opt to pay for your account which enables unlimited hosts per network.

Once you've signed up and logged in, you'll see a box that says (network name) next to a Create Network button. Put a name in and hit the button. This name does not have to be unique, it's strictly for your use. Make a note of the new network's Network ID.

Next, you'll need to install the ZeroTier client on your machines. Ideally it would be packaged and installed via the official distribution systems like yum and apt, but ZeroTier isn't quite there yet. Instead, there's a downloadable installer for each platform. On Linux the installer is a shell script containing an embedded binary, so all you have to do is download it and run it. For OS X they provide a normal DMG containing a normal Mac application. Similarly, on Windows they provide an MSI installer.

Once you have ZT running on a host, you can see its automatically generated host ID like this:

$ sudo zerotier-cli info
200 info <your host id> ONLINE 0.9.2

Finally, each host needs to join your network. On Linux and OS X you can do this via the command line, like this:

$ sudo zerotier-cli join <your network id>

Go back to the web interface. Within a few seconds you'll see your host ID listed, along with an unchecked checkbox in the Authorize column. Check the box, and soon your host will get a new IP address.

Repeat these steps for every host you want to join this network. In my personal setup I made a simple puppet module that installs the client and attempts to join my network. I just have to verify the new host's ID and check the authorize checkbox.

Set up DNS (optional)

One last step you may want to do is set up DNS records for your backplane network so you don't have to try to remember IP addresses. Putting private IPs in the public DNS isn't really a problem because your hosts are the only ones who will actually be able to connect to each other, but if you want to really lock things down you could run your own private DNS server for your backplane network. I have a separate domain specifically for my backplane network, zrail.net.

Once you have DNS set up, connecting to a host on your backplane is just like connecting on the public network:

$ ssh host.zrail.net

ZeroTier will ensure that packets destined for a backplane address only go out through the proper interface.

Running servers on this network is also very easy. For example, here's how you would set up a simple Ruby rack server:

$ rackup -p 8080 -o 10.123.123.123

The -o option tells Rack to only listen on the interface for that IP address (10.123.123.123 is a placeholder for the ZeroTier IP assigned to your host).
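The same idea applies to any server you write yourself: bind to the backplane address rather than to every interface. Here's a minimal sketch using Ruby's standard socket library. The listen_on helper is hypothetical, and 127.0.0.1 stands in for your ZeroTier-assigned IP so the example runs anywhere.

```ruby
require "socket"

# Bind a TCP listener to one specific address rather than 0.0.0.0.
# Port 0 asks the OS for any free port, which keeps the sketch portable.
def listen_on(address, port = 0)
  TCPServer.new(address, port)
end

server = listen_on("127.0.0.1")
local = server.local_address
puts "listening on #{local.ip_address}:#{local.ip_port}"
server.close
```

A listener bound this way does not accept connections addressed to the host's other IPs, which is the point of keeping a service on the backplane.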

So far I'm just using this backplane network for SSH and database access. Soon I plan on setting up additional services and moving more stuff over to Docker containers, and having this private network will let me be a lot more flexible with how I set things up.

That said, ZeroTier is capable of a lot more than just being a VPN alternative. Especially with mobile devices, it has the ability to dramatically change how peer-to-peer apps are architected.

Five business lessons from an idle game

2014-09-08T00:00:00+00:00

Over the last few weeks I've been playing an idle game called AdVenture Capitalist. In this game, you play a businessman, running his various businesses from the comfy environs of your plush green lawn (and eventually your moon base). I realized this morning that, perhaps inadvertently, AdCap teaches a few very important lessons for people bootstrapping or starting up a business.

For those of you unfamiliar with the term "idle game", it's a genre where, after getting the ball rolling, you play a sort of managerial role. You usually can close the browser tab and come back minutes, hours, or days later and the game will have progressed in your absence. Kongregate has a whole category of idle games, containing hundreds of different variations on the theme.

AdVenture Capitalist begins with a single humble lemonade stand that earns just a few dollars every time you click on its button. From there, you can hire employees, upgrade your stand, and expand into bigger and more profitable businesses. Among the clicktacular fun, there are a bunch of important lessons.

1. Multiple income streams are better than just one

After earning enough with your first lemonade stand, you can buy another one. Suddenly you have twice the profits for every click. After you earn enough with your lemonade stand empire you can expand into newspaper stands, car washes, hockey teams, banks, and eventually even oil companies. Each one of these businesses generates a different amount of profit every time you click on it, but together they throw off thousands, millions, and eventually tretrigintillions (10¹⁰²).

The real life lesson is that, if you're running a business, it's important to have multiple diverse streams of income, so that if one goes south you're not left with nothing.

2. Take all the help you can get

AdCap starts out requiring you to actively click on a button every time you want to make money. Very quickly, though, you have the option of hiring a manager for your businesses. They take care of the nitty gritty day-to-day clicking and let you think strategically about your empire. In this case, the lesson is to hire out parts of your workload (bookkeeping, customer support, etc) so you can spend more time looking at the bigger picture.

3. Deploy capital efficiently

Another core feature of AdCap game mechanics is the notion of upgrades. After earning enough money you can upgrade your businesses to make them more efficient or enticing to customers. For example, the very first upgrade is "Little Umbrellas" for your lemonades, which triples your profits. In real life, you have to spend money to make money. Advertising, product updates, redesigned websites, all of these things cost money but, if you deploy your capital correctly, will bring in much more than what you've spent.

4. It's a waiting game

AdCap is, of course, an idle game. Just like your real-life business, you don't have to hover over it while making minute adjustments to progress in the game. All you have to do is wait for the results of your decisions. This can be pretty difficult sometimes, so AdCap gives you the option of "micromanaging" your managers, making them work harder but also making them angry in the process. If you work them too hard they'll quit, forcing you to rehire them for an exorbitant raise.

5. It's very easy to get distracted

"Oh, a new pizza place is only $10 trillion. click oh no, now I don't have enough to buy the next upgrade that increases my profits by 3x across every business!"

There are quite a few balls to keep juggling while playing AdCap or running your business. You have to keep your employees happy, balance cash flow needs with capital expenditures, and balance your capital between ten different business lines and dozens of upgrades to maximize your effectiveness. It's so easy to get distracted by the day-to-day mundane activities that you lose sight of your bigger goal. For AdCap, that's buying that final upgrade. In real life, it could be retiring at 40 and traveling the world, or amassing a huge fortune, or any number of other deeply personal goals. To get anywhere, you have to focus.

By playing a bunch of AdVenture Capitalist I probably haven't been focusing on my own businesses as I should be. I'm going to get back to work soon, right after these next ten newspaper companies.

Let's Begin Again

2014-09-02T00:00:00+00:00

At the ripe old age of 12 I started messing around with computers. I've been hooked ever since. I've also read everything I could get my hands on about business, hoping to some day be able to start my own. However, I took the well-worn path of college, then into industry where I lucked into a series of great jobs.

Today, I'm finally striking out on my own. Today I'm throwing open the doors to my software development consultancy.

I want to thank my former employer Kongregate for putting up with me over the past few years. It's been a hard road for all of us, and they've been amazing. It's definitely not them. (By the way, if you're an amazing Rails developer they're hiring!).

Enough about me. How about you? Do you have a project you need help with? I bring almost ten years of professional software development experience to the table, with a focus in online payment systems and high performance, high traffic web application development. Let's talk.

Fix Your Email Deliverability with DMARC

2014-08-21T00:00:00+00:00

If you do anything more advanced with email than hitting "Send" in Gmail then you should care about deliverability, which is the likelihood that your email will end up in your intended recipient's inbox instead of their spam folder.

In the last few years three technologies have emerged that help you as a sender work with receiving mail servers to ensure that your mail gets where it needs to be. They are

Sender Policy Framework (SPF)
DomainKeys Identified Mail (DKIM)
Domain-based Message Authentication, Reporting, and Conformance (DMARC)

All three of these are implemented using DNS TXT records, so we'll be using the dig utility to explore them. If you don't know much about DNS, or just want a refresher, check out my article DNS: The Good Parts. Briefly, a TXT record lets you associate a bit of text with a DNS name. A DNS name can have more than one record associated with it, so you could have one or more A records, an MX record, and one or more TXT records all associated to example.com. The one thing you can't do is mix CNAMEs with other types, which I talk about in depth in DNS: The Good Parts.

Together, SPF, DKIM, and DMARC control which servers can send as your domain (SPF), authenticate a message, proving that you sent it (DKIM), and instruct recipients what to do if one or both of those checks fail (DMARC). Combined they're a powerful tool for improving and maintaining your deliverability. Let's dive into each one of them a little.

SPF

The first technology is Sender Policy Framework (SPF). SPF is a way for you to declare the IP addresses or IP ranges that are allowed to send email from your domain. Here's the SPF record for petekeen.net:

$ dig +short petekeen.net txt
"v=spf1 include:_spf.google.com include:spf.mandrillapp.com ~all"

SPF is composed of a version followed by one or more declarations. For my domain, I include Google and Mandrill's declarations and then declare everything else as a "soft fail". More specifically, I am telling the world that servers that belong to Google and Mandrill are authorized to send email as me, and everybody else is not, but don't reject it just because they're not authorized.

SPF records can get arbitrarily complicated. The important thing to remember is that they're just a whitelist and/or blacklist of IPs that can or can't send on behalf of your domain.
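To see how the pieces of a record break down, here is a simplified reader written for illustration. It is not a full SPF implementation: it ignores modifiers like redirect=, does no macro expansion, and never looks up the include targets. The parse_spf name and the output shape are my own choices.

```ruby
# Map each SPF qualifier character to the result it requests.
SPF_QUALIFIERS = { "+" => :pass, "-" => :fail, "~" => :softfail, "?" => :neutral }.freeze

# Split an SPF record into its version and a list of mechanisms.
def parse_spf(record)
  version, *terms = record.split
  mechanisms = terms.map do |term|
    # A mechanism with no explicit qualifier defaults to "+" (pass).
    qualifier = SPF_QUALIFIERS.key?(term[0]) ? term[0] : "+"
    body = term.sub(/\A[+\-~?]/, "")
    name, value = body.split(":", 2)
    { qualifier: SPF_QUALIFIERS[qualifier], mechanism: name, value: value }
  end
  { version: version, mechanisms: mechanisms }
end

spf = parse_spf("v=spf1 include:_spf.google.com include:spf.mandrillapp.com ~all")
spf[:mechanisms].each { |m| puts m.inspect }
```

Run against the petekeen.net record, this yields two include mechanisms that pass and a final all mechanism marked as a soft fail, which matches the plain-English reading above.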

DKIM

Another important technology is DomainKeys Identified Mail (DKIM). When you send email through a provider that supports DKIM, they will sign the contents of your email and (most of) the headers using public key cryptography and add that signature as another header. Receiving email servers look up your public key and verify that nothing has changed in the email.

Email service providers have various ways of inserting the key into DNS, but typically you'll add a record at something like providername._domainkey.example.com which either contains or points at their key. For example, petekeen.net uses Mandrill extensively to send out messages, and Mandrill says to put a DKIM key at mandrill._domainkey.petekeen.net:

$ dig +short mandrill._domainkey.petekeen.net txt
"v=DKIM1; k=rsa; p=MIGfMA0GCSqGSIb3DQEBAQUAA4GNADCBiQKBgQCrLHiExVd55zd/IQ/J/mRwSRMAocV/hMB3jXwaHH36d9NaVynQFYV8NaWi69c1veUtRzGt7yAioXqLj7Z4TeEUoOLgrKsn8YnckGs9i3B3tVFB+Ch/4mPhXWiNfNdynHWBcPcbJ8kjEQ2U8y78dHZj1YeRXXVvWob2OaKynO8/lQIDAQAB;"

As you can see, this follows the same basic format as the SPF record. It contains a version attribute, an attribute that tells it what kind of key it is, and then the actual key itself.

DMARC

The third technology that helps to ensure delivery is named Domain-based Message Authentication, Reporting, and Conformance (DMARC). DMARC acts as a policy statement that declares what to do with emails that fail SPF, DKIM, or both. There are a few different modes that you can use with DMARC, but the most basic one is to receive reports from receiving email servers on pass or fail status. Here's what the DMARC record for petekeen.net looks like (but see the next section on how to get your own):

$ dig +short _dmarc.petekeen.net txt
"v=DMARC1; p=none; pct=100; rua=mailto:re+POSTMARK_KEY@dmarc.postmarkapp.com; sp=none; aspf=r;"

The full standard goes into what all of these parts mean, but you can interpret this as: report all SPF and DKIM errors to the email address in the rua param but continue to accept them.

There are a lot of things you can tweak with your DMARC policy, but the one declared above has the least impact.
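DKIM and DMARC records share the same "tag=value; tag=value" shape, so one small reader handles both. This is a sketch for poking at records, not a validator: parse_tag_list is a hypothetical name, and it makes no attempt to check tags or values against the specs.

```ruby
# Parse a "tag=value; tag=value" TXT record into a Hash of strings.
def parse_tag_list(record)
  record.split(";").each_with_object({}) do |pair, tags|
    # Limit the split to 2 so values containing "=" (base64 keys) stay whole.
    key, value = pair.strip.split("=", 2)
    next if key.nil? || key.empty?

    tags[key] = value
  end
end

dmarc = parse_tag_list(
  "v=DMARC1; p=none; pct=100; rua=mailto:re+POSTMARK_KEY@dmarc.postmarkapp.com; sp=none; aspf=r;"
)
puts dmarc["p"]   # none
puts dmarc["rua"]
```

The p tag is the one to watch: it is the policy that later sections talk about moving from none to quarantine or reject.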

Monitoring

If you look closely at that DMARC record above, you'll see dmarc.postmarkapp.com. Postmark runs a free DMARC aggregation service, which will aggregate all of the reports from DMARC-supporting services and send you a report every Monday morning with details. The first step in implementing DMARC is to sign up with Postmark's service, set up the DMARC record that they give you in your DNS, and wait a week.

Yep, a whole week, then come back here and we'll talk about how to handle the inevitable errors that show up in your report.

How To Fix The Errors

So you waited a whole week (or just scrolled down, no big deal) and now you have an email from Postmark telling you about all of the problems it found. Now what?

This step is actually pretty easy. For each provider in the list that you know you use, you need to set up SPF and DKIM (if they provide it). Here's a list of common email providers' help documentation on how to do that:

MailChimp
Mandrill
Postmark
SendGrid (SPF, DKIM)
Mailgun
Amazon AWS SES (SPF, DKIM)
Google Apps

You'll probably have to go through this cycle every week for at least a few weeks, in order to catch all of the services that send email as you. Just remember to only add SPF for services that you know about.

VERY IMPORTANT NOTE You should only have one SPF record for your domain. If you use more than one outgoing email provider, you need to combine their include directives together. See the SPF record for petekeen.net above for an example of what this looks like.
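Combining providers is just string assembly, which this small sketch shows. The combined_spf helper is hypothetical, and the policy argument defaults to the soft fail used earlier in this article. Swap in whatever include hosts your providers document.

```ruby
# Merge the include: directives from several providers into one SPF record.
def combined_spf(includes, policy: "~all")
  parts = ["v=spf1"] + includes.uniq.map { |host| "include:#{host}" } + [policy]
  parts.join(" ")
end

puts combined_spf(["_spf.google.com", "spf.mandrillapp.com"])
# v=spf1 include:_spf.google.com include:spf.mandrillapp.com ~all
```

The result is a single TXT value, which is exactly what you want to end up with in DNS.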

What about unknown providers?

Eventually you will likely start to see things in your DMARC report that are suspicious. The most likely cause of this is spammers using your domain to tell everyone they can find about the magic of off-brand C1@LI$. If that starts happening, you can change your DMARC settings to be more strict.

The most complete guide for how to do that is the standard, since there are quite a few options. That said, if you want receiving email servers to quarantine suspicious messages you can change the p= setting from none to quarantine, or you can change it to reject to flat out bounce the messages.

There are a variety of reasons why you wouldn't want to do that, so I advise people to keep their settings at none unless they're absolutely sure of the implications for their own domain.

Know How To Roll (Your SSL Certificates)

2014-08-13T00:00:00+00:00

A few weeks ago Stripe's SSL certificate became invalid, along with several other major sites. Their certificate didn't expire, their certificate authority's root certificate did. This shouldn't happen, but as with most terrible things it crops up at rather inconvenient times.

There's not much you can do to protect yourself against a service provider's certificate expiring, but you can proactively protect yourself against your own certificate expiring. The biggest thing to do is have a schedule and a process.

Schedule

This is easy. Just make a monthly recurring entry in your calendar that says "Check SSL certificates". When that calendar entry comes up, go to your website and check your certificate by clicking on the lock icon. The things you're checking for:

Is the lock icon still showing up how it should? If you have an EV certificate you should see your company name in the title bar. If you have a normal certificate you'll see either a green lock in Firefox or Chrome or a grey "https" in Safari.
Is the expiration date coming up? You should proactively roll your certificates at least a day before they're due to expire.

This is the bare minimum you should do if you have one SSL certificate. If you have more than one, or you want to be more thorough, there are any number of free and paid SSL monitoring services out there that will check your certs day and night to make sure they're ok.
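If you keep your certificate files around, you can also script the expiration check. Here's a minimal sketch using Ruby's bundled openssl library. The helper names and the 30-day warning threshold are arbitrary choices for this example, and it only reads a local PEM file rather than connecting to your live site.

```ruby
require "openssl"

# Whole days remaining before a certificate's notAfter date.
# A negative result means the certificate has already expired.
def days_until_expiry(cert, now = Time.now)
  ((cert.not_after - now) / 86_400).floor
end

# Read a PEM file from disk and report whether it needs rolling soon.
def check_cert_file(path, warn_within_days: 30)
  cert = OpenSSL::X509::Certificate.new(File.read(path))
  days = days_until_expiry(cert)
  status = days <= warn_within_days ? "ROLL SOON" : "ok"
  "#{path}: #{days} days left (#{status})"
end

# Usage: ruby check_cert.rb example.com.crt
puts check_cert_file(ARGV[0]) if ARGV[0]
```

Hooking something like this up to cron gets you most of the way to the monthly calendar reminder without having to remember to click the lock icon.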

Process

The process you use to roll certificates is somewhat dependent on your infrastructure, but the general ideas stay the same:

Know how to generate a new private key
Know how to generate a new CSR from that key
Know how to renew your certificate with your provider using that CSR
Know how to install your new certificate

Note that you should use a new private key every time because there may have been a private key compromise you don't know about. See SSL Labs's SSL/TLS Deployment Best Practices (pdf) for more details.

If you use Heroku and all of this seems like too much bother, you should check out the ExpeditedSSL addon. They'll automate all of these steps away and make sure you're protected.

I run all of my sites on VPSs, so I have the privilege of managing everything myself. I put together a Rakefile that manages the hard-to-remember steps for me. It lives in private source control along with my keys and certs, but here's what it looks like today:

desc "Generate a new key"
task :gen_key do
  domain = get_env(:domain)
  filename = "#{domain}.key"

  `openssl genrsa -out #{filename} 2048`
end

desc "Generate a new CSR"
task :gen_csr => :gen_key do
  domain = get_env(:domain)
  csr_filename = "#{domain}.csr"
  key_filename = "#{domain}.key"

  `openssl req -new -utf8 -sha256 -key #{key_filename} -out #{csr_filename}`
  `cat #{csr_filename} | pbcopy`
end

desc "Generate a proper nginx cert file from Namecheap Comodo certificate download"
task :assemble_cert do
  cert_dir = get_env(:cert_dir)
  domain = get_env(:domain)
  pem_path = "#{domain}.crt"

  File.open(pem_path, 'w+') do |pem_file|
    add_file_to_cert pem_file, domain.gsub(/\./, '_') + '.crt'
    add_file_to_cert pem_file, 'COMODORSADomainValidationSecureServerCA.crt'
    add_file_to_cert pem_file, 'COMODORSAAddTrustCA.crt'
    add_file_to_cert pem_file, 'AddTrustExternalCARoot.crt'
  end

  puts "Wrote pem to #{pem_path}"
end

def add_file_to_cert(pem_file, filename)
  cert_dir = get_env(:cert_dir)
  full_path = File.join(cert_dir, filename)
  puts "Adding #{full_path} to pem"
  pem_file.write(File.read(full_path))
end

def get_env(name)
  val = ENV[name.to_s]
  raise "Required env variable missing: #{name}" unless val && val != ''
  val
end

It's really simple. All you do is feed it the fully-qualified domain name (as the domain environment variable) and it spits out a key and a CSR. As written, gen_csr depends on gen_key, so every run generates a fresh key, in line with the advice above. It's even nice enough to copy the CSR into your clipboard so you can just paste it into your provider's website.

Installing certificates is where it becomes infrastructure-dependent. Heroku has a nice guide, as does Amazon. If you're using Nginx, you'll need to generate a PEM file from your certificate and any intermediary certificates that it requires. The Rakefile above contains a task named assemble_cert that will build a PEM file suitable for Nginx.

Very Important Note: Make sure to change the assemble_cert task to reflect the order that your certificate needs to get put together. This script generates correct files for Comodo certificates issued by Namecheap, but there's no guarantee that this is the order for any other certificate provider.
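One way to sanity check the order is to confirm that each certificate in the assembled file is immediately followed by the certificate that issued it. Here's a minimal sketch using Ruby's bundled openssl library. The helper names certs_in_bundle and chain_in_order? are made up for illustration, and the check only compares issuer and subject names; it does not verify signatures or tell you what order your particular server wants.

```ruby
require 'openssl'

# Split a PEM bundle into individual certificates, in file order.
def certs_in_bundle(pem_text)
  pem_text
    .scan(/-----BEGIN CERTIFICATE-----.+?-----END CERTIFICATE-----/m)
    .map { |pem| OpenSSL::X509::Certificate.new(pem) }
end

# True when every certificate is directly followed by its issuer.
# Names are compared as strings; signatures are NOT checked here.
def chain_in_order?(certs)
  certs.each_cons(2).all? do |cert, issuer|
    cert.issuer.to_s == issuer.subject.to_s
  end
end
```

Something like chain_in_order?(certs_in_bundle(File.read('example.com.crt'))) should then return true for a bundle that runs from your certificate up toward the root.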


Command Line Faxing

2014-08-04T00:00:00+00:00

When I started Okapi LLC, my little consultancy and publishing house, I had to fax in some forms to the State of Michigan. The entire system for opening businesses in Michigan, in fact, is basically a fax-driven API. Being a modern, hip millennial I don't subscribe to a land line phone, nor do I own a fax machine. How was I supposed to fax things?

Enter Phaxio. They have a whole bunch of fax machines (actually they're probably banks of modems) in a data center somewhere and they let you use them with a simple HTTP API. All you have to do is go sign up and make an initial deposit. They'll provide you with an API key and secret pair that you can then use to send faxes using curl.

For a time I was actually hand-writing the curl commands. That got tedious and annoying so I wrote up this little script:

#!/usr/bin/env ruby

unless ARGV.length >= 2
  STDERR.puts "Usage: send_fax NUMBER FILENAME..."
  exit 1
end

number = ARGV.shift
api_key = ENV['PHAXIO_API_KEY']
api_secret = ENV['PHAXIO_API_SECRET']

command_args = [
  'curl',
  'https://api.phaxio.com/v1/send',
  "-F to=#{number}",
  "-F api_key=#{api_key}",
  "-F api_secret=#{api_secret}"
]

ARGV.each do |file|
  command_args << "-F filename[]=@#{file}"
end

exec command_args.join(" ")

All this does is grab my keys from the environment, sanity check the arguments, and construct and execute the curl command I was writing. It's as simple as that.
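One wrinkle with joining everything into a single string is that a filename containing a space will be split by the shell. Here's a hedged variation, not the script I actually run, that keeps each argument as its own array element. The function name fax_curl_args is hypothetical; the endpoint and form fields are the same ones used above.

```ruby
require 'shellwords'

# Build the curl invocation as an array, one element per argument.
# fax_curl_args is an illustrative name, not part of any library.
def fax_curl_args(number, files, api_key, api_secret)
  args = [
    'curl', 'https://api.phaxio.com/v1/send',
    '-F', "to=#{number}",
    '-F', "api_key=#{api_key}",
    '-F', "api_secret=#{api_secret}"
  ]
  files.each { |file| args.push('-F', "filename[]=@#{file}") }
  args
end

args = fax_curl_args('+15555550100', ['my form.pdf'], 'KEY', 'SECRET')

# Shellwords.join escapes each argument, handy for printing the command.
puts Shellwords.join(args)

# Passing the array to exec(*args) would run curl without going through a shell.
```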

Like any good FaaS (facsimile-as-a-service), Phaxio can be configured to send out webhooks when faxes come in or go out. The thing that really sets Phaxio apart in my mind is that you can set your webhook URLs to be mailto: URLs (ex. mailto:faxes@example.com), which means you don't have to set up an application for notifications. The emails come with handy links to download your faxes in one click.

How much does this cost?

Well, it's not free but it's practically free. Pages are 7 cents a pop and incoming numbers (which are totally optional unless you want to receive faxes) cost $2 per month. Michigan's ELF system automatically faxes you status updates and documents (would these be "faxhooks"?) so I have an incoming number.
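To put numbers on that, here's a tiny sketch of the arithmetic using the prices quoted above. It works in cents to avoid floating point surprises; the constant and method names are just for illustration, and the prices are whatever they were when I wrote this.

```ruby
PAGE_COST_CENTS = 7              # 7 cents per page, as quoted above
INCOMING_NUMBER_COST_CENTS = 200 # $2 per month for an incoming number

# Monthly cost in cents for a given number of pages.
def monthly_fax_cost_cents(pages, incoming_number: true)
  pages * PAGE_COST_CENTS + (incoming_number ? INCOMING_NUMBER_COST_CENTS : 0)
end

# Ten pages plus an incoming number comes to 270 cents, or $2.70.
puts monthly_fax_cost_cents(10)
```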

What about signatures?

Ah yes. The reason why you'd need to send a fax in the first place instead of emailing things around is because your documents contain sensitive information and probably have to be signed. OS X's Preview application has a super handy feature built in named "Signature Capture". Just open up the PDF you need to sign, then go Tools -> Annotate -> Signature -> Create Signature from FaceTime HD Camera, which will open up a little dialog box like this:

Just sign your name onto a blank sheet of paper, hold it up to your camera, and then hit "Accept". Then your signature will be available within the same "Signature" submenu. Select it, click anywhere in a document, and poof you just signed it.

I've been using this system for about six months to sign and fax documents for things like opening bank accounts, setting up my company, and signing contracts. There are probably better systems out there. What's yours?

Start a VirtualBox VM at Boot on Mac OS X

2014-05-30T00:00:00+00:00

Sometimes you have a VirtualBox VM that's critical to your workflow. For example, the Mac mini in my basement hosts a VM that does things like host all of my private Git repos and provide a staging environment for all of my wacky ideas.

When I have to reboot that Mac mini for any reason, inevitably I find myself trying to push changes to some git repo and forgetting that I have to start up the VM again by hand. And then there's the yelling and the drinking and it's no good for anyone.

It turns out you can actually run VirtualBox VMs in a few different ways, including from the command line. Assuming you have a VM named examplevm, this command will start it up in the background:

$ VBoxManage startvm examplevm

Starting up in the background isn't quite right for how this will eventually be set up, so what about running in the foreground? Turns out VirtualBox has us covered:

$ VBoxHeadless -s examplevm

This will start the VM up in the foreground without any visible UI. Now all we need is a little launchd configuration for it (in ~/Library/LaunchAgents/bugsplat.examplevm.plist):

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN" "http://www.apple.com/DTDs/PropertyList-1.0.dtd">
<plist version="1.0">
  <dict>
    <key>KeepAlive</key>
    <true/>
    <key>Label</key>
    <string>bugsplat.examplevm</string>
    <key>ProgramArguments</key>
    <array>
      <string>VBoxHeadless</string>
      <string>-s</string>
      <string>examplevm</string>
    </array>
    <key>RunAtLoad</key>
    <true/>
    <key>UserName</key>
    <string>peter</string>
    <key>WorkingDirectory</key>
    <string>/Users/peter</string>
    <key>StandardErrorPath</key>
    <string>/usr/local/var/log/examplevm.log</string>
    <key>StandardOutPath</key>
    <string>/usr/local/var/log/examplevm.log</string>
  </dict>
</plist>

There are a few important things to note about this configuration. First, notice how the ProgramArguments list is broken out. Take the VBoxHeadless command from before and split it on spaces; each piece gets its own <string> element. Sigh, XML.

The second important thing is the WorkingDirectory key. It turns out VBoxHeadless is not very smart about where it looks for VMs. This has to be pointing at your home directory.

Third, the StandardErrorPath and StandardOutPath keys. The directory has to exist or launchd will just silently fail.
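If you end up writing more than one of these plists, the first and third points are easy to script. This is only a sketch with made-up helper names (program_arguments and ensure_log_dir): one turns a command line into <string> elements, the other creates the log directory so launchd has somewhere to write.

```ruby
require 'erb'
require 'fileutils'

# Hypothetical helper: one <string> element per whitespace-separated piece
# of the command, with XML-special characters escaped.
def program_arguments(command)
  command.split.map { |arg| "<string>#{ERB::Util.html_escape(arg)}</string>" }
end

# Hypothetical helper: make sure the directory holding a log file exists.
# Returns the directory path.
def ensure_log_dir(log_path)
  dir = File.dirname(log_path)
  FileUtils.mkdir_p(dir)
  dir
end

puts program_arguments('VBoxHeadless -s examplevm')
# ensure_log_dir('/usr/local/var/log/examplevm.log') would create the
# directory used in the plist above if it were missing.
```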

To get this thing running for the first time, just run this:

$ launchctl load -wF ~/Library/LaunchAgents/bugsplat.examplevm.plist

One more thing...

The VM that I'm using this for is running Ubuntu 12.04 LTS, which has a really annoying feature. If it knows that it crashed (or the "power" was "cut", for example if I just kill the VBoxHeadless process) the GRUB boot loader has no timeout on the select screen, and the select screen is written as a tight busy loop that will consume an entire CPU core just waiting for input that will never come because it's running headless¹.

To fix this, you'll need to add a line to /etc/default/grub inside the VM:

GRUB_RECORDFAIL_TIMEOUT=2

and then run:

$ sudo update-grub

This gives you a two second window in which to select memtest or a recovery partition if you want, but it will still boot to the normal image eventually.

¹ I seriously spent days banging my head against the wall thinking this was a VirtualBox bug or that something was corrupting my VMs when I killed VBoxHeadless. The thing that threw me off was the pegged CPU. It never would have occurred to me that GRUB uses a tight busy loop waiting for input. ↩

Stripe Account Balances for Service Credits

2014-05-07T00:00:00+00:00

Say you want to give a customer an account credit for some reason. They're an especially good customer, or your service was down for a few minutes and you want to give service credits, or some other reason. You can do this using Stripe's account_balance feature.

Here's an example situation. You run a DNS service and had a five minute outage last month. Your SLA (service level agreement) says that you give a one day credit for a five minute outage, and for your customer Bob that's equal to $1.

Here's how you credit Bob's account:

bob = Stripe::Customer.retrieve('cus_bobskey')
bob.account_balance = bob.account_balance - 100
bob.save

Bob's card will be charged $9 for his next invoice: $10 from his monthly plan and -$1 from his account balance. After this invoice, his account balance will be set back to 0. Note that you have to set the account balance to a negative number. If you set it to a positive number, that amount will be added to Bob's next invoice instead of subtracted. It's also a good practice to subtract the amount from their existing balance. Most of the time this will be 0, but if they happen to already have a balance you don't want to stomp on it.

Here's another example. You want to give Cindy two free months for upgrading to your biggest plan, from the $20 Hobby plan to the $100 Super Startup plan. The same idea applies as for Bob:

cindy = Stripe::Customer.retrieve('cus_cindyskey')
cindy.plan = 'super_startup_100'
cindy.account_balance = cindy.account_balance - 20000
cindy.save

Cindy is halfway through the billing month when she decides to upgrade. On her next invoice, she'll see the following line items:

$50 for the half month
$100 for the next full month
$-150 from her account balance

Her card won't actually be charged because the entire amount came out of her account balance, which is now $-50. On month 2, she'll receive that $50 from her account balance and be charged the remaining $50 for her plan, and then on month 3 she'll be charged the full $100.
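The arithmetic in both examples follows the same pattern, so here's a toy model of it. To be clear, this is my simplified reading of the behavior described above, not Stripe's implementation, and apply_balance is a made-up name. Amounts are in cents and a negative balance is a credit.

```ruby
# Toy model: apply an account balance to an invoice.
# Returns what the card is charged and the balance left afterwards.
def apply_balance(invoice_cents, balance_cents)
  total = invoice_cents + balance_cents
  {
    charged: [total, 0].max,     # never charge a negative amount
    new_balance: [total, 0].min  # leftover credit carries forward
  }
end

bob = apply_balance(1_000, -100)        # charged 900, balance back to 0
cindy_1 = apply_balance(15_000, -20_000) # charged 0, balance -5000
cindy_2 = apply_balance(10_000, cindy_1[:new_balance]) # charged 5000, balance 0

puts bob, cindy_1, cindy_2
```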

You can use the account balance with positive numbers, of course. This would add an additional line item to the customer's invoice for the account_balance amount. It's usually better to create line items directly, though, because that way you have control over the description.

Stripe's update_customer documentation

Using Stripe Checkout for Subscriptions

2014-03-30T00:00:00+00:00

Stripe provides a gorgeous pre-built credit card form called Stripe Checkout. Checkout is mainly intended for one-off purchases like Dribbble or my book. Many people want to use it for their Stripe-powered subscription sites so in this article I'm going to present a good way of doing that.

Here's a really basic Checkout button (in fact, the same button as on Stripe's documentation site):

<script
  src="https://checkout.stripe.com/checkout.js" class="stripe-button"
  data-key="pk_test_1MCYLYHQDa4DwnBoKd5CqoaP"
  data-image="https://stripe.com/img/documentation/checkout/marketplace.png"
  data-name="Demo Site"
  data-description="2 widgets ($20.00)"
  data-amount="2000">
</script>

There are several things in here that don't really work well for a subscription site, but they're easy to fix. For example, we don't want to show an amount in that way and the description doesn't make much sense. Here's another version with a few changes:

<script
  src="https://checkout.stripe.com/checkout.js" class="stripe-button"
  data-key="pk_test_1MCYLYHQDa4DwnBoKd5CqoaP"
  data-image="https://stripe.com/img/documentation/checkout/marketplace.png"
  data-name="Demo SaaS Site"
  data-description="Pro Subscription ($29 per month)"
  data-panel-label="Subscribe"
  data-label="Subscribe"
  data-amount="2900">
</script>

Stripe lets you customize most of the text on the form. In this example, we changed the panel button to say "Subscribe" instead of "Pay", and changed the description to something more appropriate for our site. You can also see that we added a template variable to the data-description attribute, so if we had multiple tiers all we'd have to change is the word "Pro" and the data-amount attribute.
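If your site is rendered from Ruby, the per-tier version might look something like this ERB sketch. The checkout_button helper and its keyword arguments are assumptions made for illustration; the data attributes are the same ones used in the button above. The dollar figure uses integer division, so it only reads well for whole-dollar prices.

```ruby
require 'erb'

# Template for the Checkout button; only the tier name and amount vary.
BUTTON_TEMPLATE = ERB.new(<<~'HTML')
  <script
    src="https://checkout.stripe.com/checkout.js" class="stripe-button"
    data-key="<%= ERB::Util.html_escape(key) %>"
    data-name="Demo SaaS Site"
    data-description="<%= ERB::Util.html_escape(tier_name) %> Subscription ($<%= amount_cents / 100 %> per month)"
    data-panel-label="Subscribe"
    data-label="Subscribe"
    data-amount="<%= amount_cents %>">
  </script>
HTML

# Hypothetical helper: render the button for one subscription tier.
def checkout_button(key:, tier_name:, amount_cents:)
  BUTTON_TEMPLATE.result_with_hash(
    key: key, tier_name: tier_name, amount_cents: amount_cents
  )
end

puts checkout_button(key: 'pk_test_1MCYLYHQDa4DwnBoKd5CqoaP',
                     tier_name: 'Pro', amount_cents: 2900)
```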

There's one last thing that we can change on here. There's that "Remember Me" checkbox, which can be confusing for customers. They're signing up for a subscription site, so aren't you already remembering them? Thankfully, Stripe recently added the ability to disable that checkbox:

<script
  src="https://checkout.stripe.com/checkout.js" class="stripe-button"
  data-key="pk_test_1MCYLYHQDa4DwnBoKd5CqoaP"
  data-image="https://stripe.com/img/documentation/checkout/marketplace.png"
  data-name="Demo SaaS Site"
  data-description="Pro Subscription ($29 per month)"
  data-panel-label="Subscribe"
  data-label="Subscribe"
  data-amount="2900"
  data-allow-remember-me="false">
</script>

Great! Nice, streamlined, beautiful form without having to design it yourself. But what if you don't like that blue button? Stripe provides a Javascript API so you can make any link or button pop up Checkout:

<button class="btn btn-primary btn-large" id="stripe-demo">Subscribe</button>

<script src="https://checkout.stripe.com/checkout.js"></script>
<script>
var handler = StripeCheckout.configure({
  key: "pk_test_1MCYLYHQDa4DwnBoKd5CqoaP",
  image: "https://stripe.com/img/documentation/checkout/marketplace.png",
  name: "Demo SaaS Site",
  description: "Pro Subscription ($29 per month)",
  panelLabel: "Subscribe",
  allowRememberMe: false
});

document.getElementById('stripe-demo').addEventListener('click', function(e) {
  handler.open();
  e.preventDefault();
});
</script>

Pretty straightforward. Every attribute that you pass into the simple integration using data attributes instead gets passed into the configure method. You can pass overrides into the open method, so for example, if you had a series of buttons with a specific class and data attributes for the description, you could get that off of the event target and pass it into open.

But what about passwords?

If you're building a subscription product you'll likely want the user to set their password. There are two ways you can use Stripe Checkout and still have the user set their password:

Have a second step after Stripe Checkout that allows the user to set up their account, including their password.
Send a confirmation email to the user immediately after the subscription flow that brings them to a password reset screen.

Of the two I prefer the second, since you should be confirming the user's email address anyway. That said, you should test with your customers and see what works best for them.

Now what?

Of course, none of this is worth anything without creating the customers. For that, you'll need to use Stripe's server-side APIs along with your secret key. Stripe has excellent documentation on how this works.

Note: this describes Stripe Checkout as of March 30th, 2014. Stripe is continually updating and testing Checkout, so things may change in the future.

Self-hosted Git Server

2014-03-16T00:00:00+00:00

I've had a GitHub account since 2008. June 16th, to be exact. For almost six years I've been hosting my code on someone else's servers. It was sure convenient, and free, and I don't regret it one bit, but the time has come to move that vital service in-house.

I've run my own private git server on the Mac mini in my living room since 2012. For the last few years, then, my GitHub account has become more of a public portfolio and mirror of a selection of my private repos. As of today, my GitHub account is deprecated. If you want to see what I'm working on now you can go to my Projects page. I'll be gradually moving old projects over to this page, and new projects will show up there first.

Implementation

The projects page has three moving pieces. The git repos themselves, read-only public clone access, and finally displaying the projects on the page.

For the git repos, I was able to just re-use the puppet recipe I put together to install Gitolite on my Mac mini. It has a much simpler config because it just needs dynamic repos and no other crazy hooks.

To add read-only clone access I turned to a project named Grack. Grack implements git's smart HTTP protocol as a Rack handler which makes it super simple to add to bugsplat.rb, the software that runs this site. Here's what config.rb looks like:

require 'dotenv'
Dotenv.load

require 'app'
require 'grack'

grack_config = {
  project_root: ENV['PROJECTS_REPOS_ROOT'],
  adapter: Grack::GitAdapter,
  git_path: ENV['GIT_BINARY'],
  upload_pack: true
}

puts grack_config.to_json

use Rack::ShowExceptions
run Rack::URLMap.new \
  '/'       => App.new,
  '/source' => Grack::App.new(grack_config)

Every git repo in the directory named by PROJECTS_REPOS_ROOT is available for read-only public cloning.

To display these repos, I added a new Project class. Here it is, in its entirety:

require 'grit'
require 'yaml'

class Project
  def initialize(path)
    @path = path
    load_config
  end

  def load_config
    data = repo_data(".repo.yml")
    @config = data.nil? ? {} : YAML.load(data)
  end

  def name
    @config['name'] || base_path.gsub('.git', '')
  end

  def description
    @config['description'] || "No description"
  end

  def clone_url
    "https://www.petekeen.net/source/#{base_path}"
  end

  def information_page
    "/projects/#{base_path.gsub('.git', '')}"
  end

  def base_path
    File.basename(@path)
  end

  def repo_data(path)
    repo = Grit::Repo.new(@path)
    obj = repo.tree / path
    if obj
      obj.data.encode('UTF-8')
    else
      nil
    end
  end

  def readme_contents
    repo_data("README.md") || ""
  end

  def self.all
    Dir[File.join(ENV['PROJECTS_REPOS_ROOT'], "*.git")].sort.map do |dir|
      Project.new(dir)
    end
  end

  def self.find(name)
    path = File.join(ENV['PROJECTS_REPOS_ROOT'], "#{name}.git")
    if File.directory?(path)
      Project.new(path)
    else
      nil
    end
  end
end

It uses Grit to pull out the README.md file from the repo, as well as a small YAML config file that contains a few pieces of metadata.
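To make the fallback behavior concrete without needing Grit or a real repo, here's a standalone sketch of the same idea. The project_metadata helper is hypothetical, and the sample YAML is invented; only the name and description keys come from the class above.

```ruby
require 'yaml'

# Hypothetical helper: given the text of .repo.yml (or nil if the repo
# has none) and the repo directory, work out the display name and description.
def project_metadata(config_yaml, repo_dir)
  config = (config_yaml && YAML.safe_load(config_yaml)) || {}
  {
    name: config['name'] || File.basename(repo_dir).sub(/\.git\z/, ''),
    description: config['description'] || 'No description'
  }
end

sample = "name: Bugsplat\ndescription: The software that runs this site\n"
puts project_metadata(sample, '/repos/bugsplat.git')
puts project_metadata(nil, '/repos/wacky-idea.git')
```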

The app then uses the existing Redcarpet-based Markdown renderer that renders the rest of the pages and throws the content up on the page.

Future Additions

There are a lot of little things that this system lacks. Easy code and commit browsing are both huge features that I'd like to add at some point. I thought about using Gitlab which has all kinds of nice features, but hacking things together is kind of my thing.

I realize that it's a little ironic that most of the links in this post point at GitHub. My reasons for moving to this system are pretty simple: I want to control my own destiny, free of even the possibility that someone else will be able to decide how or what I choose to share with the world.

For a lot of people, GitHub or Bitbucket or another 3rd party service presents a reasonable compromise, and that's fine. For myself, today is the last day that I'm pushing new repos to GitHub as well as the last day I'm paying them for my organization account.

Using the Mailchimp API for Sales

2014-03-06T00:00:00+00:00

One of the very first things I did when I started working on the idea that eventually became Mastering Modern Payments was set up a Mailchimp mailing list. People would land on the teaser page and add themselves to the list so that when the book came out they would get a little note. After the book launch (with 30% of that initial list eventually buying) I started putting actual purchasers on the list.

For three whole months my process was:

Use a rake task to export the entire list as a CSV
Navigate to Mailchimp
Remind myself where the import button is and click it
Paste in the CSV
Assign field labels
Hit the "import" button
Wait for an email saying it was done

I did this every single time I sent an email to the list, which was quite often when I was actively fixing bugs in the book. Clearly this couldn't continue.

Mailchimp turns out to have a very nice API these days and there's a plethora of good libraries out there to help you take advantage. Because the Mastering Modern Payments application is written in Ruby I chose to go with Gibbon.

The code is remarkably straightforward:

class MailchimpWorker
  include Sidekiq::Worker

  def perform(guid)
    ActiveRecord::Base.connection_pool.with_connection do
      sale = Sale.find_by(guid: guid)

      gb = Gibbon::API.new

      gb.lists.subscribe(
        id: Rails.configuration.mailchimp[:list_id],
        update_existing: true,
        email: {email: sale.email},
        merge_vars: {
          PRODUCT: sale.product.permalink,
          PURCH: 't',
          GUID: sale.guid,
          AMOUNT: sale.amount,
          PURCHAT: sale.created_at.strftime('%Y-%m-%d %H:%M:%S')
        }
      )
    end
  end
end

To start out, it's a Sidekiq worker. Every customer-initiated interaction with Stripe and Mailchimp in the MMP sales app goes through Sidekiq. Feel free to substitute Sidekiq for whatever other background worker you use, or if you're feeling cheeky just don't use one at all.

Then it goes on to look up the sale, build a new instance of the Gibbon API, and call the subscribe method. Three things to note here. First, the update_existing flag. I don't want to stomp on the user's record if it exists, I just want to update it with their sales info.

Second, the merge_vars keys are in all caps. Mailchimp merge vars are case sensitive, in so far as they are always upper case. If you specify the wrong merge vars nothing will happen: the update will just silently fail (I spent probably two hours debugging that one).

Third, that PURCH merge var. This list is a mixture of people who are possibly interested in buying the book and also people who have definitely purchased the book. Frequently I'll want to send to either of those groups, but not both, and Mailchimp's search interface makes it surprisingly difficult (impossible, actually) to check for null values. Instead, I set this little true/false flag and then I can do a search for purchased == 't' or purchased != 't'.
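Since a wrong merge var fails silently, it can be worth pulling the hash construction out into something you can check on its own. Here's a minimal sketch of that idea. FakeSale, merge_vars_for, and all_caps_keys? are stand-ins invented for this example (the real Sale is an ActiveRecord model), and product_permalink flattens the sale.product.permalink call from the worker.

```ruby
# Stand-in for the real Sale model, just enough for the example.
FakeSale = Struct.new(:email, :guid, :amount, :product_permalink, :created_at,
                      keyword_init: true)

# Build the merge vars exactly as the worker above does.
def merge_vars_for(sale)
  {
    PRODUCT: sale.product_permalink,
    PURCH: 't',
    GUID: sale.guid,
    AMOUNT: sale.amount,
    PURCHAT: sale.created_at.strftime('%Y-%m-%d %H:%M:%S')
  }
end

# Guard against the silent failure: every key must already be upper case.
def all_caps_keys?(vars)
  vars.keys.all? { |key| key.to_s == key.to_s.upcase }
end

sale = FakeSale.new(email: 'bob@example.com', guid: 'abc123', amount: 3900,
                    product_permalink: 'mmp', created_at: Time.new(2014, 3, 6, 12, 0, 0))
vars = merge_vars_for(sale)
raise 'merge var keys must be upper case' unless all_caps_keys?(vars)
puts vars
```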

The Sidekiq job that processes the sale with Stripe kicks this job off as the very last step in the process. Now I can sit back with a bottle of beer and write emails to the list, confident in the knowledge that everyone who should get an email will.

The Life of a Stripe Charge

2014-01-20T00:00:00+00:00

One of the most common issues that shows up in the #stripe IRC channel is people setting up their front-end Stripe Checkout integration and then expecting a charge to show up, which isn't really how Stripe works. In this post I'm going to walk through a one-off Stripe charge and hopefully illustrate how the whole process comes together.

Tokenization

The first stage in processing credit cards with Stripe is "tokenization", where Stripe turns a credit card number, expiration date, and CVC (card verification code, the three or four digit number printed on your card) into a single use token that your application can use in the second stage.

To illustrate the process, here's a real (test-mode) Stripe checkout button. Go ahead and click it. You can fill it in with the card number 4242 4242 4242 4242, any future expiration date, and any three digit CVC.

And here's the source:

<script
  src="https://checkout.stripe.com/checkout.js" class="stripe-button"
  data-key="pk_test_6pRNASCoBOKtIshFeQd4XMUh"
  data-amount="2000"
  data-name="Demo Site"
  data-description="2 widgets ($20.00)">
</script>

If you have your web console open when you click "pay now" you'll see a request to a URL like this:

https://api.stripe.com/v1/tokens?email=foo%40example.com
  &payment_user_agent=Stripe+Checkout
  &amount=0
  &iovation_blackbox=<a_very_large_string>
  &card[number]=4242+4242+4242+4242
  &card[cvc]=123
  &card[exp_month]=4
  &card[exp_year]=2014
  &card[name]=foo%40example.com
  &key=pk_test_6pRNASCoBOKtIshFeQd4XMUh
  &callback=sjsonp1390180955159
  &_method=POST

There are a few interesting things going on here. Stripe POSTs to a /tokens API endpoint over https, which means everything is encrypted including the query params. These params include the card number, expiration date, and CVC, as well as the email address you put into the form and whether you want Stripe to remember you across sites. The API responds with a JSONP fragment that contains a single-use token that represents this card information.
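If you want to poke at a request like that yourself, Ruby's standard library can decode the query string. This is just a sketch over a shortened copy of the params shown above, with the test card number; the point is only to show that the card details travel to Stripe in that request and never touch your server.

```ruby
require 'uri'

# A shortened copy of the query string from the request above.
query = 'email=foo%40example.com' \
        '&card[number]=4242+4242+4242+4242' \
        '&card[cvc]=123' \
        '&card[exp_month]=4' \
        '&card[exp_year]=2014'

# decode_www_form turns "+" into spaces and undoes the percent-encoding.
params = URI.decode_www_form(query).to_h

puts params['email']
puts params['card[number]']
```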

Under the hood, Stripe is effectively storing the card in its vault of card information for a small amount of time and handing you back a way to refer to it so your server never knows the real information. This roundabout process is due to a set of banking industry regulations named PCI and is the key to Stripe's easy integration. Everyone needs to be PCI compliant, but because your server-side process never knows the real card information, to be compliant you just have to serve all of your pages over HTTPS and use stripe.js or checkout.js to tokenize card information. A few years ago Ken Cochrane put together an excellent developer's guide to PCI compliance that has a whole lot of common-sense advice and guidance.

Stripe::Charge

The second stage of processing a card with Stripe is actually creating a charge. Until you create a charge from your server using your Stripe secret key everything is temporary. Passing an amount parameter to checkout.js as above will put a temporary authorization on the provided card but you still have to create the charge. Here's a Ruby/Sinatra example:

post '/charge' do
  token = params[:stripeToken]
  Stripe.api_key = 'sk_test_abcdef1234567890'
  begin
    Stripe::Charge.create(
      card: token,
      amount: 2000,
      currency: 'usd',
      description: 'test charge'
    )

    redirect '/done'
  rescue Stripe::StripeError => e
    @error = e
    erb :error
  end
end

Between the call to Stripe::Charge and the redirect a whole series of actions happen between your server, Stripe's API, the card network, and at least one bank.

Stripe API servers: The first thing that happens is your application makes an API call to Stripe's servers. This API call contains your Stripe secret key, the amount you want to charge, and the card you want to charge in the form of the token you got from stripe.js or checkout.js.
Card network: The next thing that happens is that Stripe's servers contact the card network. Visa, Mastercard, Discover, and American Express are all examples of card networks. The card network's job is to route transactions to the bank that issued the card. For example, I have a Chase Visa card. Chase is the bank that issued the card and Visa is the card network that processes the transactions. Individual credit card processors don't have to know about all of the banks in the world, they just have to know how to contact the right network. In more traditional forms of credit card processing this step is performed by what's called a payment gateway but Stripe just handles it for you.
Bank: The card network contacts the bank responsible for the card in question and asks to do two things, called an authorize and a capture. An authorize request tells the bank to verify and reserve a certain amount of money out of the card's available credit for a transaction. A capture request tells the bank to actually transfer funds out of their account and into your Stripe account.

Typically authorize and capture requests happen as one step, but sometimes merchants find it useful to be able to authorize for a larger amount than they end up charging, or to verify that you have funds available before they're ready to actually send you something. For example, a gas station will authorize something like $100 on your card temporarily until they know how much gas you pumped. Another example would be ordering a book from Amazon, who will authorize your card for the book but only charge when they ship it out of a warehouse. You can tell Stripe to create an authorization by passing the capture=false param to Stripe::Charge, and later capture it by calling the Stripe::Charge#capture method.

One more note about banks. In the Chase example above, Chase is the bank and Visa is the network, but sometimes the card network is also the bank. The most common examples are American Express and Discover, but there are others.
Back to Stripe: After the bank has either accepted or declined the charge it will respond to the card network, who will respond to Stripe, who will finally respond to your application's server-side Stripe::Charge#create method call and your application can carry on with whatever else it needs to do. In the example above, it redirects the customer's browser to the /done URL. If the customer's bank declined the charge or some other error happened Stripe's API will throw an exception which we can catch and render for the user.
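The authorize and capture split described in the bank step is easier to see with numbers. Here's a toy model of the gas station example. It's purely illustrative: ToyCard is a made-up class, and this is not how any bank or Stripe actually implements holds.

```ruby
# Toy model of a card's credit line with authorize and capture.
class ToyCard
  attr_reader :available_cents, :charged_cents

  def initialize(limit_cents)
    @available_cents = limit_cents
    @charged_cents = 0
    @holds = {}
  end

  # Reserve an amount against the available credit.
  def authorize(id, amount_cents)
    raise ArgumentError, 'declined: insufficient credit' if amount_cents > @available_cents
    @available_cents -= amount_cents
    @holds[id] = amount_cents
  end

  # Charge up to the held amount and release whatever is left of the hold.
  def capture(id, amount_cents = @holds.fetch(id))
    held = @holds.fetch(id)
    raise ArgumentError, 'cannot capture more than was authorized' if amount_cents > held
    @holds.delete(id)
    @available_cents += held - amount_cents
    @charged_cents += amount_cents
  end
end

card = ToyCard.new(50_000)   # $500 of available credit
card.authorize(:gas, 10_000) # the pump holds $100
card.capture(:gas, 3_500)    # you only pumped $35 of gas
puts card.available_cents    # 46500: the unused $65 of the hold is released
puts card.charged_cents      # 3500
```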

Customers and Subscriptions

Stripe-level customers work basically the same way as one-off charges. Tokenization works exactly the same, but on the server, instead of creating a Charge object immediately, you create a Customer object with the stripeToken parameter:

post '/signup' do
  token = params[:stripeToken]
  Stripe.api_key = 'sk_test_abcdef1234567890'
  begin
    Stripe::Customer.create(
      card: token,
      email: params[:stripeEmail]
    )

    redirect '/done'
  rescue Stripe::StripeError => e
    @error = e
    erb :error
  end
end

When you create a customer you can pass in a plan parameter that refers to a previously-created Stripe::Plan. This will immediately start their subscription and transparently charge them for their first period.

You can also create charges using customers instead of cards:

Stripe::Charge.create(
  customer: @customer.id,
  amount: 1000,
  currency: 'usd'
)

Wrap Up

Stripe makes credit card processing simple by wrapping up a bunch of formerly independent pieces, letting you concentrate on your application. That said, knowing the basics of those pieces will help you understand what's going on under the hood and more importantly help you ask the right questions when things don't go quite right.

You should also check out my Stripe Webhook Event Cheatsheet.

A Practical Exercise in Web Scraping

2013-12-15T00:00:00+00:00

Yesterday a friend of mine linked me to a fictional web serial that he was reading and enjoying, but could be enjoying more if it was available as a Kindle book. The author, as of yet, hasn't made one available and has asked that fan-made versions not be linked publicly. That said, it's a very long story and would be much easier to read using a dedicated reading app, so I built my own Kindle version to enjoy. This post is the story of how I built it.

Step 1: Source Analysis

The first step of any kind of web scraping is to understand your target. Here's what the first blog post looks like (with different content):

<h1 class="entry-title">The Whale</h1>
<div class="entry-content">
  <p>
    <a title="Next Chapter" href="http://example.com/the/next/chapter">
      Next Chapter
    </a>
  </p>
  <p>"And what tune is it ye pull to, men?"</p>

  <p>"A dead whale or a stove boat!"</p>

  <p>
    More and more strangely and fiercely glad and approving, grew the
     countenance of the old man at every shout; while the mariners
     began to gaze curiously at each other, as if marvelling how it
     was that they themselves became so excited at such seemingly
     purposeless questions.
  </p>

  <p>
    But, they were all eagerness again, as Ahab, now half-revolving in
    his pivot-hole, with one hand reaching high up a shroud, and
    tightly,   almost convulsively grasping it, addressed them
    thus:&mdash;
  </p>
  <p>
    <a title="Next Chapter" href="http://example.com/the/next/chapter">
      Next Chapter
    </a>
  </p>
</div>

After browsing around I found a table of contents, but since all of the posts were linked together with "Next Chapter" pointers it seemed easier to just walk those. The other interesting thing here is that there's a comment section that I didn't really care about.

Step 2: Choose Your Tools

The next stage of web scraping is to choose the appropriate tools. I started with just curl and probably could have gotten pretty far, but I knew the DOM futzing I wanted to do would require something more powerful later on. At the moment Ruby is where I turn for most things, so naturally I picked Nokogiri. The first example on the Nokogiri docs page is actually a web scraping example, and that's basically what I cribbed from. Here's the initial version of the scraping function:

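require 'open-uri'
require 'nokogiri'

def scrape_page(url)
  # open-uri adds URI.open; bare open(url) stopped accepting URLs in Ruby 3.0
  html = URI.open(url)
  doc = Nokogiri::HTML(html.read)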
  doc.encoding = 'utf-8'

  content = doc.css('div.entry-content').first
  title = doc.css('h1.entry-title')

  next_url = ""

  content.search('a[title="Next Chapter"]').each do |node|
    next_url = node['href']
    node.parent.remove
  end

  {
    title: title,
    content: content,
    next_url: next_url
  }
end

Ruby has a built-in capability for opening URLs as readable files with the open-uri standard library module. Because of various problems with Nokogiri's unicode handling I learned about in previous web scraping experiences, the best thing to do is to pass a string to Nokogiri instead of passing it the actual IO handle. Setting the encoding explicitly is also a best practice.

Then it's a simple matter of using Nokogiri's css selector method to pick out the nodes we're interested in and return them to the caller. The idea is that, since each page is linked to its successor, we can just follow the links.

Step 3: The Inevitable Bugfix Iteration

Of course it's never that easy. Turns out these links are generated by hand, and across hundreds of blog posts of course there will be some inconsistencies. At some point the author stopped using the title attribute. Instead of using the super clever CSS selector a[title="Next Chapter"] I had to switch to grabbing all of the anchor tags and selecting based on the text:

content.search('a').each do |node|
  if node.text == "Next Chapter"
    next_url = node['href']
  end
  node.parent.remove
end

This works great, except that in a few cases there's some whitespace in the text of the anchor node, so I had to switch to a regex:

content.search('a').each do |node|
  if node.text =~ /\s*Next Chapter\s*/
    next_url = node['href']
  end
  node.parent.remove
end
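To see how the two checks differ, here is a small sketch that works on plain strings rather than Nokogiri nodes. The sample link texts are made up for illustration. Note that the regex above is unanchored, so it also matches any text that merely contains "Next Chapter"; the anchored pattern below is a stricter variant, not what the original script used.

```ruby
# Hypothetical link texts of the kind found across hand-written posts.
LINK_TEXTS = [
  "Next Chapter",
  "\n      Next Chapter\n    ",
  "Previous Chapter",
  "Skip to the Next Chapter recap"
].freeze

# Exact comparison: only matches when there is no surrounding whitespace.
def exact_match?(text)
  text == "Next Chapter"
end

# Anchored regex: tolerates leading/trailing whitespace but nothing else.
def next_link?(text)
  text.match?(/\A\s*Next Chapter\s*\z/)
end

LINK_TEXTS.each do |text|
  puts "#{text.inspect}: exact=#{exact_match?(text)} anchored=#{next_link?(text)}"
end
```

If the extra strictness isn't wanted, comparing `text.strip` against the literal string would behave the same way without a regex.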

Another sticking point was that sometimes (but not always) the author used non-ASCII in their URLs. The trick for dealing with possibly-escaped URLs is to check to see if decoding does anything. If it does, it's already escaped and shouldn't be messed with:

def escape_if_needed(url)
  if URI.unescape(url) == url
    return URI.escape(url)
  end
  url
end
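One caveat for anyone running this today: URI.escape and URI.unescape were removed in Ruby 3.0. Below is a sketch of the same check for newer Rubies, assuming the standard library's URI::RFC2396_Parser is an acceptable stand-in. The PARSER constant and the sample URLs are my own illustration.

```ruby
require 'uri'

# URI.escape/URI.unescape are gone in Ruby 3.0+; the RFC 2396 parser in the
# standard library still offers equivalent escape/unescape methods.
PARSER = URI::RFC2396_Parser.new

def escape_if_needed(url)
  # If unescaping changes nothing, the URL has no percent-escapes yet,
  # so it is safe to escape it.
  return PARSER.escape(url) if PARSER.unescape(url) == url

  # Otherwise it is already escaped and should be left alone.
  url
end

puts escape_if_needed("http://example.com/caf%C3%A9")
puts escape_if_needed("http://example.com/café")
```

Both calls print the same escaped URL. The heuristic still has a blind spot: a URL that mixes escaped and unescaped non-ASCII characters will be treated as already escaped.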

Step 4: Repeat As Necessary

Now that we can reliably scrape one URL, it's time to actually follow the links:

task :scrape do
  next_url = 'http://example.com/the/first/chapter/'

  sh "mkdir -p output"

  counter = 0

  while next_url && next_url =~ /example.com/
    STDERR.puts(next_url)

    res = scrape_page(next_url)
    next_url = res[:next_url]
    title = res[:title].text

    File.open("output/#{sprintf('%04d', counter)}.html", "w+") do |f|
      f.puts res[:title]
      f.puts res[:content]
    end

    counter += 1

    sleep 1
  end
end

This is pretty simple. Set some initial state, make a directory to hold the scraped pages, then follow each link in turn and write out the interesting content to sequential files. Note that file names are all four-digit numbers so that the sequence is preserved even with lexicographical sorting.
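A quick illustration of why the zero padding matters, using a handful of made-up chapter numbers:

```ruby
chapters = [1, 2, 10, 11]

# Unpadded names sort as strings, which scrambles the chapter order.
unpadded = chapters.map { |n| "#{n}.html" }

# Zero-padded names sort the same way as the numbers they contain.
padded = chapters.map { |n| format('%04d.html', n) }

p unpadded.sort # => ["1.html", "10.html", "11.html", "2.html"]
p padded.sort   # => ["0001.html", "0002.html", "0010.html", "0011.html"]
```

Four digits is enough for up to 10,000 chapters, which is plenty even for a very long serial.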

Step 5: Actually Build The Book

At first I wanted to use Docverter, my project that mashes up pandoc and calibre for building rich documents (including ebooks) out of plain text files. I tried the demo installation first, but that runs on Heroku and repeatedly ran out of memory so I tried a local installation. That timed out (did I mention that this web serial is also very long?) so instead I just ran pandoc and ebook-convert directly:

task :build do
  File.open("input.html", "w+") do |f|
    Dir.glob('output/*.html').sort.each do |filename|
      f.write File.read(filename)
    end
  end

  STDERR.puts "Running conversion..."

  sh("pandoc --standalone --output=output.epub --from=html --to=epub --epub-metadata=metadata.xml --epub-stylesheet=epub_stylesheet.css input.html")
  sh("ebook-convert output.epub output.mobi")
end

Pandoc can take multiple input files but it was easier to manage one input file on the command line. The stylesheet and metadata xml files are lifted directly from the mmp-builder project that I use to build Mastering Modern Payments, with appropriate authorship information changes.

In Conclusion, Please Don't Violate Copyright

Making your own ebooks is not hard with the tools that are out there. It's really just a matter of gluing them together with an appropriate amount of duct tape and baling twine.

That said, distributing content that isn't yours without permission directly affects authors, and platform shifting like this is sort of a gray area. The author of this web serial seems to be fine with fan-made ebook editions as long as they don't get distributed, so that's why I anonymized this post.

Simple Git-backed Microsites

2013-12-05T00:00:00+00:00

A few days ago I built a new tool I'm calling Sites. It builds on top of git-backed wikis powered by GitHub's Gollum system and lets me build and deploy microsites in the amount of time it takes me to create a CNAME.

Something that I've wanted for a very long time is a way to stand up new websites with little more than a CNAME and a few clicks. I've gone through a few rounds of trying to make that happen but nothing ever stuck. The furthest I got was a Rails app exclusively hosting Comfortable Mexican Sofa, a simple CMS engine. I never ended up putting any sites on it, though.

GitHub's Pages are of course one of the best answers, but I'm sticking to my self-hosting, built-at-home guns.

A Short Code Tour

The code is split up into four distinct parts:

viewer is a Sinatra app that presents wiki content as web pages. It also can serve static assets right from the wiki repo and caches everything in an in-memory LRU cache. If you have a file layout.erb it will wrap the pages in that layout, otherwise it'll pass the content straight to the browser.
manager is another Sinatra app that allows me to create new sites on the fly. Because a site is just a Gollum wiki and a Gollum wiki is just a git repo, it just has to create a git repo at the right place and do a redirect. It bakes in HTTP Basic Auth so other people can't create sites all willy nilly.
some small extensions to Gollum add basic auth and override the method Gollum uses to find the correct wiki repo. By default Gollum wants to be told exactly where the repo is in a class-level Sinatra setting, but that doesn't work when things are dynamic.
a Rack middleware ties it all together. The middleware has three jobs. If the incoming hostname maps to a CNAME that one of the sites has declared in a special wiki page named cnames, pass the request to the viewer app. Otherwise, either pass the request to Gollum for existing sites or to the manager to create a new site. The hard work of building the CNAME to site mapping is cached for a short period of time to minimize disk hits.
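The three jobs of the middleware can be sketched roughly like this. This is a hypothetical outline, not the actual Sites code: the SiteRouter class, its constructor arguments, and the idea of taking the site name from the first path segment are all assumptions made for illustration. The caching of the CNAME mapping is left out.

```ruby
# Hypothetical sketch of the dispatch logic; the real Sites middleware may differ.
class SiteRouter
  def initialize(viewer:, gollum:, manager:, cname_map:, existing_sites:)
    @viewer = viewer
    @gollum = gollum
    @manager = manager
    @cname_map = cname_map            # hostname => site name
    @existing_sites = existing_sites  # names of sites that already have a repo
  end

  def call(env)
    host = env['HTTP_HOST'].to_s.split(':').first

    # Job 1: a hostname declared in some site's cnames page goes to the viewer.
    return @viewer.call(env) if @cname_map.key?(host)

    # Jobs 2 and 3: existing sites go to Gollum, everything else to the manager.
    site = env['PATH_INFO'].to_s.split('/').reject(&:empty?).first
    if site && @existing_sites.include?(site)
      @gollum.call(env)
    else
      @manager.call(env)
    end
  end
end

# Stub Rack apps: any object responding to #call(env) will do.
stub = ->(name) { ->(_env) { [200, { 'content-type' => 'text/plain' }, [name]] } }

router = SiteRouter.new(
  viewer: stub.call('viewer'),
  gollum: stub.call('gollum'),
  manager: stub.call('manager'),
  cname_map: { 'docs.example.com' => 'docs' },
  existing_sites: ['docs']
)

puts router.call({ 'HTTP_HOST' => 'docs.example.com', 'PATH_INFO' => '/' }).last.first
puts router.call({ 'HTTP_HOST' => 'sites.example.com', 'PATH_INFO' => '/docs/Home' }).last.first
puts router.call({ 'HTTP_HOST' => 'sites.example.com', 'PATH_INFO' => '/brand-new' }).last.first
```

The three calls at the end print viewer, gollum, and manager respectively. Because a Rack app is just an object that responds to call, the sketch runs without the rack gem installed.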

The neatest part about this setup is that, since sites are just git repos, I can clone the repo to my laptop and work with it in Emacs instead of directly editing in Gollum if I don't want to. This also lets me easily add asset files and layouts.

Demo Time

Here's what the manager app presents when you just go to sites.bugsplat.info:

I'm the only one that's ever going to be looking at this, so it doesn't really need to be anything fancy.

If you click on one of those links, you'll get the familiar Gollum interface:

To create a new site, just append its name to sites.bugsplat.info:

Notice the new little button. Clicking that creates the repo and sends you back to Gollum to populate the home page:

If you want to see how the demo site is put together, it's running here and the source is on GitHub here.

Installation

I built Sites to fit my infrastructure, which is a delightful bastardization of 12 Factor, so I haven't tried installing it elsewhere. I know it won't run properly on Heroku because it needs to be able to put the git repos in a persistent place, but it might work well as a Docker image. If you get it running somewhere please let me know and I'll link it here.

Simulating a Market in Ruby

2013-12-02T00:00:00+00:00

Trading markets of all kinds are in the news pretty much continuously. The flavor of the week is of course the Bitcoin markets but equity and bond markets are always in the background. Just today there is an article on Hacker News about why you shouldn't invest in the stock market. I've participated in markets in one way or another for about a decade now but I haven't really understood how they work at a base level. Yesterday I built a tiny market simulator to fix that.

Basic Concepts

tl;dr: a market for a commodity is two sorted lists, one of prices someone is willing to pay and another of prices someone is willing to accept.

A market exists to enable people to trade something, whether that be shares of stock or pork futures contracts or cryptocurrency tokens. In modern markets the fundamental core is called the order book. This is an open listing of offers to buy and sell a given commodity at some price. For example:

Pete has 10 shares of TSLA and is willing to sell them at $10 per share
Andrew would like to buy 10 shares of TSLA and is willing to pay $9.99 per share

The order book looks like this:

Buy	Sell
10 @ $9.99	10 @ $10.00

Emily can see the order book because it's public and open. She also has 10 shares of TSLA and decides to match Andrew's bid. She puts in a sell order at $9.99. Now the order book looks like this:

Buy	Sell
10 @ $9.99	10 @ $9.99
	10 @ $10.00

Hold on for a second. Why did Emily's offer bump Pete's down in the list? Each column of the order book is sorted by price and then by time. Buy orders are sorted highest price first, sell orders are sorted lowest first.
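The sorting rule can be sketched in a few lines of Ruby. The Order struct here is a simplified stand-in invented for this example, with prices in cents and a counter in place of a real timestamp:

```ruby
# Simplified stand-in for an order: prices in cents, submitted_at as a counter.
Order = Struct.new(:trader, :price, :submitted_at)

sells = [
  Order.new('Pete', 1000, 1),
  Order.new('Emily', 999, 3)
]
buys = [
  Order.new('Andrew', 999, 2)
]

# Sell column: lowest price first, earlier orders win ties.
sorted_sells = sells.sort_by { |o| [o.price, o.submitted_at] }

# Buy column: highest price first (negated), earlier orders win ties.
sorted_buys = buys.sort_by { |o| [-o.price, o.submitted_at] }

p sorted_sells.map(&:trader) # => ["Emily", "Pete"]
p sorted_buys.map(&:trader)  # => ["Andrew"]
```

Emily's lower ask sorts ahead of Pete's even though his order arrived first, which is exactly the bump shown in the table above.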

Trades only happen at the top of the order book. Because there's a match at the top of the order book the market executes the trade. Emily and Andrew exchange 10 shares of TSLA at $9.99 per share, and Pete is left waiting for another suitor to come along and match his price.

This is how the notional price of an equity or contract is determined by the market. Each time a trade happens that price gets broadcast to the world as the price.

These orders are called limit orders because they say "buy TSLA for $9.99 but no more" or "sell TSLA for $10 but no less." In our example, if Fred doesn't look at the book and decides to put in a buy for $10.05, he'll get his 10 shares at $10.00 from Pete.

Limit orders are the fundamental building block of a market. There are other order types but they're almost always built using one or more limit orders. One notable exception is a market order which orders a specific quantity at whatever the current market price is. (thanks for the corrections, minimax!)

Building a Simulation

Almost all of the above I learned by reading articles about basic trades and actually building a simulation. I decided to practice readme driven development and test driven development for this project, mainly because RDD helps me organize my thoughts and TDD helps me keep the programming going in the right direction. My first pass at the simulation was... well I guess you could say terrible. I completely misunderstood how the order book worked so I built this thing where any price match would execute a trade, not just at the top of the book. You can see that in the first working commit of my simulator.

After thinking about the problem really hard and reading about how order books are actually supposed to work I did some research and came across the algorithms gem. Among other awesome things, this gem includes several implementations of a data structure named the Red Black Tree. This structure keeps its keys sorted during insert and removal, which is perfect for the order book. Each order book consists of a pair of these tree maps, one for buy and one for sell. The keys are the actual Order object and the value is just true, since we only really care about the keys.

The core of the simulation is submitting orders and checking to see if there's a match. Submitting is fairly trivial:

def submit_order(order)
  if order.order_type == :buy
    buy_map.push(order, true)
  else
    sell_map.push(order, true)
  end
end

Because the book is kept sorted, determining a match is also relatively straightforward. Here's the code in the book:

def match?
  return false if buy_map.size == 0 || sell_map.size == 0
  sell_map.min_key.match? buy_map.max_key
end

We just have to get the top of each column in the order book and compare them. The tree map takes care that we can efficiently get both the min and max key.

Here is the implementation of Order#match?:

def match?(other)
  price_match = if order_type == :buy && other.order_type == :sell
    price >= other.price
  else
    price <= other.price
  end

  commodity == other.commodity &&
    quantity == other.quantity &&
    price_match
end

We first determine how to compare the price and then do some sanity checking on commodity and quantity. This simulator is limited to trading orders of exactly the same quantity but real markets can fulfill orders in more complicated ways.
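To try the pieces together without installing the algorithms gem, here is a self-contained sketch that swaps the tree maps for plain sorted arrays. That swap is my simplification, not how the simulator is built, and the OrderBook internals (the sequence counter, best_buy, best_sell) are invented for the example. Prices are in cents.

```ruby
# Stand-in sketch: sorted arrays replace the tree maps from the algorithms gem.
Order = Struct.new(:order_type, :commodity, :quantity, :price) do
  def match?(other)
    price_match = if order_type == :buy && other.order_type == :sell
      price >= other.price
    else
      price <= other.price
    end

    commodity == other.commodity && quantity == other.quantity && price_match
  end
end

class OrderBook
  def initialize
    @buys = []      # entries of [sort_key, sequence, order]
    @sells = []
    @sequence = 0   # stands in for submission time
  end

  def submit_order(order)
    @sequence += 1
    if order.order_type == :buy
      # Negate the price so the highest bid sorts first.
      @buys << [-order.price, @sequence, order]
      @buys.sort_by! { |key, seq, _order| [key, seq] }
    else
      @sells << [order.price, @sequence, order]
      @sells.sort_by! { |key, seq, _order| [key, seq] }
    end
  end

  def best_buy
    @buys.first&.last
  end

  def best_sell
    @sells.first&.last
  end

  def match?
    return false if @buys.empty? || @sells.empty?
    best_sell.match?(best_buy)
  end
end

book = OrderBook.new
book.submit_order(Order.new(:sell, 'TSLA', 10, 1000)) # Pete
book.submit_order(Order.new(:buy, 'TSLA', 10, 999))   # Andrew
puts book.match? # false: the best ask is above the best bid

book.submit_order(Order.new(:sell, 'TSLA', 10, 999))  # Emily
puts book.match? # true: Emily's ask meets Andrew's bid
```

Re-sorting an array on every insert is much slower than a red-black tree for a busy book, which is the reason the tree map is the better fit in the real simulator.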

Learning

So what did I learn yesterday afternoon? A few things. First, readme driven development and test driven development go hand in hand when building a project like this. Writing (and rewriting) the readme helped to clarify what I actually wanted to build, and writing tests both before and after building the implementation helped immensely with keeping the goal clear and my implementation correct.

Second, I gained a much better understanding of how markets work on a basic level. Actually getting in and building something seems to cement the ideas a whole lot better than just reading about them.

Little Data: How do we query personal data?

2013-11-12T00:00:00+00:00

My wife and I recently moved from Portland, OR to Ann Arbor, MI. Among the cacophony of change that is involved with a move like that, we of course changed to the local utility company. Browsing around in their billing application one day I came across a page that showed a daily graph of our energy usage, supposedly valid through yesterday for both gas and electric. And it has a button that spits out a CSV file of the data, which means if I actually wanted to I could build my own tool to analyze our usage.

This got me thinking about all of the little databases out there that have data about me that I might actually consider useful. Useful, in this case, means that by looking at some combination of the data I could draw some useful conclusions and take some sort of action. Here's a list of the ones that immediately came to mind:

Energy usage
LoseIt
Withings
RunKeeper
Bank records
Email

Some of the things in that list have easily accessed APIs, like RunKeeper and Withings. Some of them you can export the data manually to CSVs. And then some of them really should have an API but don't, like LoseIt. And then what if we could add more data? Things like how often we run the clothes dryer, how often the furnace or AC kicks in and for how long, whether we're in the house or not, etc.

None of this is Big Data. This is all Little Data. It's reminiscent of the Quantified Self movement, but it's more about combining existing data sets into a unified whole than synthesizing new data.

I've gone so far as to build isolated silos for some of these data sets. For example, I track my finances using ledger and a variety of home-grown tools. My email gets backed up hourly and that's provided some useful data occasionally. Even so, all of it is still in individual silos. Ideally I would be able to dump all of this into some kind of personal "data soup" that I could query with a unified interface and build analytics tools on top of.

At this point I have a bunch of questions and no answers. What shape does the data soup take? How does data get in, how would it get queried? What questions could I realistically answer? I'm going to keep thinking about this and hopefully you will too.

Post-mortem of a Dead-on-Arrival SaaS Product

2013-10-19T00:00:00+00:00

A little over a year ago I announced the launch of my latest (at the time) product named Marginalia. The idea was to be a sort of online journal. A cheaper, more programmer friendly alternative to Evernote. It never took off, despite my best intentions, and so a few months ago I told the only active user that I was going to shut it down, and today I finally took that sad action. This post is a short history of the project and a few lessons learned.

A Brief History

Marginalia actually started out pretty humbly. See, for a very long time I've been emailing ideas to myself and then forgetting that I did that and thus those ideas were just completely lost. I needed a better way to capture all of this so I could get it out of my head and into something more permanent. In late January 2012 I finally put together a simple Rails application that used a Mailgun incoming account to parse emails sent to a special address and add the Markdown-formatted body content to a new page. The app would reply to that email with the From: address set to a unique address. Anything emailed to that address would get appended to the note with a timestamp.

I used this simple app called "Ideas" for seven months and loved it to pieces. I captured a lot of ideas in there, along with daily work notes and other various things. At some point I started talking about it with other people and they encouraged me to try selling it, and after a lot of consideration I added subscriptions and billing and all of the other little things a real SaaS app needs. Then I wrote a blog post and got it on the front page of HN briefly, and then... nothing. A few signups for the original "pay once" subscription model, a handful of people signing up for the $5/mo plan and then never coming back, and otherwise no other traffic. I kept using it for a while and then built a better mousetrap and moved on.

Lessons Learned

People want their privacy. I originally tried pitching Marginalia as a way to keep a programming journal, which ended up being a losing proposition. First of all, software developers don't want to spend money on tools when their free or already-paid-for editor is just as good. Second, keeping journals of any kind at an external service is something that most developers are not at all into because of the privacy implications.

Itched scratches frequently bleed money. Every month Heroku charged me $29 for SSL and a non-development-level database. Every month for more than a year, with zero income. It's not a lot of money but it's money from my web hosting budget that could have been going toward something more productive. I also spent an awful lot of money on paid advertising trying to get an audience with zero return.

Launching without an audience means nobody shows up. Building a product without an excited, engaged audience is one of those things that software developers tend to do, often and with gusto. It's so easy to build up this idea in your head and in your editor and just expect people to show up after you're done. It's something that I've done three times and it's something I will hopefully never repeat. For my latest product I started with a simple landing page with a Mailchimp signup form. Only after actually determining interest did I move forward with the plan.

Conclusion

The source is up on GitHub if you want to take a look at how it came together. There are some features, including on-page javascript evaluation and data blocks, that never got announced but which I still think were a good idea.

DRY your Rails CRUD with Simple Form and Inherited Resources

2013-09-22T00:00:00+00:00

When you're writing a Rails application you usually end up with a lot of CRUD-only controllers and views just for managing models as an admin. Your user-facing views and controllers should of course have a lot of thought and care put into their design, but for admin stuff you just want to put data in the database as simply as possible. Rails of course gives you scaffolds, but that's quite a bit of duplicated code. Instead, you could use the one-two-three combination of Simple Form, Inherited Resources, and Rails' built-in template inheritance to DRY up most of the scaffolding while still preserving your ability to customize where appropriate. This lets you build your admin interface without having to resort to something heavy like Rails Admin or ActiveAdmin while also not having to build from scratch every time.

Inherited Resources consists of a base controller you can inherit from that implements all of the standard resourceful actions, plus a few convenience things for working with this controller. If you have a Book model, you could create a complete resourceful controller for it with the following code:

class BooksController < InheritedResources::Base
  protected
  def permitted_params
    params.permit(book: [:title, :author, :isbn])
  end
end

As you can see, Inherited Resources has automatic integration with Rails 4's Strong Parameters.

DRY up your CRUD

Rails 3.1 and later ship with a feature called 'template inheritance'. This simply means that there's a default search path for templates that Rails will hunt through to find a suitable template. If there's no index.html.erb in app/views/books, for example, Rails will look in app/views/application because that's BooksController's base class. We can use this to construct default views for our CRUD controllers. First, let's make a base class for our controllers to inherit from:

class CrudController < InheritedResources::Base
  def attrs_for_index
    []
  end

  def attrs_for_form
    []
  end

  helper_method :attrs_for_index
  helper_method :attrs_for_form
end
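To make the lookup order concrete, here is a plain-Ruby sketch of the idea with no Rails involved. The empty controller classes and the view_search_path helper are stand-ins invented for this example; Rails derives its real search path differently, and with Inherited Resources in the chain there is an extra inherited_resources/base entry that this sketch leaves out.

```ruby
# Hypothetical stand-ins for the real controllers; no Rails required.
class ApplicationController; end
class CrudController < ApplicationController; end
class BooksController < CrudController; end

# Walk up the class hierarchy and turn each controller into a view directory,
# roughly mirroring the order in which templates are searched for.
def view_search_path(controller_class)
  controller_class.ancestors
                  .grep(Class)
                  .take_while { |klass| klass != Object }
                  .map { |klass| "app/views/#{klass.name.sub(/Controller\z/, '').downcase}" }
end

puts view_search_path(BooksController)
```

This prints app/views/books, app/views/crud, and app/views/application in that order, so a template dropped into app/views/crud acts as the default for every controller that inherits from CrudController.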

And now we can set up default views. First, app/views/crud/index.html.erb:

<h1>
  <%= resource_class.to_s.pluralize %>&nbsp;
  <small><%= link_to 'New', [:new, resource_class.to_s.downcase.to_sym] %></small>
</h1>
<table>
  <thead>
    <tr>
      <% attrs_for_index.each do |attr| %>
        <th><%= attr.to_s.titlecase %></th>
      <% end %>
      <th></th>
    </tr>
  </thead>
  <tbody>
    <% collection.each do |resource| %>
      <tr>
      <% attrs_for_index.each do |attr| %>
        <td><%= link_to resource.attributes[attr.to_s], resource %></td>
      <% end %>
      <td><%= link_to 'Edit', [:edit, resource] %></td>
      </tr>
    <% end %>
  </tbody>
</table>

Inherited Resources gives you a few helpers that make these views really easy:

collection maps to the collection of objects in your controller. If it was BooksController, collection would return @books. This is only present in the index view.
resource_class returns the class of the resource your controller is managing
resource maps to @book and is available in every view except index.

Now we need a show view. Most of the time, all you want to see is a dump of the resource's attributes, which is exactly what we're going to do in app/views/crud/show.html.erb:

<h1>
  <%= resource_class %> <%= resource.id %>
  <small><%= link_to 'Edit', [:edit, resource] %></small>
</h1>
<table>
  <tr>
    <th>Key</th>
    <th>Value</th>
  </tr>
  <% resource.attributes.sort.each do |key, value| %>
  <tr>
    <td><%= key %></td>
    <td><%= value %></td>
  </tr>
  <% end %>
</table>

The edit and new views are super simple:

<h1>New <%= resource_class.to_s.titlecase %></h1>
<%= render 'form' %>

<h1>Editing <%= resource_class.to_s.titlecase %> <%= resource.id %></h1>
<%= render 'form' %>

Let's look at app/views/crud/_form.html.erb:

<%= simple_form_for resource do |f| %>
  <% attrs_for_form.each do |attr| %>
    <%= f.input attr %>
  <% end %>
  <%= f.button :submit %>
<% end %>

Here's where Simple Form really starts to shine. It auto-detects the proper form input to display by inspecting the attribute, so we don't have to do any work for a basic form. Of course, because we're using view inheritance, if you want to make a more complicated form you can just drop what you want into app/views/<controller>/_form.html.erb and it'll get picked up automatically. The same goes for any of the templates.

We should flesh out BooksController with overrides for those attr methods:

class BooksController < CrudController
  def attrs_for_index
    [:title, :author, :isbn]
  end

  def attrs_for_form
    [:title, :author, :isbn]
  end
end

Sorting, Paging, and Whatnot

You're probably thinking that this whole thing is great, but what if you need to, for example, sort or paginate the results on the index? Inherited Resources has you covered. Just override the collection method, like this:

class BooksController < CrudController
  def collection
    @books ||= end_of_resource_chain.order('created_at DESC')
  end
end

The end_of_resource_chain method gives you your resource relation after applying all of the other neat things that Inherited Resources can do. Check out the README for more details on that.

In the same vein, what if you want to limit the objects created and accessed by the rest of the CRUD actions? For example, let's say you want to limit the books available to the current user:

class BooksController < CrudController
  def begin_of_association_chain
    current_user
  end
end

begin_of_association_chain is where Inherited Resources starts out building objects. If you don't provide it, it defaults to the resource class.

There's a lot more you can do with Inherited Resources and Simple Form, like build controllers that deal with multiple nested resources and forms that have automatically populated select dropdowns. You should check them out.

Essential Tools for Starting a Rails App in 2013

2013-09-15T00:00:00+00:00

Over the past few years I've written a number of Rails applications. It's become my default "scratch an itch" tool for when I need to build an app quickly to do a task. Even though Rails is mostly batteries-included, there are a few tools that make writing new applications so much easier. This is my list of tools that I use for pretty much every new Rails project.

Edit: The discussion on Hacker News has some great gems that you should consider using as well.

Dotenv

Dotenv is a simple gem that loads environment variables from a file named .env in your project root into the ENV hash within Ruby. Getting configuration from the environment is one of the factors in 12 Factor Applications, and using a .env file for development eases the transition to deploying on Heroku. Or, if you're crazy like me, deploying on your own hardware using a nasty brew of Capistrano and Foreman.
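To show the idea, here is a minimal sketch of what loading a .env file amounts to. This is not Dotenv's actual implementation: the load_env helper is made up, it ignores quoting and other syntax the gem supports, and the variable names and values are placeholders.

```ruby
# Minimal illustration of the idea; this is not Dotenv's actual implementation.
def load_env(text, env = ENV)
  text.each_line do |line|
    line = line.strip
    next if line.empty? || line.start_with?('#')

    key, value = line.split('=', 2)
    next if value.nil?

    # Like Dotenv's default behavior, leave already-set variables alone.
    env[key.strip] ||= value.strip
  end
  env
end

sample = <<~ENVFILE
  # Hypothetical development settings
  DATABASE_URL=postgres://localhost/myapp_dev
  STRIPE_SECRET_KEY=sk_test_example
ENVFILE

# Load into a plain hash here so the example doesn't touch the real ENV.
settings = load_env(sample, {})
puts settings['DATABASE_URL']
```

In a real project you would use the gem itself rather than anything like this; the point is only that the app reads its configuration from the environment either way, whether the values came from a .env file in development or from the host in production.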

Devise

Most Rails apps are going to need a way to authenticate users. You could write something yourself, but there are a lot of subtle security concerns that you have to take into account. By using an off the shelf product like Devise you're insulated from having to worry about that. Some people use AuthLogic, which is also perfectly fine.

Brakeman

There have been quite a few security vulnerabilities over the past year or so inside Rails, some of which are due to Rails itself, but many are coding errors or best practices that, over time, have turned out to be not the best. Brakeman is a security scanner that looks at your code base for both categories of error and tells you if you're doing something wrong. I run Brakeman over my codebase as part of my test suite so I know immediately when I'm doing something that isn't quite right.

Rails Best Practices

In a similar vein to Brakeman, Rails Best Practices is a list of best practices that anyone can add to, vote on, and modify. They provide a scanner that looks for violations of these best practices and tells you about them. I also run this as part of my test suite, not because they're necessarily security focused, but hard-won experience has taught me that doing (most of) the things that RBP says to do leads to a more maintainable codebase. They provide a configuration file that you can tweak, in case the scanner starts warning on something that you don't think it should.

Simple Form

Much of what we do as Rails developers boils down to making simple CRUD forms to work with models. Much of this is going to be inside an admin interface that users never actually see so we want to get the job done as quickly as possible. Simple Form lets you write the simplest form declaration possible and bakes in a lot of useful things like error and validation handling. It's also compatible with a number of CSS frameworks like Zurb Foundation and Bootstrap. I tend to use Simple Form in lieu of an admin interface generator like ActiveAdmin, mostly because I haven't had much luck getting those to play with Rails 4.

Sidekiq

At some point every Rails application is going to need to do some background processing, especially if you're making server-side calls to other web services. These should always be done outside of a web request because Rule Number 1 is The network is unreliable (the PDF in the sources block is a great explanation of the problems of distributed computing, btw). I've explored a number of different background processing systems for Rails and the best that I've found is named Sidekiq. It uses fewer resources per worker than any of the rest and it is super easy to manage.

Adventures in Self Publishing

2013-09-03T00:00:00+00:00

Three months ago I decided to write my first technical book and it's earned me over $5,000 in the two weeks since launch day, so I decided to share what I've learned.

I had been reading Nathan Barry's excellent book Authority and something about it inspired me. I started throwing around ideas, things that I knew well and that weren't well covered already, and I turned up Stripe. I know Stripe very well having used it for a bunch of projects in the past few years. I also know Rails, using it in most of those projects plus at my day job. I knew for sure that there were things about payment processing that weren't really talked about much in the 10 minute Stripe tutorials. Thus began my five month journey of writing and self-publishing Mastering Modern Payments: Using Stripe with Rails.

Why self publish?

For me, it wasn't even really a question. There's no way a traditional publisher would be interested in a tiny niche topic like this, and even if they were I wanted as much control as I could get for my first major writing project. Not to mention, the amount of money I would get from direct sales is vastly more than I would get from the same number of sales if I were getting royalties.

So, why not something like Leanpub? Again, it comes down to control. Leanpub provides a valuable service but I wanted to control the entire experience, from building the book files to the landing page all the way through to the sales experience. A great part of the value of the guide is that the Guide + Code edition comes with the code for the application that actually sells the book, and with Leanpub that wouldn't be possible.

Process and Tools

My writing process is pretty simple. I make an outline, at first just broad themes for chapters or sections. I step back, play with the order a little maybe, and then start adding more detail to the outline. At some point I get bored with that and start filling in chunks with prose. This is the same process I've used for most of my blog posts and it's been pretty successful.

Editing was a much bigger job for the guide than it has been for my blog, of course. I did a first pass by hand on actual paper with an actual red pen and then followed that up with another pass on the computer. Shortly after that I released the guide as preorders to my mailing list (more on that below) and they provided vast quantities of feedback, ranging from typos to bug fixes all the way through to high level feedback on the shape and intent of the guide. I can't thank those reviewers enough. They made it possible to have a polished product on launch day.

The tools I used for most of the writing process:

Emacs and Markdown mode. The book is entirely written in Markdown, except for a very few sections written in raw HTML. I use the excellent Markdown Mode for nearly everything I write, since it just gets everything right.
Docverter. I use Pandoc and Flying Saucer via a local instance of Docverter to build the various formats of the guide.
Rake and Redcarpet. I wrote a few different custom Markdown renderers using Redcarpet that do things like highlight code samples with Pygments, check the syntax of all of the Ruby code samples, spider all of the links in the book, and finally render out the table of contents how I prefer it.
Gimp for the cover design.
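
One of the checks mentioned above, verifying that every Ruby code sample at least parses, can be sketched with nothing but the standard library. This is not the Redcarpet renderer from the book's build. It assumes the samples live in fenced blocks tagged ruby, and the helper names are made up for illustration.

```ruby
# Three backticks, built indirectly so this sample can itself sit inside a fence.
FENCE = "`" * 3

# Hypothetical helper: pull the bodies of fenced ruby blocks out of Markdown text.
def ruby_samples(markdown)
  markdown.scan(/^#{FENCE}ruby\n(.*?)^#{FENCE}/m).flatten
end

# True when the snippet parses. The code is compiled but never executed.
def valid_ruby?(source)
  RubyVM::InstructionSequence.compile(source)
  true
rescue SyntaxError
  false
end

# Returns the samples that fail to parse, so a build task can report them.
def invalid_ruby_samples(markdown)
  ruby_samples(markdown).reject { |code| valid_ruby?(code) }
end
```

A Rake task could run invalid_ruby_samples over each chapter file and fail the build when the returned list is not empty.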

I've open sourced the code that I used to build the book. It's very specific to Mastering Modern Payments and the process that I eventually settled on, but I'm hopeful that it will provide inspiration for others that might be contemplating the same kind of self-publishing venture. It includes the PDF template and style sheet I use, along with the Rakefile that contains all of the logic for building the guide in its various formats.

Preorders

The first thing I did after actually deciding to write MMP was to put up a landing page and start collecting email addresses. On July 15th I started selling preorders, which in this case consisted of 30% off the final purchase price in exchange for an advance copy of the guide along with regular weekly updates as people sent in feedback. This turned out to be highly lucrative and extremely motivational. Preorder sales totaled over $3,000. Dozens of people were interested enough in this project to pay early. If you're developing an info product you should definitely consider some sort of preorder arrangement.

Numbers

I kept a simple journal throughout the main writing period, and adding it all up I wrote for about 70 hours. The total time spent on the whole project is probably double that because I didn't count much of the development of the companion Rails application, editing the guide, writing emails to customers, or all of the other little tasks that go into a project like this.

As of yesterday, the guide has grossed $8,603 (this includes preorders):

99 copies of Just the Guide for $2,485
108 copies of Guide + Code for $5,341
3 copies of the Team license for $777

(note that these numbers don't add up to the straight price * copies amount because I've had both 10% and 30% discounts going at various times)

The biggest driver by far was HN traffic from two posts on launch day, August 15th. The second biggest driver was a link in Ruby Weekly. I purchased an ad on /r/rails and sponsored The Changelog podcast, neither of which directly generated sales but may have driven direct site visits and sales later on. There wasn't a whole lot of time to put thought into tracking visits from The Changelog so I don't have any direct numbers.

Since launch day the landing page has gotten 6,084 unique page views driving 136 sales for an overall conversion rate of 2.2%. Prior to launch day, the mailing list converted over 30% for preorders.
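
For anyone who wants to check the arithmetic, here is a small sketch that recomputes the gross and the conversion rate from the figures quoted above. The constant and method names are only for illustration.

```ruby
# Figures copied from the post; revenue is in whole dollars.
REVENUE = {
  "Just the Guide" => 2_485,
  "Guide + Code"   => 5_341,
  "Team license"   => 777
}.freeze

# Total revenue across all editions.
def gross_revenue(revenue_by_edition)
  revenue_by_edition.values.sum
end

# Sales per unique page view, as a percentage rounded to one decimal place.
def conversion_rate(sales, unique_views)
  (sales.to_f / unique_views * 100).round(1)
end

puts gross_revenue(REVENUE)      # 8603
puts conversion_rate(136, 6_084) # 2.2
```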

Lessons Learned

Get help sooner in the process

When I started this project I was all by myself which hampered editing and reviewing a lot. I didn't get my first comprehensive external review of the thing until after I had sold the first few copies. Next time I'll assemble a few reliable technical reviewers ahead of time so that they can start reading drafts before publication.

Write more guest blog posts

This is something that Nathan espouses in Authority and I just didn't have time to get to. I only had one guest blog post on launch day and it didn't generate much traffic at all, let alone sales. Next time I'll have at least three lined up, as well as having coverage in the relevant weekly newsletters and podcasts set up beforehand.

Write more relevant content for my email list

Early on I was sending interesting little Stripe and Rails things to my mailing list, but after preorders started it became product updates every time. I think at some point I started overselling them. For my next product I'm going to try to alternate product updates with interesting, relevant information that isn't directly related to the product.

Don't have so much going on while developing and selling

I had far too many things going on when I decided to launch. Here's an abbreviated schedule for launch day and the following:

2013-08-12 — Start packing for move
2013-08-15 — Launch day AND travel day on airplanes
2013-08-16 — Out of internet range, getting things set up for the wedding
2013-08-17 — Wedding Day
2013-08-18 — Travel back to Portland, get ready to move
2013-08-23 — Pack everything we own and start driving across the country
2013-08-28 — Attend a funeral
2013-08-29 — Sign a new lease in Michigan

I knew about the wedding and moving well ahead of time, so why did I pick August 15th? Lack of critical thinking, clearly.

Conclusion

I'd like to take the opportunity to thank my wife, who has put up with not only this project but all of my projects over the last two years. She's the most amazing, understanding person that I've ever met. I'd also like to thank Michael Buckbee, Andrew Culver, and a whole host of other people who kept me motivated and on track, right to the end.

This project has been an amazing experience. I've learned so much about how to build a product, how to build an audience, and how to make a success, and it's not even over. I'll be releasing periodic updates with corrections and new information since Stripe and Rails are both active, changing products.

Mastering Modern Payments Is Out Today!

2013-08-15T00:00:00+00:00

I'm so proud to announce that Mastering Modern Payments: Using Stripe with Rails is officially launching this morning. Mastering Modern Payments is your guide to integrating Stripe with your Rails application and is packed with sample code and best practices that will make sure your integration works now and in the future.

I've collected for you the best resources from around the web and added my own experiences and stories to help you make your Stripe integration as robust as it can be. Click below to check it out!

Find Out More about Mastering Modern Payments

DNS: The Good Parts

2013-07-19T00:00:00+00:00

Frequently I come across confusion with domain names. Why doesn't my website work? Why is this stupid thing broken, everything I try fails, I just want it to work!! Invariably the question asker either doesn't know what DNS is or doesn't understand how something fundamental works. More generally, people think that DNS is scary or complicated. This article is an attempt at quelling that fear. DNS is easy once you understand a few basic concepts.

What is DNS

First things first. DNS stands for Domain Name System. Fundamentally it's a globally distributed key value store. Servers around the world can give you the value associated with a key, and if they don't know they'll ask other servers for the answer.

That's it. That's all there is to it. You (or your web browser) ask for the value associated with the key www.example.com and get back 1.2.3.4.

Basic Exploration and Fundamental Types

The great thing about the DNS is that it's completely public and open so it's easy to poke around. Let's do a little exploring, starting with this domain, petekeen.net, which I am hosting on a machine named web01.bugsplat.info. Note that you can run all of these examples from an OS X or Linux command line.

First, let's look at a simple domain name to IP address mapping:

$ dig web01.bugsplat.info

The dig command is a veritable Swiss Army knife for querying DNS servers and we'll be using it quite a bit. Here's the first part of the response:

; <<>> DiG 9.7.6-P1 <<>> web01.bugsplat.info
;; global options: +cmd
;; Got answer:
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 51539
;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 0, ADDITIONAL: 0

There's only one interesting thing in here. We asked for one record and got exactly one response. Here's the question we asked:

;; QUESTION SECTION:
;web01.bugsplat.info.       IN  A

dig defaults to asking for A records. A stands for address and is one of the fundamental types of records in the DNS. An A record holds exactly one IPv4 address. There's an equivalent record for IPv6 addresses named AAAA. Next, let's look at the answer our DNS server gave us:

;; ANSWER SECTION:
web01.bugsplat.info.    300 IN  A   192.241.250.244

This says the host web01.bugsplat.info. has exactly one A address: 192.241.250.244. The 300 is called the TTL value, or time to live. It's the number of seconds that this record can be cached before it needs to be checked again. The IN component stands for Internet and is meant to disambiguate between the various types of networks that the DNS historically was responsible for. You can read about those in IANA's DNS Parameters document (thanks for the correction, mcmatterson!)
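
To make the field layout concrete, here is a minimal sketch that splits an answer line like the one above into its parts. The DnsRecord struct and parse_answer_line are hypothetical names, and the sketch assumes whitespace-separated fields in the order dig prints them.

```ruby
# One answer line: owner name, TTL in seconds, class, record type, record data.
DnsRecord = Struct.new(:name, :ttl, :klass, :type, :data)

# Split into at most five fields so data such as "60 web01.bugsplat.info."
# (an MX preference plus a host name) stays together in the last field.
def parse_answer_line(line)
  name, ttl, klass, type, data = line.strip.split(/\s+/, 5)
  DnsRecord.new(name, Integer(ttl), klass, type, data)
end

record = parse_answer_line("web01.bugsplat.info.    300 IN  A   192.241.250.244")
puts "#{record.name} -> #{record.data} (cacheable for #{record.ttl} seconds)"
```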

The rest of the response tells you things about the response itself:

;; Query time: 20 msec
;; SERVER: 192.168.1.1#53(192.168.1.1)
;; WHEN: Fri Jul 19 20:01:16 2013
;; MSG SIZE  rcvd: 56

Specifically, it tells you how long it took for your server to respond, what that server's IP address is (192.168.1.1), which port dig queried (53, the default DNS port), when the query completed, and how many bytes the response contained.

As you can see, there's an awful lot going on in a single DNS query. Every time you open a web page your browser makes literally dozens of these queries to resolve the web host, all of the hosts where external resources like images and scripts are located, etc. Every single resource involves at least one DNS query, which would involve an awful lot of traffic if DNS wasn't designed to be heavily cached.

What you probably can't see, however, is that the DNS server at 192.168.1.1 contacted a whole chain of other servers in order to answer that simple question of what address does web01.bugsplat.info map to. Let's run a trace to see all of the servers that dig would have to contact if they weren't already cached:

$ dig +trace web01.bugsplat.info

; <<>> DiG 9.7.6-P1 <<>> +trace web01.bugsplat.info
;; global options: +cmd
.           137375  IN  NS  l.root-servers.net.
.           137375  IN  NS  m.root-servers.net.
.           137375  IN  NS  a.root-servers.net.
.           137375  IN  NS  b.root-servers.net.
.           137375  IN  NS  c.root-servers.net.
.           137375  IN  NS  d.root-servers.net.
.           137375  IN  NS  e.root-servers.net.
.           137375  IN  NS  f.root-servers.net.
.           137375  IN  NS  g.root-servers.net.
.           137375  IN  NS  h.root-servers.net.
.           137375  IN  NS  i.root-servers.net.
.           137375  IN  NS  j.root-servers.net.
.           137375  IN  NS  k.root-servers.net.
;; Received 512 bytes from 192.168.1.1#53(192.168.1.1) in 189 ms

info.           172800  IN  NS  c0.info.afilias-nst.info.
info.           172800  IN  NS  a2.info.afilias-nst.info.
info.           172800  IN  NS  d0.info.afilias-nst.org.
info.           172800  IN  NS  b2.info.afilias-nst.org.
info.           172800  IN  NS  b0.info.afilias-nst.org.
info.           172800  IN  NS  a0.info.afilias-nst.info.
;; Received 443 bytes from 192.5.5.241#53(192.5.5.241) in 1224 ms

bugsplat.info.      86400   IN  NS  ns-1356.awsdns-41.org.
bugsplat.info.      86400   IN  NS  ns-212.awsdns-26.com.
bugsplat.info.      86400   IN  NS  ns-1580.awsdns-05.co.uk.
bugsplat.info.      86400   IN  NS  ns-911.awsdns-49.net.
;; Received 180 bytes from 199.254.48.1#53(199.254.48.1) in 239 ms

web01.bugsplat.info.    300 IN  A   192.241.250.244
bugsplat.info.      172800  IN  NS  ns-1356.awsdns-41.org.
bugsplat.info.      172800  IN  NS  ns-1580.awsdns-05.co.uk.
bugsplat.info.      172800  IN  NS  ns-212.awsdns-26.com.
bugsplat.info.      172800  IN  NS  ns-911.awsdns-49.net.
;; Received 196 bytes from 205.251.195.143#53(205.251.195.143) in 15 ms

The DNS is arranged in a hierarchy. Remember how dig inserted a single . after the hostname we asked for before, web01.bugsplat.info? Well, that . is pretty important and stands for the root of the hierarchy. The root DNS servers are run by various companies and governments around the world. Originally there were only a handful of these servers but as the Internet has grown more have been added, so that now there are notionally 13. Each one of these servers, however, has dozens or hundreds of physical machines hiding behind a single IP.

So, at the top of the trace we see the root servers, each represented by an NS record. An NS record maps a domain name, in this case the root, to a DNS server. When you register a domain name with a registrar like Namecheap or GoDaddy they create NS records for you.

In the next block you can see that dig randomly picked one of the root server responses and asked it for the A record for web01.bugsplat.info. Which root server? Let's ask!

$ dig -x 192.5.5.241

; <<>> DiG 9.8.3-P1 <<>> -x 192.5.5.241
;; global options: +cmd
;; Got answer:
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 2862
;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 0, ADDITIONAL: 0

;; QUESTION SECTION:
;241.5.5.192.in-addr.arpa.  IN  PTR

;; ANSWER SECTION:
241.5.5.192.in-addr.arpa. 3261  IN  PTR f.root-servers.net.

The -x flag tells dig to do a reverse lookup on the given IP address. The DNS responds with a PTR record which maps an IP address to a hostname, in this case f.root-servers.net.
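
The name in the question section is just the IP address with its octets reversed and in-addr.arpa appended. A small sketch of that transformation, with a made-up helper name and IPv4 only:

```ruby
# Build the name queried for a reverse (PTR) lookup of an IPv4 address.
def reverse_lookup_name(ipv4)
  octets = ipv4.split(".")
  valid = octets.length == 4 &&
          octets.all? { |o| o.match?(/\A\d{1,3}\z/) && o.to_i <= 255 }
  raise ArgumentError, "not an IPv4 address: #{ipv4}" unless valid

  # Reverse the octets and append the reverse-lookup zone.
  octets.reverse.join(".") + ".in-addr.arpa."
end

puts reverse_lookup_name("192.5.5.241") # 241.5.5.192.in-addr.arpa.
```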

Getting back to our original query, the F root server responded with another set of NS servers, this time the ones responsible for the info top level domain. dig asks one of these servers for the A record for web01.bugsplat.info, gets back another set of NS servers, and then asks one of those servers for the A record for web01.bugsplat.info. and finally receives an actual answer. (thanks for the corrections, colmmacc!)
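
The walk down the hierarchy follows the labels of the name from right to left. As a rough sketch, assuming one zone per label (real delegations do not have to line up that neatly), the chain for a hostname can be listed like this. The method name is hypothetical.

```ruby
# List the names from the root down to the full hostname,
# in the order a trace would visit them.
def delegation_chain(hostname)
  labels = hostname.chomp(".").split(".")
  # Build each suffix ("web01.bugsplat.info.", "bugsplat.info.", "info.")
  suffixes = labels.each_index.map { |i| labels[i..-1].join(".") + "." }
  ["."] + suffixes.reverse
end

puts delegation_chain("web01.bugsplat.info").inspect
# [".", "info.", "bugsplat.info.", "web01.bugsplat.info."]
```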

Whew! That would be a heck of a lot of traffic, except that almost all of these entries are cached for a long time by every server in the chain. Your computer caches too, as does your browser. Most of the time DNS resolution will never touch the root servers because their IP addresses hardly ever change. The top level domains com, net, org, etc, are also generally heavily cached.

Other Types

There are a few other types that you should be aware of. The first is MX, which maps a domain name to one or more email servers. Email is so important to the functioning of the Internet that it gets its own record type. Here are the MX records for petekeen.net:

$ dig petekeen.net mx

; <<>> DiG 9.7.6-P1 <<>> petekeen.net mx
;; global options: +cmd
;; Got answer:
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 18765
;; flags: qr rd ra; QUERY: 1, ANSWER: 2, AUTHORITY: 0, ADDITIONAL: 0

;; QUESTION SECTION:
;petekeen.net.          IN  MX

;; ANSWER SECTION:
petekeen.net.       86400   IN  MX  60 web01.bugsplat.info.

;; Query time: 272 msec
;; SERVER: 192.168.1.1#53(192.168.1.1)
;; WHEN: Fri Jul 19 20:33:43 2013
;; MSG SIZE  rcvd: 93

Note that an MX record points at a name and not an IP address.

The other record type that you should be familiar with is CNAME which stands for Canonical Name and maps one name onto another. Let's look at the response we get for a CNAME:

$ dig www.petekeen.net

; <<>> DiG 9.7.6-P1 <<>> www.petekeen.net
;; global options: +cmd
;; Got answer:
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 16785
;; flags: qr rd ra; QUERY: 1, ANSWER: 2, AUTHORITY: 0, ADDITIONAL: 0

;; QUESTION SECTION:
;www.petekeen.net.      IN  A

;; ANSWER SECTION:
www.petekeen.net.   86400   IN  CNAME   web01.bugsplat.info.
web01.bugsplat.info.    300 IN  A   192.241.250.244

;; Query time: 63 msec
;; SERVER: 192.168.1.1#53(192.168.1.1)
;; WHEN: Fri Jul 19 20:36:58 2013
;; MSG SIZE  rcvd: 86

The first thing to notice is that we get back two answers. The first says that www.petekeen.net maps to web01.bugsplat.info. The second gives the A record for that server. One way to think about a CNAME is as an alias for another domain name.

Why CNAME is Messed Up

CNAMEs are incredibly useful, but they have one very important gotcha: if a CNAME exists for a particular name, that is the only record allowed for that name. No MX, no A, no NS, no nothing. This is because the DNS substitutes the CNAME's target for its own value, so every record valid for the target is also valid for the CNAME. This is why you can't have a CNAME on a root domain like petekeen.net, because you generally have to have other records for that domain like MX.
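
If you keep your records in some structured form, the rule is easy to check mechanically. Here is a sketch that assumes records are plain hashes with :name and :type keys, which is a made-up representation. It ignores the DNSSEC-related exceptions to the rule.

```ruby
# Return the names that have a CNAME alongside any other record.
def cname_conflicts(records)
  by_name = records.group_by { |record| record[:name] }
  conflicting = by_name.select do |_name, group|
    types = group.map { |record| record[:type] }
    types.include?("CNAME") && types.length > 1
  end
  conflicting.keys
end

records = [
  { name: "petekeen.net.",     type: "CNAME" },
  { name: "petekeen.net.",     type: "MX" },
  { name: "www.petekeen.net.", type: "CNAME" }
]
puts cname_conflicts(records).inspect # ["petekeen.net."]
```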

Querying Other Servers

Let's say for sake of argument that you messed up a DNS configuration. You think you've fixed the problem, but you don't want to wait for the cache to expire to see. With dig you can actually query one of a number of public DNS servers instead of your default server like this:

$ dig www.petekeen.net @8.8.8.8

The @ symbol followed by an IP address or hostname tells dig to query that server on the default DNS port. I use this a lot to query Google's public DNS servers or Level 3's sort-of-public servers at 4.2.2.2.
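
The same idea works from Ruby using the Resolv library that ships with the language. This is a sketch rather than a replacement for dig. The helper names are my own, and the lookup needs network access when it is actually called.

```ruby
require "resolv"

# Build a resolver that talks to one specific DNS server
# instead of the system default.
def resolver_for(server)
  Resolv::DNS.new(nameserver: [server])
end

# Return the addresses the given server reports for a name, as strings.
def a_records(name, server)
  resolver_for(server).getaddresses(name).map(&:to_s)
end
```

Calling a_records("www.petekeen.net", "8.8.8.8") asks Google's public server directly, much like the dig command above.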

Common Situations

In this last section we'll talk about some common situations that web developers find themselves in.

Redirect bare domain to www

Almost always you'll want to redirect a bare domain like iskettlemanstillopen.com to www.iskettlemanstillopen.com. Registrars like Namecheap and DNSimple call this a URL Redirect. In Namecheap you would set up a URL Redirect like this:

The @ stands for the root domain iskettlemanstillopen.com. Let's look at the A record for that domain:

$ dig iskettlemanstillopen.com
;; QUESTION SECTION:
;iskettlemanstillopen.com.  IN  A

;; ANSWER SECTION:
iskettlemanstillopen.com. 500   IN  A   192.64.119.118

That IP is owned by Namecheap and is running a small web server that just serves up an HTTP-level redirect to http://www.iskettlemanstillopen.com:

$ curl -I iskettlemanstillopen.com
HTTP/1.1 302 Moved Temporarily
Server: nginx
Date: Fri, 19 Jul 2013 23:53:21 GMT
Content-Type: text/html
Connection: keep-alive
Content-Length: 154
Location: http://www.iskettlemanstillopen.com/

CNAME to Heroku or GitHub

Notice in the screenshot above that there's a second row defining a CNAME. In this case www.iskettlemanstillopen.com maps to an application running on Heroku. You'll have to set up Heroku with a similar domain mapping, of course:

$ heroku domains
=== warm-journey-3906 Domain Names
warm-journey-3906.herokuapp.com
www.iskettlemanstillopen.com

GitHub is similar, except that the mapping lives in a file called CNAME at the root of your pages, as described in their documentation.

Wildcards

Most DNS servers allow you to set up DNS wildcards. For example, I have a wildcard CNAME set up for *.web01.bugsplat.info that maps to web01.bugsplat.info. That way I can host arbitrary things on web01 and not have to create new DNS entries for them every time:

$ dig randomapp.web01.bugsplat.info

;; QUESTION SECTION:
;randomapp.web01.bugsplat.info. IN  A

;; ANSWER SECTION:
randomapp.web01.bugsplat.info. 300 IN CNAME web01.bugsplat.info.
web01.bugsplat.info.    15  IN  A   192.241.250.244

Wrap Up

Hopefully this gives you a good beginning understanding of what DNS is and how to go about exploring and verifying your configuration. Just remember that you can always ask the DNS questions and generally get back answers. The Internet standards (RFCs) that define DNS are:

RFC 1034: Domain Names - Concepts and Facilities
RFC 1035: Domain Names - Implementation and Specification

There are a few more interesting RFCs as well, including 4034 about a standard named DNSSEC and 5321 which talks about DNS as it relates to email. These are all fascinating reads if you want more background information.

This article is featured in Hacker Monthly issue 42.

Announcing: Mastering Modern Payments: Using Stripe with Rails

2013-07-15T00:00:00+00:00

Over the past few years I've put together a number of projects that use Stripe and their Ruby API to collect payments and manage subscriptions. I've learned quite a bit about how to effectively use the things that Stripe provides to my best advantage. Two months ago I decided that I would like to share that knowledge, so I started working on a guide to integrating Stripe with Rails. Today I'd like to announce that Mastering Modern Payments: Using Stripe with Rails will be available on August 15th, 2013.

Mastering Modern Payments covers the gamut of Stripe integrations, from the simplest using checkout.js all the way through custom payment forms, subscriptions, background workers and more. It's my first ebook and I couldn't be more excited.

My intended audience is Ruby developers who know Rails well but need some help with their first Stripe integration or want help figuring out how to make their existing integration better. If that's you, you should sign up for the mailing list to get updates and announcements leading up to launch day as well as a 10% off coupon code and a free chapter named State and History which covers how to easily keep an audit trail on your sales records.

Find Out More about Mastering Modern Payments

Shipping with Stripe and EasyPost

2013-07-02T00:00:00+00:00

Let's say that instead of running a Software as a Service, you're actually building and shipping physical products. Let's say quadcopter kits. People come to your website, buy a quadcopter kit, and then you build it and ship it to them. It takes you a few days to build the kit, though, and you would rather not charge the customer until you ship. Traditionally Stripe has been focused on paying for online services but recently they added the ability to authorize and capture payments in two steps. In this post we're going to explore billing with Stripe and shipping with EasyPost, with authorization and capture as separate steps.

Step 1: Calculate Shipping

EasyPost makes it really simple to calculate shipping rates. Just take the customer's shipping address and create an EasyPost::Shipment object:

from_address = EasyPost::Address.create(
  name:    'Pete Keen',
  street1: '618 NW Glisan Ave',
  city:    'Portland',
  state:   'OR',
  zip:     '97211',
  country: 'US',
  email:   'pete@petekeen.net'
)

to_address = EasyPost::Address.create(
  name:    params[:to_name],
  street1: params[:to_street1],
  city:    params[:to_city],
  state:   params[:to_state],
  zip:     params[:to_zip],
  country: params[:to_country],
  email:   params[:email],
)

parcel = EasyPost::Parcel.create(
  length: 10,
  width: 10,
  height: 6,
  weight: 30,
)

shipment = EasyPost::Shipment.create(
  to_address: to_address,
  from_address: from_address,
  parcel: parcel
)

@rates = shipment['rates']

Display those rates to the customer, have them pick the one they want, and then move on to the next step.
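
For the display step, here is a sketch of sorting the rates cheapest first and turning each into a label. It works on plain hashes shaped like the ones read with rate['rate'] elsewhere in this post. The sample data is made up, and the carrier and service keys are assumptions about what each rate carries.

```ruby
require "bigdecimal"

# Made-up sample data; the carrier and service values are placeholders.
SAMPLE_RATES = [
  { "carrier" => "CarrierA", "service" => "Express", "rate" => "24.50" },
  { "carrier" => "CarrierB", "service" => "Ground",  "rate" => "5.10" },
  { "carrier" => "CarrierA", "service" => "Ground",  "rate" => "11.75" }
].freeze

# Rates arrive as decimal strings, so compare them as decimals, not as text.
def cheapest_first(rates)
  rates.sort_by { |rate| BigDecimal(rate["rate"]) }
end

# A one-line label suitable for a radio button or select option.
def rate_label(rate)
  "#{rate['carrier']} #{rate['service']}: $#{rate['rate']}"
end

cheapest_first(SAMPLE_RATES).each { |rate| puts rate_label(rate) }
```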

Step 2: Authorize

Charging a card and just doing the authorization step are remarkably similar. Just get the user's credit card info with stripe.js or checkout.js and then make the charge with the capture parameter set to false:

charge = Stripe::Charge.create(
  card:   params[:stripeToken],
  amount: 10000 + (@selected_rate['rate'].to_f * 100).round,
  currency: 'usd',
  capture: false
)

This will authorize a charge of $100 plus the shipping rate for up to seven days. EasyPost gives back shipping rates as decimal strings, so to give it to Stripe we have to convert it to a number, then to cents, and finally to an integer. Save shipment.id and charge.id to your database; you'll need them later.
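
Going through a float and then truncating can land one cent low for some rates, because a value like 5.10 has no exact binary representation. A sketch that sidesteps the float entirely using BigDecimal from the standard library follows. The helper names are hypothetical.

```ruby
require "bigdecimal"

# Convert a decimal string such as "5.10" into an integer number of cents.
def rate_to_cents(rate)
  (BigDecimal(rate) * 100).round
end

# Base price in cents plus the shipping rate, ready to pass as the amount.
def authorization_amount(base_cents, rate)
  base_cents + rate_to_cents(rate)
end

puts authorization_amount(10_000, "5.10") # 10510
```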

Step 3: Build the Product

You're on your own here. Just make sure to get everything done in 7 days, because after that Stripe will release the funds from the customer's credit card and the charge object won't be valid anymore.
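
If you store the time of the authorization next to charge.id, it's easy to keep an eye on that deadline. A minimal sketch, assuming the seven-day window described above and a timestamp you record yourself (the names here are made up):

```ruby
# Seven days expressed in seconds, per the window described in this post.
AUTHORIZATION_WINDOW = 7 * 24 * 60 * 60

# The moment after which the authorization can no longer be captured.
def capture_deadline(authorized_at)
  authorized_at + AUTHORIZATION_WINDOW
end

# True while there is still time to capture the charge.
def capturable?(authorized_at, now = Time.now)
  now < capture_deadline(authorized_at)
end
```

A daily job could list the orders whose deadline falls within the next day or two so nothing expires while it's still on the workbench.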

Step 4: Ship It!

Now you're at the point where you're ready to ship. All you have to do now is purchase the shipping label and tell Stripe to capture the charge:

shipment = EasyPost::Shipment.retrieve(saved_shipment_id)
label = shipment.buy

charge = Stripe::Charge.retrieve(saved_charge_id)
charge.capture

label_url = label['label_pdf_url']
# display the url and print it

Now just print off the PDF, tape it to your box, and take the package to the UPS or FedEx store or the post office and drop it off. Wait a few weeks and search YouTube for quadcopter videos and you'll be sure to see one of your kits, gracefully flying around.

Conclusion

Stripe is awesome for processing credit cards. EasyPost is the easiest way to buy shipping known to man. Combine them and you have yourself a simple way to sell and ship physical products in the US and worldwide.

Making eBooks with Docverter

2013-06-18T12:00:00+00:00

I've been writing my guide to integrating Stripe with Rails using markdown, as with most textual projects that I work on. Every chapter is a markdown-formatted file living in a git repo, synced-on-save to my git server and S3 using SparkleShare. When I want to peek at the rendered version I use a little previewer app running on a VM on my Mac mini that I talked about previously.

A good eBook needs a PDF version, of course. A while back I wrote an open-source service named Docverter that can render XHTML to PDF using a library named Flying Saucer, among other things. All you have to do is pipe in the HTML and other related files and you get back a rendered, self-contained PDF file. There are a few non-trivial aspects to this, of course, because HTML is not primarily intended for printable output. The W3C has worked up a whole CSS module for page-related styles but it's not the most readable document. There are a few simple-ish things that you can do to your document to make it look nice, though.

Here's the simplest HTML to PDF renderer:

require 'docverter'

Docverter.base_url = 'http://c.docverter.com'

html = <<HERE
<html>
  <head>
    <title>Test Document</title>
  </head>
  <body>
    <h1>Test Header</h1>
    <p>This is some text</p>
  </body>
</html>
HERE

File.open("out.pdf", "w+") do |f|
  f.write(Docverter::Conversion.run do |c|
    c.from    = 'html'
    c.to      = 'pdf'
    c.content = html
  end)
end

The HTML document is very simple, as is the conversion. Docverter::Conversion.run takes a block which yields a Docverter::Conversion object that can be set with any options Docverter supports. Most basically, you have to specify from, to, and content. If you run this program you'll get a file named out.pdf.

Fonts

The first non-trivial thing that one would want to do is customize fonts. Flying Saucer knows quite a bit of CSS, including @font-face. All you have to do to customize fonts is to download the font as a ttf and modify the above program to look like this:

require 'docverter'

Docverter.base_url = 'http://c.docverter.com'

html = <<HERE
<html>
  <head>
    <title>Test Document</title>
    <style type="text/css">
      @font-face {
        font-family: 'Droid Sans';
        font-style: normal;
        font-weight: 400;
        src: url('droid_sans.ttf');
        -fs-pdf-font-embed: embed;
        -fs-pdf-font-encoding: Identity-H;
      }
      body {
        font-family: 'Droid Sans';
      }
    </style>
  </head>
  <body>
    <h1>Test Header</h1>
    <p>This is some text</p>
  </body>
</html>
HERE

File.open("out.pdf", "w+") do |f|
  f.write(Docverter::Conversion.run do |c|
    c.from    = 'html'
    c.to      = 'pdf'
    c.content = html

    c.add_other_file 'droid_sans.ttf'
  end)
end

A few interesting things are going on here. First, @font-face declares the font. font-family must match the name the font file specifies. Second, -fs-pdf-font-embed and -fs-pdf-font-encoding must match the values given above or the embedding won't work. src is the filename of the file, which we add to the conversion using add_other_file, which takes a path. Drop droid_sans.ttf in the same directory as the script and run it again. Notice that the PDF is now in Droid Sans, which is pretty pleasant.

Footers

Most documents longer than a page are going to need page numbers. Adding page numbers to a PDF with Docverter is very much not trivial. You need to combine the powers of CSS Paged Media running elements and generated content to generate properly formatted footers with page numbers. Here's the HTML source:

<html>
  <head>
    <title>Test Document</title>
    <style type="text/css">
      @font-face {
        font-family: 'Droid Sans';
        font-style: normal;
        font-weight: 400;
        src: url('droid_sans.ttf');
        -fs-pdf-font-embed: embed;
        -fs-pdf-font-encoding: Identity-H;
      }
      body {
        font-family: 'Droid Sans';
      }
      div.page_footer {
        display: block;
        text-align: center;
        font-family: 'Droid Sans';
        position: running(footer);
      }
      div.page_footer .page_number:after {
        content: counter(page);
      }
      @page {
        @bottom-center {
          content: element(footer);
        }
      } 
    </style>
  </head>
  <body>
    <div class="page_footer"><span class="page_number"></span></div>
    <h1>Test Header</h1>
    <p>This is some text</p>
  </body>
</html>

Three things to note here. First, the new div with class page_footer has to come before the rest of the body content because it gets moved into place as Flying Saucer renders. If it's at the bottom it won't exist when Flying Saucer tries to render the page and so nothing gets rendered at all.

Second, notice position: running(footer). CSS Paged Media introduces the running() value, which tells the renderer to stick the content that's currently selected, in this case div.page_footer, into a slot named footer. We set up the page using the @page rule. Inside there is another rule named @bottom-center, which specifies the center section at the bottom of the page. The content property with an element() value tells the renderer to take the content from the slot named footer and use it to populate the section. Note that we could have named the footer slot anything. The named slots are in their own namespace separate from ids and classes.

Finally we get to actually setting up the page number. There's a default counter named page which we put after the span with class page_number using CSS generated content.

Page Breaks

PDFs are effectively pre-printed documents, so CSS Paged Media gives you a few different facilities for controlling page breaks. For example, if I wanted a page break before every H1 element I could say this:

h1 {
  page-break-before: always;
}

If, instead, I want to break after a certain element, like a closing paragraph or something, I could do this:

p.closing {
  page-break-after: always;
}

Sometimes you have elements that you want to not break across pages if at all possible. In Mastering Modern Payments there are dozens of code examples that, if I let Flying Saucer break at the natural spots, would have one or two lines on one page and the rest of the 10-line sample on the other. CSS Paged Media lets you control that, too, with page-break-inside. Here's the rule I use:

code {
  page-break-inside: avoid;
  orphans: 0;
  widows: 0;
}

This says to avoid inserting page breaks inside a code block. The orphans option controls how many lines are allowable at the bottom of a page inside a block and widows controls how many are allowed at the top of the next. By setting both to 0 I'm saying that I don't want any page breaks at all. Flying Saucer will ignore me if it's not possible, if for instance I have a code block that spans more than an entire page.

Making nice-looking PDFs with HTML source is not trivial. It would probably end up being easier to just drop the raw text inside Apple's iBooks Author and style it that way, but I like a challenge. It's already looking pretty nice, and with a little more work I think I can have professional-grade PDF output. Docverter also knows how to do Markdown to ePub and Mobi so I should be set for book production. Now to finish writing.

Page Viewer, a Simple Markdown Viewer

2013-06-15T00:00:00+00:00

For various projects including Mastering Modern Payments I've found it really useful to be able to view the Markdown source rendered as HTML but I don't really care about editing it online. I put together a little gem named page_viewer which renders Markdown files like this:

page_viewer has some convenience features. First, it dynamically renders the Markdown files on each request, so if the files change underneath there's no cache to refresh. Second, code blocks are highlighted with Pygments; you'll need a working Python installation, but that's it. Third, it integrates with Docverter for on-the-fly PDF conversion.

It's also easy to subclass to do what you want. For example, the site where I'm developing my guide subclasses PageViewer::App to add some routes that render the whole guide as one page with a table of contents, and another one that renders the whole thing as a PDF.

I use an instance of page_viewer to view my personal wiki nicely rendered on a web page. My wiki repo is hosted on my private gitolite instance with a local clone. Each time I push changes, which happens pretty frequently with Sparkleshare, gitolite clones the repo to a directory somewhere else on the machine. This happens to be the same directory that page_viewer is pointing at.

Installation is pretty simple. Create a new project and include the page_viewer gem in the Gemfile:

source 'https://rubygems.org'

gem 'page_viewer'

Configure and run PageViewer::App inside config.ru:

require 'page_viewer'

PageViewer::App.set :page_root, '/path/to/some/markdown/files'

use Rack::Auth::Basic, "Restricted Area" do |username, password|
  username == ENV['USERNAME'] && password == ENV['PASSWORD']
end

run PageViewer::App

:page_root needs to contain the path to the markdown files you want to render and is required. Every file in that path ending in .md will be renderable.

Is this going to be useful for other people? Probably not, since it's really specific to my own needs. If you have a dynamically changing set of markdown documents you want rendered fresh all the time, though, this may be the ticket.

Book Review: The Box

2013-06-09T00:00:00+00:00

As a kid something about the Millennium Falcon always bothered me. See, everybody knows that cargo travels around the world, and presumably around the galaxy, in containers. You know, the big steel boxes that you see on TV where murders and drug deals are always happening. If Han Solo's ship is supposedly some awesome super duper fast freighter, where do the containers go? I guess on some level the fact that Solo was a smuggler sort of registered, but still you'd think that the tea and stuff (that's what smugglers move according to the school history books) would just be hidden in the containers.

After reading The Box by Marc Levinson I'm relatively certain the Falcon would be considered a breakbulk tramp freighter. "Breakbulk" meaning traditional non-containerized freight, "tramp" for the fact that Han and Chewie are tramps. Well, and they travel around without a fixed schedule or ports of call, picking up freight wherever and whenever they can. But mostly because they're tramps.

History Lessons

The Box presents a comprehensive history of the shipping container and of shipping in general, from the days of sailing ships, gallantly making their way from India to Britain for the tea trade, all the way to modern day when huge container ships carrying over fifteen thousand twenty foot long standard containers ply the seas between Asia and Europe.

One of the most interesting sections of the book for me was the exhaustive description of how, exactly, a breakbulk freighter is crammed to the gills with individual items of every imaginable description. It turns out that longshoremen (people who work the docks and load the ships) used to have to know a lot of details of how to pack a ship and what things can be crammed where in order to maximize the limited amount of storage capacity.

The Container

Malcom McLean’s fundamental insight, commonplace today but quite radical in the 1950s, was that the shipping industry’s business was moving cargo, not sailing ships.

The origin of the standardized international shipping container is almost, but not entirely, the story of a man named Malcom McLean. Before McLean containers had been tried on and off for decades, both in the United States and in Europe, on trucks and rails and ships. There had never been dedicated through-service, though, where a container could be packed at the shipper's factory and arrive at the destination without ever being opened.

A 25-ton container of coffeemakers can leave a factory in Malaysia, be loaded aboard a ship, and cover the 9,000 miles to Los Angeles in 16 days. A day later, the container is on a unit train to Chicago, where it is transferred immediately to a truck headed for Cincinnati. The 11,000-mile trip from the factory gate to the Ohio warehouse can take as little as 22 days, a rate of 500 miles per day, at a cost lower than that of a single first-class air ticket. More than likely, no one has touched the contents, or even opened the container, along the way.

The road to the standard intermodal container took quite a few twists and turns, however. Every shipper had their own standard at first, none of them compatible with each other. Eventually a committee convened by the ISO decided on dimensions and shippers slowly adopted them. The advent of the giant containerships in the 70's and 80's spelled the end of the non-standard container sizes. Originally there were two sizes, both 8 feet 6 inches high and 8 feet wide, one being 20 feet long and the other 40 feet long. Eventually a few more sizes were added, but the core standards were still the same. The key innovation that standardization added was the method by which containers are connected together to form stacks on ships, the twistlock.

One thing I was astounded by was just how fast the container changed the shipping industry. Over the course of fifty years, containerization eliminated thousands of jobs worldwide and eliminated entire ports while creating new ones where there was just a tiny dock. It opened up entire countries to new forms of trade and ruined the economies of others. The Box tells this whole compelling story, really a story of unintended consequences. Kind of like how Han Solo went from being a tramp to a general in a galaxy-spanning republic.

How and why I'm not running my own DNS

2013-06-07T00:00:00+00:00

A few months ago I posted about how I run my own DNS servers using my virtual private servers and tinydns. Well, it turns out that's not a great idea, for a few reasons. First, because if I mess up I'm entirely shut out of my servers. I tried to turn off a service on them the other day and accidentally turned off the tinydns service instead and it took me ages to get back in. Running DNS on the same machines that handle email and web hosting for almost every piece of my online presence is just way too fragile.

Second, and really more importantly, this happened:

At some point last week I started seeing a very large number of queries for various domains coming into one of my servers. There were thousands of them a minute, supposedly coming from just a handful of machines. I couldn't figure this out, and the only solution I could reasonably come up with was to turn off logging. That's totally unacceptable, and if I couldn't figure this out I probably shouldn't be running DNS for myself after all. After consulting with a few very helpful people from the Internet we determined that my server was being used as part of a reflection attack originating at a known bad actor used for various command and control botnet nastiness. It was time to switch as soon as possible. Thankfully I had the TTL for my NS records set very low, otherwise this would have been a much more painful process.

Route 53

Before I decided to go the route of self-hosting I had investigated a few different providers. The most promising of these was Amazon's Route53, with the small snag that there was no easy out of the box solution to dynamic DNS for my home machine. When it came time to switch I figured that a solution would present itself and charged forward with manually switching a few domains over.

At some point after moving my most important domains and turning the DNS service off for good on my VPSs I came up with this little script:

#!/usr/bin/python

import sys

import requests
from boto.route53.connection import Route53Connection
from boto.route53.record import ResourceRecordSets

ZONE_ID = "ZXXXXXXXX"
DOMAIN_NAME = "dynamichost.example.com."

# Ask the webservice for our current external IP; strip any stray whitespace
ip = requests.get("http://ip.example.com").text.strip()

# boto reads AWS credentials from the environment or its config file
conn = Route53Connection()

# Fetch the existing A record for the hostname
response = conn.get_all_rrsets(ZONE_ID, 'A', DOMAIN_NAME, maxitems=1)[0]
old_ip = response.resource_records[0]

# Nothing to do if the address hasn't changed
if ip == old_ip:
    sys.exit(0)

# Delete the old A record and create the new one in a single change batch
changes = ResourceRecordSets(conn, ZONE_ID)

delete_record = changes.add_change("DELETE", DOMAIN_NAME, "A", 60)
delete_record.add_value(old_ip)

create_record = changes.add_change("CREATE", DOMAIN_NAME, "A", 60)
create_record.add_value(ip)

changes.commit()

This runs every minute on a VM running on the Mac mini in my living room. It uses requests to get my external IP from a tiny webservice running on one of my servers, and if it's changed from what Route53 thinks it is, it uses boto to delete and recreate the A record. There's a bunch of public services out there that provide your external IP but I wanted to run my own. Of course.
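The compare-then-replace decision at the heart of that script can be pulled out into a small pure function, which makes it easy to check without touching the network. This is only a sketch: plan_changes and the tuple layout are names I made up for illustration and are not part of boto.

```python
def plan_changes(current_ip, recorded_ip, name, ttl=60):
    """Return the record changes needed to point `name` at `current_ip`.

    Each change is an (action, name, type, ttl, value) tuple. An empty
    list means the recorded address is already correct.
    """
    # The IP webservice may return a trailing newline, so normalize first.
    current_ip = current_ip.strip()
    if current_ip == recorded_ip:
        return []
    return [
        # Remove the stale A record, then create the replacement.
        ("DELETE", name, "A", ttl, recorded_ip),
        ("CREATE", name, "A", ttl, current_ip),
    ]


if __name__ == "__main__":
    for change in plan_changes("203.0.113.8\n", "203.0.113.7",
                               "dynamichost.example.com."):
        print(change)
```

One thing worth keeping in mind: Route 53 only accepts a DELETE whose TTL and value match the existing record, which is why the old IP and the same TTL are carried into the delete.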

Here's the code for the webservice:

run lambda do |env|
    [200, {"Content-Type" => "text/plain"}, [Rack::Request.new(env).ip]]
end

I think I learned a valuable lesson about limits with this whole "host everything all the time" exercise. Namely, that DNS is best left to the professionals, just like outgoing email. Route53 isn't the cheapest provider around but they promise 100% uptime and have a very nice easy to work with API. I very much recommend them.

Cancer

2013-06-06T00:00:00+00:00

"Thank you for calling the Bergs and the Keens. We're not home right now...". Mom hasn't changed the outgoing voicemail message yet. John's voice is still there, encouraging anyone who wants to leave a message. It's the last vestige of him, other than what we hold in our memories. The last bit of him that's still real, and even that's just an echo that will fade with time.

Note: this is a deeply personal entry. If you're here for the programming stuff you may want to skip it.

Update August 29th 2013

February 2006

A few weeks prior my stepdad John almost ran his pickup truck into the side of a semi-truck on the highway. That's how these things get noticed, you see. The months, years, of waking up at 5am and not being able to sleep because of the pounding headaches just become part of the routine, but as soon as something catastrophic happens, that's when people sit up and take notice. I must have been at school when Mom called and told me that John had to have some tests done because they thought he had a brain tumor. After reviewing the MRI his brain surgeon announced that John had a large tumor, almost the size of a golf ball, embedded in his cerebrum. He was to have it removed as soon as possible.

My sisters and I sat in the uncomfortable waiting area for 12 hours while his surgeon opened up John's skull and removed the tumor. We came back every day for two weeks while he recovered, first in a normal recovery room and then in the ICU when things turned sour for a while. The first clue was when the nurse asked him if he knew where he was and he said "I'm going to be skating soon". The winter Olympics were going on in Italy that month and he and my mom were avid fans.

Eventually, with time, he got better. The MRI scans became less frequent. They talked about having to go to Madison for the day less and less. Things went back to normal. The tumor they pulled out of his head was benign. He never quite talked the same after that, and his temper was maybe a little bit hotter than it used to be, but by and large he was back to the John we knew and loved.

February 2012

I was standing in the living room when my phone rang. It was Mom, calling me for our weekly Sunday chat. I could tell she was a little upset, so I asked her what was going on, and she says "Well, John's in the hospital. We went to the ER last night because of some back pain he's been having for a few days and they did a CT scan and found cancer in his kidney. He's having surgery tomorrow. How was your week?". She was clearly in shock. I don't really remember my reaction, but I know I was crying and couldn't talk and had to yell at her to stop talking for a second.

My sister was on a plane that afternoon from Pennsylvania and I was on the next available flight out of Portland on Monday morning. I landed in Milwaukee and drove straight to the hospital, where I found Mom and my sisters waiting in the waiting area. The surgery went fine and he was in recovery. They took out one of his kidneys but they weren't sure what was going on. They knew, or at least they had a pretty good idea, but they didn't want to say without running a bunch of tests. Again, we were at the hospital every day, sitting with him, willing him to get better. After a few days we finally got a diagnosis: stage IV metastatic renal carcinoma in his kidneys and lungs. As soon as he was healed from surgery he'd be starting chemo.

Summer 2012

John and Mom came to visit my then-girlfriend Emily and me for a few days. John was in between chemo drugs, since the first one stopped working eventually. We went up to Snoqualmie Falls for the afternoon, took some great photos, and had generally a pretty good weekend. They were supposed to take the train back from Seattle to Chicago, but John wasn't really feeling well enough to go that long so they ended up flying back.

October 31st 2012

Emily had been looking forward to handing out Halloween candy for about a week. It's a thing that they used to do in her family and she's pretty sentimental about these kind of things, so she roped me into it. She picked up two huge bags of mixed goodies a few days prior, and then at the appointed hour on the appointed day we took the camp chairs down to the front porch and set up, waiting for kids to show up.

An hour passed. None did. Not a single one.

Emily's phone rang. It was her aunt, which was pretty weird. Her aunt doesn't normally call, she just sends care packages in the mail filled with candy and bottles of BBQ sauce. Her aunt tells her that her mom is in the hospital, about to undergo emergency surgery to remove part of her colon. She's had a blockage for awhile and the doctors decided that it had to come out, right then and there. They think it's cancer.

Emily was on a plane the very next day and stayed for more than a week, one of the longest times we've been apart since we started going out in 2009. We talked on the phone every night. She came back but without any news about what kind of cancer her mom has. Eventually we find out it's stage IV metastatic colon cancer.

November 28th 2012

While Emily was visiting family I had started to develop some pain in my groin. I attributed it to over-use while she was away, but we were pretty concerned nonetheless. Emily made me promise that if the pain didn't go away, that I was to see a doctor and get it checked out. Well, the pain went away for a few days and I forgot about it, but then came back with a vengeance. I made an appointment the next morning to see a GP, which happened a few days later. He looked around, felt around, it was pretty embarrassing for both of us I think. He said "I'm pretty sure it's one of two things, neither of which is cancer, but let's get an ultrasound just to make sure." The next day I was in the ultrasound lab, and then I was back at work and trying not to freak out.

The next day I got a call from my doctor. He said that he put in a call to the urology department and they would put their best surgical oncologist on the job and that I would be taken care of. A few days later I finally got an appointment and went in. About two weeks after that I was in surgery. Mom flew out to help Emily take care of me while I recovered and stayed about a week. That's the longest she could stand to leave John, who was steadily getting worse.

December 26th 2012

My first day of chemo was the day after Christmas. About a week prior my diagnosis came back: stage IIa metastatic testicular germ cell cancer. Merry Christmas, everybody! Emily and I went in to the clinic at 8am to get familiar with the setup and for me to get my first treatment. It was... uneventful? I guess I expected it to be kind of dramatic, but really it was no big deal. My chemo schedule was one week of 5 days straight followed by two weeks of one day, repeated three times. Nine weeks. At first we were ok but by week seven I was a slug, barely able to make it into the treatment chair before falling immediately asleep. I got nauseous just getting in the car to go to the clinic. I'm proud to say I didn't throw up once the entire time. I didn't actually go the whole nine weeks. Toward the end my platelet count was too low to get treatment but my medical oncologist was adamant that skipping the last two treatments was no big deal. In March my doctors pronounced me tentatively cured.

April 19th 2013

Mom called while I was out to lunch with coworkers. She said that John was rapidly getting worse. I already had bought a ticket for May and she told me not to change it. Yet. The next day at noon she called again and said that I had to get home as quickly as I could. I changed my ticket, threw clothes in my backpack, and Emily and I were on the road to the airport in 10 minutes so I could catch the last flight of the day to Wisconsin. I didn't speed from Milwaukee to Milton, since it was dark and rainy, but I didn't dawdle either. When I got there Mom and my sisters were all there in the living room with John, watching movies and taking turns holding his hand while he lay in his hospice bed, breathing slowly. At 8am the next morning he passed with all of us around him. He didn't give up, right to the end. Mom said he had been so happy that I was better.

The funeral was that Friday. We spent the week gathering photos and figuring out the arrangements. I'm told it was a nice service, but I don't really remember much of it. Emily was able to be there for a few days which was helpful. John's ashes came back to the house that Wednesday. I won't tell you what him and Mom worked out for when she eventually passes, but I will say it's simultaneously the most romantic and the saddest thing I've ever heard.

June 4th 2013

I had my three-month follow up appointment. My blood markers were completely clear and the doctor said that my chest X-ray was so perfect she could put the scan in a medical textbook. I'm cured. Modern medicine works. Hallelujah.

At this point we don't know what's going to happen with Emily's mom but we're hoping for the best. Emily and I are getting married next year and hopefully everyone will be there, but life might get in the way. I guess we'll find out.

A very good friend of mine asked me on more than one occasion "Have you ever really thought about it?" "It" being death. At the time I didn't really have an answer for him. My mind was generally on other things back then. Girls, computers, cars, moving halfway across the country, etc.

Now, though, I can confidently say that my own mortality has crossed my mind once or twice. If there's a lesson to be drawn from my story it's this: live life how you want for as long as you can, because it may be shorter than you're expecting.

Epilogue

Emily's mom passed away peacefully in her sleep on August 24th, exactly one week after Emily and I got married in an intimate family-only ceremony.

Blog Relocation

2013-06-02T00:00:00+00:00

After a lot of thought and deliberation I've decided to retire bugsplat.info as my blog address. It's served me well for about six years, but the word "bugsplat" has recently gained some other unrelated connotations, the latter being the most unsavory. The other big reason is that at this point I would like people to associate my work with my actual name, not some other name that you would only know was related if you knew me already.

That said, any @bugsplat.info email addresses you have will continue to work. Any links to bugsplat.info will also continue to work through the magic of HTTP 301 Moved Permanently. Read on to see how I set that up, because it's kind of interesting.

HTTP has a whole bunch of different status codes. Pretty much everyone knows about 404 Not Found, of course. Another important code is 200 OK, which is what servers send back, along with the content at that address, for requests they can handle. 401 Authorization Required is another interesting one. That's what triggers a browser login box that you might see from time to time.

301 Moved Permanently (and its little brother 302 Moved Temporarily) are used to tell your browser "hey, the content that you're looking for is over at this other location". Google also uses these redirections when generating search results, which is why they're so important to get right.
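For reference, here is a tiny sketch that prints the registered reason phrase for each of the codes mentioned above, using Python's standard http module. The describe helper is just a name for this illustration. Note that the registry's wording differs a little from the informal names: 302 is listed as "Found" and 401 as "Unauthorized".

```python
from http import HTTPStatus


def describe(code):
    """Return 'code phrase' for an HTTP status code, e.g. '404 Not Found'."""
    status = HTTPStatus(code)
    return f"{status.value} {status.phrase}"


if __name__ == "__main__":
    for code in (200, 301, 302, 401, 404):
        print(describe(code))
```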

Along with changing the domain of this blog I wanted to change the URL format of blog posts. I'm pretty tired of having the date in the URL itself. It's long and redundant and kind of ugly. Some small changes to the application code took care of generating those URLs, but what to do about the redirects? I installed a new app at bugsplat.info running this code:

require 'rack/rewrite'

use Rack::Rewrite do
  r301 %r{^/payment-integration\.html}, "http://www.petekeen.net/mastering-modern-payments"
  r301 %r{^/\d{4}-\d{2}-\d{2}-(.*?)(?:\.html)?$}, "http://www.petekeen.net/$1"
  r301 %r{^(.*)\.html$}, "http://www.petekeen.net$1"
  r301 %r{^(.*)$}, "http://www.petekeen.net$1"
end
run lambda { |env| [200, {"Content-Type" => "text/plain"}, ["Hello. The time is #{Time.now}"]] }

This app uses rack-rewrite, a small Rack middleware that emulates Apache's mod_rewrite but using a Ruby DSL. There are four specific rewrite instructions here. The first one rewrites the URL for my guide to using Stripe with Rails to a URL matching its actual title, which should help for SEO purposes. The next one sends old blog posts with the date and with .html at the end to the new site, sans date and suffix. The last two rules are generic catch-alls. Oh, and that little lambda thing is just there so the middleware has something to attach to; it's never actually called.
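To make the "first matching rule wins" behavior concrete, here is a small Python sketch of the four rules as described above (date and .html suffix dropped for old posts). It is my own translation for illustration, not how rack-rewrite is implemented, and RULES and redirect_target are made-up names.

```python
import re

NEW_SITE = "http://www.petekeen.net"

# (pattern, replacement) pairs, checked top to bottom; the first match wins.
RULES = [
    # The Stripe guide gets a URL matching its title.
    (re.compile(r"^/payment-integration\.html"),
     NEW_SITE + "/mastering-modern-payments"),
    # Dated blog posts lose the date prefix and the .html suffix.
    (re.compile(r"^/\d{4}-\d{2}-\d{2}-(.*?)(?:\.html)?$"), NEW_SITE + r"/\1"),
    # Any other .html page loses the suffix.
    (re.compile(r"^(.*)\.html$"), NEW_SITE + r"\1"),
    # Everything else is passed through unchanged.
    (re.compile(r"^(.*)$"), NEW_SITE + r"\1"),
]


def redirect_target(path):
    """Return the URL a 301 response would point at for `path`."""
    for pattern, replacement in RULES:
        match = pattern.match(path)
        if match:
            return match.expand(replacement)
    return None


if __name__ == "__main__":
    for path in ("/payment-integration.html", "/2013-05-30-some-post.html",
                 "/about.html", "/archive"):
        print(path, "->", redirect_target(path))
```

Because the catch-all rule at the bottom matches every path, the order matters: the specific rules have to come first or they would never fire.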

I think I caught all of the links on the site itself, but if you notice any weirdness with this new setup, please let me know.

Design for Failure: Processing Payments with a Background Worker

2013-05-30T00:00:00+00:00

Processing payments correctly is hard. This is one of the biggest lessons I've learned while writing my various SaaS projects. Stripe does everything they can to make it easy, with quick start guides and great documentation. One thing they really don't cover in the docs is what to do if your connection with their API fails for some reason. Processing payments inside a web request is asking for trouble, and the solution is to run them using a background job.

The Problem

Let's take Stripe's example code:

Stripe.api_key = ENV['STRIPE_API_KEY']

# Get the credit card details submitted by the form
token = params[:stripeToken]

# Create the charge on Stripe's servers - this will charge the user's card
begin
  charge = Stripe::Charge.create(
    :amount => 1000, # amount in cents, again
    :currency => "usd",
    :card => token,
    :description => "payinguser@example.com"
  )
rescue Stripe::CardError => e
  # The card has been declined
end

Pretty straightforward. Using the stripeToken that stripe.js inserted into your form, create a charge object. If this fails due to a CardError, you can safely assume that the customer's card got declined. Behind the scenes, Stripe::Charge makes an HTTPS call to Stripe's API. Typically, this completes almost immediately.

But what if it doesn't? The internet between your server and Stripe's could be slow or down. DNS resolution could be failing. There's a million reasons why this code could take awhile. Browsers typically have around a one minute timeout and application servers like Unicorn usually will kill the request after 30 seconds. That's a long time to keep the user waiting just to end up at an error page.

The Solution

The solution is to put the call to Stripe::Charge.create in a background job. This example is going to use a very simple background worker system named Sucker Punch. It runs in the same process as your web request but uses Celluloid to do things in a background thread.

First, let's create a job class:

class StripeCharger
  include SuckerPunch::Worker

  def perform(event)
    ActiveRecord::Base.connection_pool.with_connection do
      token =  event[:token]
      txn = Transaction.find(event[:transaction_id])

      begin
        charge = Stripe::Charge.create(
          amount: txn.amount,
          currency: "usd",
          card: token,
          description: txn.email
        )
        txn.state = 'complete'
        txn.stripe_id = charge.id
        txn.save!
      rescue Stripe::StripeError => e
        txn.state = 'failed'
        txn.error = e.json_body
        txn.save!
      end
    end
  end
end

Again, pretty straightforward. Sucker Punch will create an instance of your job class and call #perform on it with a hash of values that you pass in to the queue, which we'll get to in a second. We look up a Transaction record, initiate the charge, and capture any errors that happen along the way.

Transaction in this case is a simple ActiveRecord object with just a few attributes, just enough to capture what Stripe gives us:

class Transaction < ActiveRecord::Base
  attr_accessible :stripe_id, :state, :amount, :error, :email
end

Sucker Punch needs to know about our job class, so let's tell it in an initializer:

SuckerPunch.config do
  queue name: :payments_queue, worker: StripeCharger, workers: 10
end

Now for the controller that ties it all together:

class TransactionsController < ApplicationController

  def create
    txn = Transaction.new(
      amount: 1000,
      email: params[:email],
      state: 'pending'
    )
    if txn.save
      SuckerPunch::Queue[:payments_queue].async.perform(
        transaction_id: txn.id,
        token: params[:stripeToken]
      )
      render json: txn.to_json
    else
      render json: {error: txn.errors.full_messages}, status: 422
    end
  end

  def show
    txn = Transaction.find_by_id(params[:id])
    raise ActionController::RoutingError.new('not found') unless txn

    render json: txn.to_json
  end
end

The create method creates a new Transaction record, setting its state to pending. It then queues the transaction to be processed by StripeCharger. The show method simply looks up the transaction and spits back some JSON. On your customer-facing page you'd do something like this:

function doPoll(id){
    $.get('/transactions/' + id, function(data) {
        if (data.state === "complete") {
          window.location = '/thankyou';
        } else if (data.state === "failed") {
          handleFailure(data);
        } else {
          setTimeout(function(){ doPoll(id); }, 500);
        }
    });
}

Your page will poll /transactions/<id> until the transaction ends in either success or failure. You'd probably want to show a spinner or something to the user while this is happening.

With this setup, you've insulated yourself from problems in your connection to Stripe, your connection to your customer, and everything in between.
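The same enqueue-then-poll flow can be sketched outside Rails. What follows is a minimal Python illustration, not the Sucker Punch implementation: a thread and a queue.Queue stand in for the worker pool, a dict stands in for the Transaction table, and charge_card, create, and show are hypothetical names invented for the example.

```python
import queue
import threading

transactions = {}      # stand-in for the Transaction table
jobs = queue.Queue()   # stand-in for the background job queue


def charge_card(token, amount):
    """Hypothetical payment call; a real one could be slow or raise."""
    if token == "tok_bad":
        raise RuntimeError("card declined")
    return "ch_123"


def worker():
    """Pull jobs off the queue and record the outcome on the transaction."""
    while True:
        txn_id, token = jobs.get()
        txn = transactions[txn_id]
        try:
            txn["charge_id"] = charge_card(token, txn["amount"])
            txn["state"] = "complete"
        except Exception as exc:
            # Any failure ends up on the record instead of in the request.
            txn["state"] = "failed"
            txn["error"] = str(exc)
        finally:
            jobs.task_done()


threading.Thread(target=worker, daemon=True).start()


def create(txn_id, token, amount=1000):
    """Request handler: save a pending record, enqueue, return right away."""
    txn = {"amount": amount, "state": "pending"}
    transactions[txn_id] = txn
    snapshot = dict(txn)  # what the client sees before the job has run
    jobs.put((txn_id, token))
    return snapshot


def show(txn_id):
    """Polling endpoint: report whatever state the worker has recorded."""
    return dict(transactions[txn_id])
```

The handler never waits on the payment call, so a slow or failing API only delays the state change that the polling client eventually sees.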

This is an excerpt from my guide Mastering Modern Payments: Using Stripe with Rails.

Book Review: Tubes: a Journey to the Center of the Internet

2013-05-25T00:00:00+00:00

Andrew Blum took his Internet service completely for granted until one day a squirrel chewed through a cable in his neighborhood, cutting off his access and sending him around the world, attempting to answer the question: What exactly is the Internet? What is it made of? Where is it, really? He documents his travels in the book Tubes: a Journey to the Center of the Internet.

Blum starts out describing a map of the internet (note: not this other map of the internet) made by a company named TeleGeography, which shows an incredibly intricate web of little lines connecting major cities around the globe. These lines are fiber optic cables, running over land and under oceans, hooking up with each other in giant data centers in places like Ashburn, VA and London, UK. Blum goes to these interconnect points and talks with the people who run them, the people who started them back in the early 1990s when the Internet was just getting going. The book concludes with a trip to Prineville, OR to see one of Facebook's newest data centers, an example of where all the data that traverses these fiber optic lines ends up.

Prior to reading Tubes I had a pretty good understanding of some of these topics. For example, Neal Stephenson (author of some of my favorite books) wrote an amazing article for Wired Magazine in 1996 where he travels around the world following the building of FLAG, a state of the art (for the time) 26,000 kilometer undersea cable. Tubes did have a lot to say about how the early days of the commercial internet came about, though, especially with regards to the early Internet exchange points in Washington DC, San Jose, and London. Also, I had no idea that there was such a huge facility in Ashburn run by a company named Equinix. Blum goes into a lot of great detail about the history of these companies and the people that run them.

Overall I think Blum does a good job communicating that the Internet really isn't a thing so much as an idea, very much made up of the people who design and build out the fiber optic links and the things they link together. The fiber is almost an afterthought, a means to an end. If you're at all interested in how the Internet came to penetrate our lives over the last two decades, you should check out Tubes.

New Blog Design

2013-05-24T00:00:00+00:00

For the past three years this blog has featured the same basic design. Header, sidebar, content, footer. Simple, classic, kind of ... homely. Today I decided to give a new design a whirl, and if you're reading this on the website you can see what's changed.

Essentially I've removed the sidebar in favor of a clean header and footer. I've also condensed several pages onto a comprehensive About page. The colors are basically the same, although I've brightened up the background a bit. The biggest change by far has been switching from some random two-column CSS layout I found somewhere on the internet to a design based on Twitter Bootstrap. I used a tool called Bootstrap ThemeRoller to customize the colors and fonts and then made a few remarkably small changes to the markup and here we are.

See any weirdness or bugs, or want to offer some criticism, constructive or not? Email me or Tweet at me.

Mounting a Magic Trackpad on a Kinesis Advantage Keyboard

2013-05-17T00:00:00+00:00

Update: The Magic Trackpad 2 fits in the same spot but you might have to use a different sticky material. See these tweets for details.

Way back in early 2009 my wrists started acting up. As a professional software developer only two years into my young career, this was super concerning. I asked around at the office and my coworkers told me I should get a Kinesis Advantage Pro keyboard, which aside from being super goofy looking, seems to have cured a lot of people of their RSI symptoms. I looked into it and eventually broke down and bought one, and then another one because the first developed some sticky keys but I didn't want to part with it to send it back for service.

Using a mouse with the Kinesis is a little bit tiring, though. It's a much larger keyboard than you might be used to, so the reach from keyboard to mouse is in kind of a weird direction and your mouse might be in a weird spot. I've been using an Apple Magic Trackpad for the past few years and after awhile I got sick of the reach and stuck it in the obvious place, the giant blank spot between the keywells of the Kinesis, like so:

It's not an exact fit, though, so I added a little shim made of two wrist rest pads stuck together:

This worked alright but as time went by the trackpad started slipping around and I had to adjust it every few seconds. I started dreaming about designing and 3D-printing a bracket to hold the trackpad in place so it wouldn't move around but wasn't permanently attached to the keyboard in case I need to replace it. That's expensive and tedious and probably not that fulfilling in the end, so I investigated a bunch of other options and finally hit upon the Heininger 1065 CommuteMate DashGrip. It's intended to be slapped onto a car dashboard so you can perch your sunglasses and keys (wait aren't they in your ignition?) and spare change and 40oz Slurpee on your dash, without worrying that they'll scatter everywhere when you take a corner too hard.

The DashGrip is basically a rectangle of sticky gel-like material that's surprisingly difficult to photograph:

In this photo I've already cut it to size and attached it to the trackpad. The other half is still in its original shipping wrapper. It's easy to cut down to size with scissors, and with it the trackpad fits snugly but very securely in its designated spot. By snug I mean very strong. Strong like ox:

The coolest part is that it doesn't leave any adhesive residue on the keyboard or trackpad. I'm not really sure how this stuff works but it seems pretty magical. Here's a side view of what it looks like put together:

There are two caveats to this otherwise great solution. First, it's not going to work for you if you hate tap-to-click. The switches for clicking are actually inside the two front feet of the trackpad, which in this setup are not really in a clicking kind of mood since they're embedded in soft sticky gel. Relatedly, if you want to use the trackpad away from the keyboard it's going to be kind of a pain, because of how super strong the bond is (see photo above). I happen to love tap-to-click, though, and don't use the trackpad away from the keyboard often enough to care.

If you can live with those two things, I suggest you try out this setup, especially if you already have a Magic Trackpad and a Kinesis. It's pretty awesome.

Distributed Personal Wiki

2013-05-10T00:00:00+00:00

For as long as I can remember I've been trying to find a good way to keep personal text notes. Recipes, notes, ideas, that kind of thing. Things that aren't really suited to blogging. Along the way I've used (and stuck with) PmWiki, DokuWiki, TiddlyWiki, and most recently I built my own sort-of-pseudo-wiki Marginalia.

Lately, though, it's been kind of a drag to use a web-based application just to write down some work notes. Having sort of an obsession with Markdown I decided to just start keeping notes in Markdown-formatted files in a directory. Of course, files that aren't backed up are likely to disappear at any moment, so I naturally stuck them in a git repository and pushed to my personal git server. But then, how do I deal with syncing my work and home machines? I guess I'll manually merge changes...

Yeah, that lasted about 10 minutes. I had a whole setup baked up that tied together a rake script and an OS X LaunchAgent that watched a directory and everything, but the merging is of course the hardest part.

I went hunting for alternatives. I even briefly considered trying out Evernote, but that didn't really meet with my self-hosting ideals. Dropbox was also a non-starter because, again, not self-hosted. Then I came across SparkleShare and my eyes lit up. SparkleShare uses git as its transport mechanism, can sync with any git repository, and automatically watches a given directory for changes.

SparkleShare is trivial to set up. Download the package, drag the app to /Applications, and run it. It'll create a directory at ~/SparkleShare and then ask you to add a hosted project. Just create a git repo somewhere (or not, it can sync to any directory that it can talk to via ssh), point SparkleShare at it, and you're done. Do this across every machine you want this share on and whenever you add, delete, or modify a file in that directory it'll get synced to all of the other machines automatically.

So, that covers the sync and backup strategy. What about the wiki part? I've been using Emacs markdown-mode for several years to get syntax highlighting when writing blog posts and such, and unbeknownst to me in a recent release Jason added Wiki links and keyboard shortcuts to follow them. I upgraded my dotfiles to the latest version of markdown-mode and then bam, wiki in a git repo.

There are two more things that I want to do that would make this system work really well. First, I want to set up Gollum on one of my servers and point it at the git repo that SparkleShare is syncing, so that I can have a web interface and pretty formatting when I want it. The nice thing is that markdown-mode and Gollum use the same syntax for wiki links.

Second, I want to replicate the send-an-email-to-create-a-note functionality that Marginalia has. I think I can do this with a tiny CGI script hooked up to Mandrill or Mailgun's incoming email processing system. All it has to do is drop the message text into the SparkleShare-synced directory with a filename based on the subject.
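Here's a rough sketch of the file-dropping half of that idea, minus the CGI and webhook plumbing. The helper names (note_filename, save_note) and the slug rules are made up for illustration, and it assumes the subject and body have already been pulled out of whatever payload the mail service posts:

```ruby
require "fileutils"

# Hypothetical helper: turn an email subject into a safe Markdown filename.
def note_filename(subject)
  slug = subject.to_s.downcase.gsub(/[^a-z0-9]+/, "-").gsub(/\A-+|-+\z/, "")
  slug = "untitled" if slug.empty?
  "#{slug}.md"
end

# Hypothetical helper: write the message body into the synced directory
# and return the path that was written.
def save_note(dir, subject, body)
  FileUtils.mkdir_p(dir)
  path = File.join(dir, note_filename(subject))
  File.write(path, body)
  path
end
```

Calling save_note with the SparkleShare directory, a subject of "Grocery List!", and the message text would write grocery-list.md there, and SparkleShare would take care of the rest.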

Increasing the Encryption Noise Floor

2013-01-27T18:06:32+00:00

Inspired by Tim Bray's recent post about encrypting his website, I decided to enable and force HTTPS for bugsplat.info. The process was straightforward and, turns out, completely free. Read on to find out how and why.

Why?

Because I think the whole web should be encrypted and I figured I should practice what I preach. Bugsplat is primarily static HTML and is completely public but that doesn't mean I can't increase the encryption noise floor, so to speak. If only the websites with secret data are encrypted then just using them is suspicious. If every site is encrypted then there's nothing to be suspicious about.

How?

Bugsplat is deployed to my RamNode VPS (notice: affiliate link) with Capistrano::Buildpack, a Capistrano add-on that allows you to deploy applications using Heroku-style buildpacks. Recently I added support for simply configuring HTTPS with a few options in your Capfile. Here's the relevant config from bugsplat.info's Capfile:

set :use_ssl, true
set :force_ssl, true
set :ssl_cert_path, '/etc/nginx/certs/bugsplat.info.crt'
set :ssl_key_path, '/etc/nginx/certs/bugsplat.info.key'

:use_ssl enables listening on port 443 with SSL and the two path options just tell nginx where to find the keys on the server, which are deployed separately with Puppet. :force_ssl adds this small snippet to the exported nginx config file which redirects plain requests to SSL:

if ($ssl_protocol = "") {
   rewrite ^ https://$server_name$request_uri? permanent;
}

As for the certificate, I ended up going with a free certificate from StartSSL. This certificate doesn't necessarily guarantee that I am who I say I am because I just had to validate an email address, but it does guarantee that the connection is encrypted which is really all I care about. At some point I plan on going through the verification steps needed to get Class 2 certificates from StartSSL, but that's for another day.

Full Text Search with Whistlepig

2013-01-09T07:51:11+00:00

Yesterday I suddenly developed the intense need to add search to this site. Among the problems with this is that the site is kind of a weird hybrid between static and dynamic, and it has no database backend. If posts were stored in Postgres this would be a trivial matter, but they're just markdown files on disk. After flailing around for awhile I came across a library named Whistlepig which purported to do in-memory full text indexing with a full query language.

November 5, 2013: I've removed search because nobody used it and this way the site can be 100% static.

First pass: Regular expressions

To rewind a bit, my first horrible stab at search was to find all of the posts that matched a user-provided regular expression:

@results = @posts.find_all do |post|
  post.body.match /\b#{user_query}\b/
end

This is of course complete madness. Not only am I allowing the user to put whatever they want in a regex but it only matches whole words. Not to mention that, while I only have around sixty pages right now, that number's never going to go down.
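For what it's worth, the "whatever they want in a regex" part is fixable even if the rest isn't. This is a minimal sketch, not what the site ever ran: Post is a stand-in struct and regex_search is a made-up name. Regexp.escape makes the query match literally, but the whole-word and linear-scan problems are still there:

```ruby
# Hypothetical stand-in for a post; only the body matters here.
Post = Struct.new(:body)

# Escape the query so regex metacharacters are matched literally,
# then scan every post (still a linear, whole-word-only search).
def regex_search(posts, user_query)
  pattern = /\b#{Regexp.escape(user_query)}\b/i
  posts.select { |post| post.body.match?(pattern) }
end
```

Searching for "tiny" with this still won't find a post that only mentions "tinydns", which is exactly the kind of thing a real index handles better.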

Whistlepig to the Rescue

Whistlepig is a small text search index. Small as in not very many features and not much code, but the features that are there are perfect for my needs:

Full query language
In-memory, in-process
Arbitrary number of indexes for the same document

Here's a full example of how to index and query a document:

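require 'rubygems'
require 'whistlepig'

document = "Hi there"

# Create an index named "index"
index = Whistlepig::Index.new "index"

# An entry holds one or more named fields for a single document
entry = Whistlepig::Entry.new
entry.add_string "body", document

docid = index.add_entry entry

# Search the "body" field; the result is a list of matching docids
query = Whistlepig::Query.new("body", "hi")
result = index.search(query)
raise "expected docid #{docid}" unless result[0] == docid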

The indexing code in bugsplat's app is not much more complicated. Here's the interesting bit:

@pages_by_docid = {}

@pages.each do |page|
  entry = Whistlepig::Entry.new

  entry.add_string "body", page.render(@strip_renderer)
  entry.add_string "name", page.name
  entry.add_string "title", page.title.downcase
  entry.add_string "tags", page.tags.join(" ").downcase
  entry.add_string "page_id", page.page_id
  entry.add_string "blog_post", page.is_blog_post? ? "yes" : "no"
  docid = @index.add_entry(entry)

  @pages_by_docid[docid] = page
end

In bugsplat a Page encapsulates everything about an entry written in Markdown. I maintain six indexes on the pages, including body rendered with a Markdown-stripping, downcasing renderer, name which is the canonical name of the post, title, tags, page_id which is a short-code type of thing, and blog_post which is a simple boolean as to whether the post has a date or not.

"Why so many indexes?" you may find yourself asking. Because instead of just implementing search and being done with it, I went and refactored the guts of the blog to use it throughout. See, I had these terrible little things everywhere, all over the place:

@page = @pages.find_all { |p| p.has_tag? params[:tag] }

Doing linear searches across the list of in-memory pages isn't too terrible but man it bugged me to have to repeat that everywhere. Instead of that, I can do nice things like this:

@tagged_pages = @pages.search(params[:tag].downcase, "tags")

Each time I found myself iterating over all of the pages to get a subset I replaced it with a search query. The code is much nicer to read and faster, although almost all of it is cached as static HTML in production.
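If you want to see the shape of the idea without installing anything, here's a toy inverted index in plain Ruby. To be clear, this is a stand-in I'm using for illustration and not Whistlepig: TinyIndex and its methods are invented, it only handles single-term queries, and it has no query language at all. It does show why a lookup by field and term beats walking every page:

```ruby
# Toy inverted index in plain Ruby; an illustration only, not Whistlepig.
class TinyIndex
  def initialize
    # field name => term => list of docids
    @postings = Hash.new { |h, field| h[field] = Hash.new { |t, term| t[term] = [] } }
    @next_docid = 0
  end

  # Index a hash of field => text and return the new docid.
  def add_entry(fields)
    docid = (@next_docid += 1)
    fields.each do |field, text|
      text.to_s.downcase.scan(/\w+/).uniq.each { |term| @postings[field.to_s][term] << docid }
    end
    docid
  end

  # Return the docids whose field contains the single term.
  def search(term, field)
    @postings[field.to_s].fetch(term.downcase, []).dup
  end
end
```

A pages collection could keep a docid-to-page hash next to one of these, the same way @pages_by_docid works above, and map search results back to pages.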

Try it out!

Go ahead and search for some stuff and let me know what you think! And next time you find yourself with a full text search problem, see if Whistlepig would help you out. It's not for everybody, but it's very good at what it does.

How I run my own DNS servers

2012-12-31T12:15:00+00:00

For the longest time I used zoneedit as my DNS provider of choice. All of my important domains were hosted there, and they never really did me wrong. A few months back I decided that I wanted to learn how DNS actually works in the real world, though. Like, what does it actually take to run my own DNS servers?

Step 0: Why would you ever do that?!

I'm mostly motivated by curiosity, but also by frustration. When something isn't going my way it just starts to make sense to do it myself. My frustration with zoneedit wasn't anything super specific. Their dynamic DNS system wasn't too terribly dynamic and adding and editing zones through their web interface got to be pretty tedious after awhile. I have a bunch of zones (32 at last count), most of which are very simple setups. bugsplat.info is way more complicated, but we'll get into that later.

Step 1: The Hardware

I decided that if I'm going to do this, I'm going to go all out. To that end, I rented two VPSs, one from RamNode (notice: affiliate link) in Atlanta and another from Prgmr in San Jose. Overall I would say that my RamNode experience has been more positive than my Prgmr experience. The network links have gone down twice in the past six months at Prgmr, which isn't the end of the world when you're running a redundant service but it's still pretty annoying. RamNode has had 100% uptime so far.

Specs on these bad boys:

prgmr (teroknor.bugsplat.info): 1 core, 1024MiB ram, 24GiB Disk, 160GiB transfer
ramnode (empoknor.bugsplat.info): 4 core, 2048MiB ram, 30GiB SSD-backed Disk, 4000GiB transfer

I'm not even close to exploiting these two machines. I'm planning on moving more and more of my apps and sites over to them, but right now they're mainly handling this site and my email and DNS.

Why two machines? To host your own DNS servers the registrars require you to list two IP addresses with the idea that you'll be providing redundant service. The one thing you don't want is downtime with DNS, because it screws everything up.

Step 2: The Software

Once you decide to go down this DNS rabbit hole there are a bunch of decisions to make on the software side. I considered PowerDNS and BIND and finally settled on tinydns managed via puppet and supply drop. Tinydns is a project started by Daniel J. Bernstein many years ago and has proven to be extremely reliable when run as intended (no axfr, configuration propagation via scp, etc). My setup is thus:

Puppet managing the config for both boxes
Supply drop deploys this configuration via Capistrano
Tinydns has a static config file checked into git controlling most of my zones
Tinydns also has a dynamic file that does my dynamic DNS updates for the home router

bugsplat.info is my oldest and thus most complicated domain. It's not even that complicated, really, it just handles a lot of stuff. My Mac mini runs a cron job every minute that ssh's into both machines and rebuilds the tinydns config file if its IP has changed. That IP is then assigned to subspace.bugsplat.info and I have a wildcard CNAME for *.bugsplat.info pointing at subspace. This lets me do things like run various services on that Mac mini with distinct hostnames, all hiding behind a common nginx. In addition, each VPS has a wildcard CNAME pointing to it from *.<hostname>.bugsplat.info which lets me set up new apps and sites easily.
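The dynamic part boils down to "if the IP changed, regenerate two lines". Here's a sketch of that logic in Ruby rather than my actual cron script. The function names and the 60 second TTL are assumptions made for the example. The line formats are standard tinydns-data: a leading + is an A record and a leading C is a CNAME.

```ruby
# Build tinydns-data lines for the dynamic host and its wildcard CNAME.
# "+" is an A record and "C" is a CNAME in tinydns-data format.
# The host, zone, and TTL defaults are illustrative assumptions.
def dynamic_records(ip, host: "subspace.bugsplat.info", zone: "bugsplat.info", ttl: 60)
  [
    "+#{host}:#{ip}:#{ttl}",
    "C*.#{zone}:#{host}:#{ttl}"
  ]
end

# Return the new file contents, or nil when the IP has not changed.
def rebuild_if_changed(previous_ip, current_ip)
  return nil if previous_ip == current_ip
  dynamic_records(current_ip).join("\n") + "\n"
end
```

A nil result means there's nothing to do. Anything else would get written to the dynamic data file on both machines before the tinydns data is rebuilt.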

Step 3: The Email

One of the other problems I had with zoneedit was their free email forwarding setup. It was slow. So slow. Slower than molasses spread onto the back of the slowest dog. Even before this whole DNS adventure started I knew I wanted to get rid of that.

Each VPS runs its own copy of my Postfix setup (also managed via puppet), which mostly just forwards incoming email into my gmail account. I don't send through it, since I haven't quite figured out all of the various DKIM and DMARC and SenderID and SPF things I need to do, and besides which Gmail won't send out through my SMTP server anyway.

Step 4: Logging

One of the more interesting aspects of this whole project has been getting a comprehensive view of everything that goes on in my little empire. The other day I set up global logging using Papertrail, a hosted logging service. It doesn't do a whole lot, mostly it just sweeps up logs from all of my services including these two VPSs and a bunch of Heroku apps, makes them searchable for a few days, and drops tarballs of them onto S3 nightly. It's given me really valuable insight into at least two things: my gmail backup wasn't working, and I get hit a lot by Chinese and Indian SSH break-in attempts. Still working on how to deal with that one, but the gmail backup is up and running.

Conclusion

So after all of that, what have I learned? Mostly that I'm a very particular person with regards to this stuff. It's fun right now but I can see it getting kind of tedious down the line. We'll find out! It's been an interesting ride thus far and I've learned quite a bit which is the most important thing.

This article was featured in Hacker Monthly Issue 35.

Want more stuff like this? DNS: The Good Parts is my collection of essays about DNS, packaged up in a beautiful, ready to print PDF. Learn how DNS works, how to run your own DNS server, and why maybe that's not the best idea after all.

I Dig DNS ($9)

Deploy 12-Factor Apps with Capistrano::Buildpack

2012-12-30T12:04:31+00:00

Last month I wrote a short article describing a method of deploying a 12-factor application to your own hardware or VPS, outside of Heroku. Today I'm happy to announce a gem named capistrano-buildpack which packages up and formalizes this deployment method.

Basically, all this does is wrap up the before and after hooks that the other article talks about, which lets you set up a Capfile that looks like this:

require 'rubygems'

set :application, "bugsplatdotinfo"
set :repository, "https://github.com/peterkeen/bugsplat.rb"
set :scm, :git
set :additional_domains, ['bugsplat.info']

role :web, "examplevps.bugsplat.info"
set :buildpack_url, "https://github.com/peterkeen/bugsplat-buildpack-ruby-simple"

set :user, "peter"
set :base_port, 6700
set :concurrency, "web=1"

set :deploy_env, {
  'LANG' => 'en_US.UTF-8',
  'PATH' => 'bin:vendor/bundle/ruby/1.9.1/bin:/usr/local/bin:/usr/bin:/bin',
  'GEM_PATH' => 'vendor/bundle/ruby/1.9.1:.',
  'RACK_ENV' => 'production',
}

load 'deploy'
require 'capistrano-buildpack'

This sets up the standard boilerplate Capistrano variables and roles, as well as :buildpack_url which controls which buildpack to clone/update, and :base_port, :concurrency, and :deploy_env which tell foreman and foreman-export-nginx what to do. When you run cap deploy with this Capfile, these steps happen:

Clone/update the buildpack
Clone/update the code repository
Apply the buildpack to the code repository
Create upstart-style init files in /etc/init and start up the app
Create an nginx config file at /etc/nginx/conf.d/.conf and restart nginx

The nginx config will set up one default hostname, <application>.<deploy host>, as well as list out the additional domains specified in the :additional_domains setting. Make sure to set up your DNS properly for these hostnames.
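The hostname rule is simple enough to sketch in a couple of lines. This isn't lifted from the gem; nginx_server_names is a name I made up to show how the default hostname and :additional_domains combine:

```ruby
# Default hostname is <application>.<deploy host>, followed by any extras.
# Hypothetical helper for illustration, not the gem's own code.
def nginx_server_names(application, deploy_host, additional_domains = [])
  (["#{application}.#{deploy_host}"] + additional_domains).uniq
end
```

With the Capfile above that works out to bugsplatdotinfo.examplevps.bugsplat.info plus bugsplat.info, so both names need DNS records pointing at the VPS.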

capistrano-buildpack defaults to deploying apps to /apps/<application>, nginx configs to /etc/nginx/conf.d, upstart files to /etc/init, and logs to /var/log/apps/<application>-*.log. Everything except the log path can be customized by setting these vars:

set :deploy_to, "/your/path/#{application}"
set :foreman_export_path, "/your/init/path"
set :nginx_export_path, "/your/nginx/conf/path"
set :foreman_export_type, "runit_or_whatever"

Right now Capistrano::Buildpack will attempt to run sudo service <application> restart when running services. This may not be appropriate for all environments. If you want to generalize this, please submit a pull request and I'll merge it.

Docverter is now Open Source

2012-11-23T21:35:19+00:00

A few months ago I created a hosted document conversion service named Docverter. The idea was to collect together the best document conversion tools I could find into one comprehensive service and sell access. Many of these tools are difficult to install if you're used to a service like Heroku, so it only made sense to wrap it all up.

After two months of trying to make a go of it, I've made the decision to open source the server component of Docverter. I've put the source on Github, along with installation instructions. I'm also available on a consulting basis for installation and integration work. Email me if you're interested.

Deploying a 12-Factor App with Capistrano

2012-11-11T13:52:53+00:00

Deploying Heroku-style 12 factor applications outside of Heroku has been an issue for lots of people. I've written several different systems that scratch this particular itch, and in this post I'll be describing a version that deploys one particular app using a Heroku-style buildpack, Foreman, and launchd on Mac OS X via Capistrano.

I've been deploying a customized version of ledger-web on my Mac mini using Dokuen for almost six months. A few nights ago, however, I tried to deploy a version and discovered my Dokuen install was completely busted. Instead of doing the correct thing and fixing my Dokuen install I wrote a completely new deployment system using Capistrano.

Essentially, this deployment uses the standard :checkout deploy strategy with hooks that clone and run a buildpack, build a .env file, and run Foreman to create launch scripts.

Dependencies

This config depends on the following on the deployment target:

Mac OS X
Ruby 1.9.3 (installed from homebrew)
the Foreman gem

Configuration

There's a bunch of config that happens at the top of the file. First, the standard config settings:

set :application, "ledger"
set :repository,  "git@git.mydomain.com:peter/ledger-app.git"
set :deploy_to, "/Users/peter/apps/ledger"
set :scm, :git

role :web, "lionel.local"
role :db,  "lionel.local", :primary => true

set :user, "peter"

These define my app, the repository, and a few other standard things. They also set my Mac mini, named lionel, to be the deployment target.

default_run_options[:pty] = true
default_run_options[:shell] = '/bin/bash'

:pty and :shell are required by several scripts that run later.

Next are settings that are used by my custom hooks:

set :base_port, 6500
set :buildpack_url, "https://github.com/peterkeen/heroku-buildpack-ruby"
set :buildpack_hash, Digest::SHA1.hexdigest(buildpack_url)
set :buildpack_path, "#{shared_path}/buildpack-#{buildpack_hash}"
set :concurrency, "web=1"
set :launchd_conf_path, "/Users/peter/Library/LaunchAgents"

These set up my buildpack, more deployment paths, etc. Of particular note are :concurrency, which controls what Foreman exports, and :base_port which is what Foreman will set as the first port for the web procfile entries.
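As I understand Foreman's convention, each Procfile entry gets a block of 100 ports starting at the base port, and each additional instance of a process gets the next port in its block. Here's that arithmetic as a sketch. port_for is my own illustrative function, and process_index is the zero-based position of the entry in the Procfile:

```ruby
# Port for one process instance, assuming Foreman's convention of giving
# each Procfile entry a block of 100 ports starting at the base port.
def port_for(base_port, process_index, instance)
  base_port + (process_index * 100) + (instance - 1)
end
```

With :base_port at 6500 and web as the first entry, the single web process lands on 6500.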

set :deploy_env, {
  'DATABASE_URL' => 'postgres://user@dbhost/database',
  'LEDGER_FILE' => '/path/to/ledger.txt',
  'LEDGER_USERNAME' => 'username',
  'LEDGER_PASSWORD' => 'password',
  'LANG' => 'en_US.UTF-8',
  'PATH' => 'bin:vendor/bundle/ruby/1.9.1/bin:/usr/local/bin:/usr/bin:/bin',
  'GEM_PATH' => 'vendor/bundle/ruby/1.9.1',
  'RACK_ENV' => 'production',
}

:deploy_env sets up a hash of environment variables that will be exported later. I don't run bin/release because I found that it will always return the same set of environment variables and I don't care about the default procfile entries or addons. If you do, feel free to parse out the results of bin/release, which returns a YAML hash.
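If you do want the buildpack's defaults, something like this would fold them in. It's a sketch under a couple of assumptions: the release output has a top-level config_vars key, and anything set explicitly in :deploy_env should win. merged_env is a made-up name.

```ruby
require "yaml"

# Merge buildpack defaults under explicit settings; explicit values win.
# Assumes the release YAML may contain a top-level "config_vars" hash.
def merged_env(release_yaml, overrides)
  release = YAML.safe_load(release_yaml) || {}
  defaults = release.fetch("config_vars", nil) || {}
  defaults.transform_values(&:to_s).merge(overrides)
end
```

You would capture the output of bin/release for the release path, pass it in along with deploy_env, and write the merged hash to .env instead.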

Hooks

So now that the setup is done, deploy happens as normal with just a few hooks. First, a before deploy hook that sets up the buildpack and build cache:

before "deploy" do
  run("[[ ! -e #{buildpack_path} ]] && git clone #{buildpack_url} #{buildpack_path}; exit 0")
  run("cd #{buildpack_path} && git fetch origin && git reset --hard origin/master")
  run("mkdir -p #{shared_path}/build_cache")
end

Next, after the normal deploy happens but before the symlink is switched, we hook in and run the buildpack:

before "deploy:finalize_update" do
  run("cd #{buildpack_path} && bin/compile #{release_path} #{shared_path}/build_cache")

  env_lines = []
  deploy_env.each do |k,v|
    env_lines << "#{k}=#{v}"
  end
  env_contents = env_lines.join("\n") + "\n"

  put(env_contents, "#{release_path}/.env")
end

This hook also writes out the environment variables we defined earlier in a way that Foreman can pick up.

Finally, we redefine the deploy:restart task to run Foreman and restart the generated LaunchAgent:

namespace :deploy do
  task :restart do
    sudo "launchctl unload -wF #{launchd_conf_path}/ledger-web-1.plist; true"
    sudo "foreman export launchd #{launchd_conf_path} -d #{release_path} -l /var/log/#{application} -a #{application} -u #{user} -p #{base_port} -c #{concurrency}"
    sudo "launchctl load -wF #{launchd_conf_path}/ledger-web-1.plist; true"
  end
end

This hardcodes the plist name that Foreman generates because it was late and I was tired. Also, sudo didn't like my initial stab at a for loop and I cut my losses. It wouldn't be too hard to write out a tiny script and execute it, though.
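A less tired version of me might have computed the plist names from :concurrency instead. This sketch assumes Foreman names its launchd files app-process-number.plist, which is a guess based on the one file I've seen it generate (ledger-web-1.plist). Both function names are invented for the example.

```ruby
# Parse a Foreman concurrency string such as "web=1,worker=2".
def parse_concurrency(concurrency)
  concurrency.split(",").map do |pair|
    name, count = pair.split("=", 2)
    [name.strip, Integer(count)]
  end
end

# List the plist files expected for an app, assuming the naming pattern
# <app>-<process>-<n>.plist (inferred from a single example).
def plist_names(application, concurrency)
  parse_concurrency(concurrency).flat_map do |name, count|
    (1..count).map { |n| "#{application}-#{name}-#{n}.plist" }
  end
end
```

The restart task could then loop over plist_names(application, concurrency) and issue one launchctl unload and load per file, which sidesteps the shell for loop that sudo didn't like.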

Nginx

Dokuen was also managing my nginx configuration for each app. I added a simple proxy definition for ledger instead:

server {
  server_name ledger.mydomain.com;
  listen 443;
  ssl on;
  location / {
    proxy_pass http://localhost:6500/;
  }
}

Result

At this point I think this is a better model than Dokuen for deploying 12 factor applications on my own hardware. There are no extra daemons to keep running, there's no extra software on the server (except Foreman), there's no weird sudo definitions.

Deploying on a cluster is a slightly different story. I would probably change this to build a tarball on an Anvil server and then distribute the tarball out to the rest of the machines instead of building on every machine, among other changes.

Run Anything on Heroku with Custom Buildpacks

2012-11-05T18:13:41+00:00

Heroku is a Platform as a Service running on top of Amazon Web Services where you can run web applications written using various frameworks and languages. One of the most distinguishing features of Heroku is the concept of Buildpacks, which are little bits of logic that let you influence Heroku as it builds your application. Buildpacks give you almost unlimited flexibility as to what you can do with Heroku's building blocks.

Hanging out in the #heroku IRC channel, I sometimes see some confusion about what buildpacks are and how they work, and this article is my attempt to explain how they work and why they're cool.

Before we tackle the specifics of a buildpack, let's talk about how Heroku works in more general terms. When you push your application to Heroku it turns your code into an executable slug, which includes your application code and all of its dependencies. For a Ruby on Rails application, this would include every gem listed in your Gemfile along with the specific version of Ruby that you want. For a Python app it includes all of the dependencies listed in requirements.txt.

Heroku also generates or adds to a file named Procfile which lists all of the executable processes that your application uses. For example, most Ruby web applications will have an entry in their Procfile that looks like this:

web: bundle exec rackup -p $PORT

When you scale your application Heroku starts up little Linux virtual machines named dynos. Each dyno corresponds to a particular slug and a particular set of environment variables set using heroku config:add, along with a single entry from your application's Procfile.
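A Procfile is just lines of name: command, so picking the single entry for a dyno is a lookup. Here's a minimal parser sketch. parse_procfile is an illustrative name and the name pattern it accepts is my own conservative guess, not Heroku's actual implementation:

```ruby
# Parse Procfile text into a hash of process name => command.
# Lines that don't look like "name: command" (blank lines, comments) are skipped.
def parse_procfile(text)
  text.each_line.each_with_object({}) do |line, entries|
    match = line.match(/\A([A-Za-z0-9_-]+):\s*(.+?)\s*\z/)
    entries[match[1]] = match[2] if match
  end
end
```

Looking up "web" in the result for the Procfile above gives back the bundle exec rackup command that a web dyno would run.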

What is a Buildpack?

Heroku turns your application code into a slug using a Buildpack which consists of a small set of executable scripts. We're going to use heroku-buildpack-hello (github) as a simple example.

Stage 0: Buildpack Clone

The very first thing the slug compiler does is download your custom buildpack if you have one. You can set a custom buildpack at app creation time like this:

$ heroku create --buildpack=http://github.com/you/your-buildpack.git

After application creation you can set a custom buildpack or switch to a different one by setting the BUILDPACK_URL configuration value:

$ heroku config:add BUILDPACK_URL=http://github.com/you/some-other-buildpack.git

If you haven't set a custom buildpack, Heroku uses their standard set of buildpacks covering a wide variety of different language runtimes and frameworks.

Stage 1: Detect

Heroku runs bin/detect from each candidate buildpack, passing in the path to a temporary directory containing your application code. The first one that returns successfully (i.e. exit 0 in bash) determines the buildpack to use in the next few stages. Here's heroku-buildpack-hello's detect:

#!/bin/sh

# this pack is valid for apps with a hello.txt in the root
if [ -f $1/hello.txt ]; then
  echo "HelloFramework"
  exit 0
else
  exit 1
fi

The if statement looks for a specific file named hello.txt in the root directory of your app passed to detect as the first argument (in bash that's $1).

Whatever bin/detect prints to STDOUT is used as the runtime label in the slug compiler output. In this case, detect prints HelloFramework which will result in this output:

-----> HelloFramework app detected
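The "first one that returns successfully wins" rule is easy to mimic locally if you want to test a buildpack before pushing. This is a sketch of the rule as described above, not Heroku's slug compiler. detect_buildpack is a made-up name and it assumes each candidate directory has an executable bin/detect:

```ruby
require "open3"

# Run bin/detect from each candidate buildpack directory, in order.
# Returns [buildpack_dir, label] for the first one that exits 0, else nil.
def detect_buildpack(buildpack_dirs, app_dir)
  buildpack_dirs.each do |dir|
    label, status = Open3.capture2(File.join(dir, "bin", "detect"), app_dir)
    return [dir, label.strip] if status.success?
  end
  nil
end
```

The label is whatever the winning script printed to STDOUT, which is what ends up in the "app detected" line.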

Stage 2: Compile

The slug compiler next runs bin/compile passing in the path to your application code as well as a path to a directory the compiler can use as a build cache. Here's heroku-buildpack-hello's compile script:

#!/bin/sh

indent() {
  sed -u 's/^/       /'
}

echo "-----> Found a hello.txt"

# if hello.txt is empty, abort the build
if [ ! -s $1/hello.txt ]; then
  echo "hello.txt was empty" | indent
  exit 1
fi

# replace hello with goodbye in a new file
cat $1/hello.txt | sed -e "s/[Hh]ello/Goodbye/g" > $1/goodbye.txt

Here we find a simple indent() function that indents output by seven spaces, matching the width of the -----> prefix, as recommended by the Heroku docs. Next, it prints out a log line that basically says everything is working as expected. It then tests to see if hello.txt is empty or not and aborts if it is. Finally it does the only real "compilation" step in this buildpack, which replaces Hello with Goodbye.

Stage 3: Release

After the compilation step is done Heroku runs a script named bin/release. This takes the path to your application code as an argument and prints YAML to STDOUT describing default values for config variables and default Procfile entries. release can also specify default addons that your application should receive. For example, most release scripts will specify that the application will get a database instance by default. Here's heroku-buildpack-hello's release:

#!/bin/sh

cat << EOF
---
addons:
  - shared-database:5mb
config_vars:
  PATH: bin:/usr/bin:/bin
default_process_types:
  hello: cat hello.txt
EOF

Notice that it specifies we should get a small database instance, that our application should receive a default PATH environment variable, as well as a default process named hello that just prints out the contents of hello.txt.

Why is this cool?

Buildpacks are cool because you can do whatever you want in the compile step. Want to statically compile some pages in your app? Want to run an application with some parts written in Python and some in Haskell? Want to check in binaries and run them? All of this is possible. In addition to the default buildpacks here are some of the more interesting custom ones I've run across:

heroku-buildpack-multi: Run multiple buildpacks on your application
heroku-buildpack-ruby-jekyll: Build a static Jekyll site at compile time
heroku-buildpack-static: Run an Apache webserver serving static HTML from a public directory.
heroku-buildpack-testrunner: A unit-testing framework for buildpacks

There's a big list of third-party buildpacks on Devcenter which I encourage you to check out.

A Real Example: Vendoring Binaries

For Docverter I've needed to include some 3rd party software that isn't packaged. For the first version I just included the binaries in my git repo, but that's pretty lame. Let's make a buildpack that pulls tarballs off of S3 and extracts them into the app directory.

First, the detect script:

#!/bin/bash

if [ -f $1/.vendor_urls ]; then
    echo "VendorBinaries"
    exit 0
else
    exit 1
fi

This script just looks for .vendor_urls in your app's root directory. Now, the compile script:

#!/bin/bash

# Indent output so it lines up under the "----->" headers
indent() {
  sed -u 's/^/       /'
}

echo "-----> Found a .vendor_urls file"

# Bail early but noisily
if [ ! -s "$1/.vendor_urls" ]; then
  echo ".vendor_urls empty. Skipping." | indent
  exit 0
fi

cd "$1"

# Fetch each tarball and unpack it into the app directory
while read -r url; do
  echo "Vendoring $url" | indent
  curl -s "$url" | tar xz
done < .vendor_urls

From the top, this has the same indent() function as the compile from heroku-buildpack-hello. Then it checks that the .vendor_urls file is not empty and loops over the contents. Each line is fetched with curl and piped through tar, which unpacks it into the app directory.
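The detect test and the bail-early check can be tried locally without pushing anything to Heroku. This is a minimal sketch that wraps both checks in functions (the function names detect and has_urls are mine, for illustration) and runs them against a scratch directory:

```shell
#!/bin/sh
# detect: the same test as the detect script, wrapped in a function.
detect() {
  if [ -f "$1/.vendor_urls" ]; then
    echo "VendorBinaries"
    return 0
  fi
  return 1
}

# has_urls: the compile script's bail-early check (-s is true for a non-empty file).
has_urls() {
  [ -s "$1/.vendor_urls" ]
}

app=$(mktemp -d)

# No .vendor_urls at all: the buildpack should not apply.
detect "$app" || echo "no .vendor_urls: buildpack does not apply"

# Empty file: detect succeeds, but compile would skip.
: > "$app/.vendor_urls"
detect "$app"
has_urls "$app" || echo ".vendor_urls empty. Skipping."

# One URL listed: compile would go on to fetch it.
echo "https://s3.amazonaws.com/my-bucket/pandoc.tar.gz" > "$app/.vendor_urls"
has_urls "$app" && echo "would vendor: $(cat "$app/.vendor_urls")"

rm -rf "$app"
```

Nothing is downloaded here; the last step only prints the URL that the real compile script would hand to curl.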

Finally, the release script is very simple, just returning an empty YAML hash:

#!/bin/sh
echo "--- {}"

In my project's root directory I've created two files, .buildpacks which contains the list of buildpacks:

https://github.com/peterkeen/heroku-buildpack-vendorbinaries.git
https://github.com/heroku/heroku-buildpack-ruby.git

and a .vendor_urls file containing the list of binaries to vendor:

https://s3.amazonaws.com/my-bucket/pandoc.tar.gz
https://s3.amazonaws.com/my-bucket/calibre.tar.gz

I've created this buildpack and put it on Github for you to use. This is just one example of the infinite variety of things you can do, so go forth and experiment!

Private Git Repositories with Gitolite and S3

2012-10-27T15:59:41+00:00

Earlier this year I bought a new Mac mini for various reasons. One of the big ones was so I would have a place to stash private git repositories that I didn't want to host on 3rd party services like Github or Bitbucket. This post describes how I set up Gitolite and my own hook scripts, including how I mirror my git repos on S3 using JGit.

Step 1: Install Gitolite

Gitolite is a system for managing git repositories using git itself to manage the configuration. Essentially, after initial configuration you make all changes by editing a config file, committing it, and pushing up to your git server.

Gitolite installation is pretty straightforward:

Install git on your server. On a Mac the easiest way is to use Homebrew. On Ubuntu or Debian you would apt-get install git-core. Redhat systems are similar.
Create a user named git
Login as the git user
Remove any existing authorized_keys file
Put your public key in a file named your_name.pub. Mine is called pete.pub.
Ensure ~/bin is in your PATH

Run these commands in the git user's home directory:

 $ git clone git://github.com/sitaramc/gitolite
 $ mkdir -p $HOME/bin
 $ gitolite/install -to $HOME/bin
 $ gitolite setup -pk your_name.pub

Move to your workstation and run this command:

 $ git clone git@your_server:gitolite-admin

If everything has gone well you'll be able to clone that repo without being asked for a password. There are a ton of things you can do with Gitolite and I don't have room to get into it here. Check out the README on Github and the extensive documentation for more instructions and details on how exactly Gitolite processes things.

Step 2: Write some hooks

Well, not really. You can use mine if you want, if they suit your goals. I wanted to hit a few different areas with my git server:

Simple, flexible mirroring and backups

One of the big things that I was paying Github for was to serve as an off-site repository backup. If for some reason I lost my laptop, my various projects and businesses would be safe because the code was also at Github. But what if I could just push to S3? Amazon S3 is far more cost effective if all you need is a place to shove files, and it so happens that the JGit project lets you use an S3 bucket as a remote.

Simple to use pre- and post-receive hooks

My hook scripts let you set up a pre- or post-receive hook directly in your Gitolite config with an optional branch filter regex.

Local clones

One of the other things I do with my Mac mini is run a customized reporting application on top of my Ledger file, which contains close to six years of personal finance data. The reporting application runs on top of a postgresql database which I load with the combination of a local clone of my finances repo and a post-receive hook that starts the dump and load.

Hook Installation

So those are my hooks. If you want to use them, clone the repo onto your server and copy or symlink pre-receive and post-receive into $GITUSER_HOME/.gitolite/hooks/common/ and jgit into $GITUSER_HOME/bin.
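The symlink route might look roughly like this. It's a sketch, not an installer that ships with the hooks: the install_hooks function is hypothetical, and the demo runs against scratch directories so nothing real is touched.

```shell
#!/bin/sh
# install_hooks: symlink the two hook scripts and jgit into place.
# $1 = path to a clone of the hooks repo, $2 = the git user's home directory
install_hooks() {
  mkdir -p "$2/.gitolite/hooks/common" "$2/bin"
  ln -sf "$1/pre-receive"  "$2/.gitolite/hooks/common/pre-receive"
  ln -sf "$1/post-receive" "$2/.gitolite/hooks/common/post-receive"
  ln -sf "$1/jgit"         "$2/bin/jgit"
}

# Demo: stand-ins for the hooks clone and the git user's home.
hooks_clone=$(mktemp -d)
git_home=$(mktemp -d)
touch "$hooks_clone/pre-receive" "$hooks_clone/post-receive" "$hooks_clone/jgit"

install_hooks "$hooks_clone" "$git_home"
ls -l "$git_home/.gitolite/hooks/common" "$git_home/bin"
```

On a real server you would pass the actual clone path and the git user's home directory instead of the two temporary directories.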

You'll also need to modify your $GITUSER_HOME/.gitolite.rc file slightly. Add these lines somewhere toward the top of the %RC hash:

GIT_CONFIG_KEYS => '.*',
AUTH_OPTIONS => 'no-port-forwarding,no-X11-forwarding,no-pty',

The first line allows the config file to contain any git config options you want. The second removes the default agent forwarding restriction to allow you to push to remote repos using the mirrors configuration described below. If you aren't going to be using my script's mirrors you don't need to add that line.

If you want to push to S3 buckets, you'll need to create a file named .jgit in the git user's home directory with these contents:

accesskey: YOUR-AWS-ACCESS-KEY
secretkey: YOUR-AWS-SECRET-KEY

S3 mirror URLs follow the format amazon-s3://<filename>@<s3-bucket-name>/<repo_name>.git. See below for an example.

Step 3: Profit

Here's my gitolite config after installing my hooks:

repo @all
    config mirrors.s3 = "amazon-s3://.jgit@my-s3-bucket/%GL_REPO"

repo gitolite-admin
    RW+     =   peter

repo CREATOR/[a-zA-Z0-9].*
    C = @all
    RW+ = CREATOR
    RW = WRITERS
    R = READERS gitweb

repo apps/[a-zA-Z0-9].*
    C                   = @all
    RW+                 = CREATOR
    config hooks.pre    = '/usr/local/var/dokuen/bin/dokuen-deploy'

repo financials-master
    RW+ = peter
    config hooks.clone.path = "/usr/local/var/repos/financials"
    config hooks.post = "sudo -u peter /usr/local/var/dokuen/bin/dokuen run_command rake load --application=ledger"

repo peter/git-hooks
    config mirrors.github = "git@github.com:peterkeen/git-hooks.git"

repo peter/bugsplat
    config mirrors.github = "git@github.com:peterkeen/bugsplat.rb"
    config mirrors.heroku = "git@heroku.com:bugsplat.git"

At the top, every repo gets transparently mirrored to my S3 bucket. %GL_REPO gets replaced with the actual path of the repo. After some boilerplate about the gitolite-admin repo comes the meat of the config. I use a gitolite feature called Wild Repos which will automatically create a repo matching the pattern (in this case CREATOR/[a-zA-Z0-9].*) the first time I push to it. The apps entry is the exact same idea with the addition of a pre-receive hook that fires off my Dokuen deploy script.
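To get a feel for which repo names a wild repo pattern would accept, here's a small sketch that substitutes a username for CREATOR and tests a name against the result. I'm assuming the pattern has to match the whole repo name; the matches_wild_repo function is made up for illustration and is not how Gitolite itself is implemented.

```shell
#!/bin/sh
# matches_wild_repo: does a repo name match a wild repo pattern for a given user?
# $1 = pattern from the config, $2 = user doing the push, $3 = repo name
# (Usernames containing "/" would break the sed expression; fine for a sketch.)
matches_wild_repo() {
  pattern=$(printf '%s' "$1" | sed "s/CREATOR/$2/")
  printf '%s\n' "$3" | grep -Eq "^${pattern}\$"
}

matches_wild_repo 'CREATOR/[a-zA-Z0-9].*' peter peter/bugsplat \
  && echo "peter/bugsplat matches for peter"

matches_wild_repo 'CREATOR/[a-zA-Z0-9].*' peter andrew/secret \
  || echo "andrew/secret does not match for peter"
```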

I described the financials-master repo earlier. After that is some additional config for a few auto-created repos. Gitolite stacks your configurations together, which is what lets me get away with only specifying the mirror config. Everything else is in the wild repo definition.

Conclusion

Running a private git server probably isn't for everyone but for me, it allows me to have a huge amount of flexibility in how I set up my repos. It's also been basically maintenance free with the exception of some small config changes here and there.

On-the-fly Markdown Conversion to PDF and Docx

2012-10-20T12:21:33+00:00

Today I added PDF, Docx, and Markdown download links to the bottom of every post here on Bugsplat. Scroll down to the bottom to see them, then scroll back up here to read how it works.

For the past few weeks I've been working on a product named Docverter, which does on-the-fly plain text to rich text formatting in a variety of formats with a simple HTTP API. I write entries on this blog in Markdown, which makes it a natural candidate for these types of conversions. Simplified, the code boils down to this:

Docverter.api_key = "<API-KEY>"

result = Docverter::Conversion.run do |c|
  c.from     = 'markdown'
  c.to       = 'pdf'
  c.content  = 'page content'
  c.template = 'template_filename.html'

  c.add_other_file 'template_filename.html'
end

result is the string of the converted PDF or Docx from Docverter. template_filename.html is a simple HTML template that Docverter plugs the HTML that results from the Markdown into before sending it to the HTML to PDF converter. The Docx conversion code is very similar but uses a different template and to format. All of the options are documented in the API docs, but that's basically all you need to get started converting Markdown to PDF.

Just for kicks I added the Markdown download so it's easy to see what exactly bugsplat uses as input. The icon comes from the Markdown-mark project.

Keeping a Programming Journal with Marginalia

2012-09-08T08:06:51+00:00

In addition to writing on this blog, I've been keeping notes for various things on Marginalia, my web-based note taking and journaling app. In my previous post I talked about the why and how of Marginalia itself. In this post I'd like to talk more about what I actually use it for day to day, in particular to keep programming journals.

Update 2013-10-19: Marginalia is shut down and open source on GitHub

Programming Journals

I've been keeping programming journals in Marginalia since the beginning, both for work and home. I've found having a consistent place to write out my thoughts on whatever I'm working on to be really valuable, both in the moment and looking back.

At work we've used various story and issue tracking systems with more or less decent integration with our source code repository. Currently we use Pivotal Tracker along with a read-only mirror of our code on Github. Commits referencing stories get appended to the story in Tracker, which is nice for single stories. If I'm doing a bunch of stuff in one day, though, it's impossible to pull that together to present at daily standup.

To get around this, I write down little snippets of what I'm working on in Marginalia using the "append" feature. Appending to a note or journal is a single API call to POST /notes/:id/append. Of course, I don't want to be driving Marginalia with curl all the time, so I put together a little Ruby API and example command line program and pushed it to rubygems as marginalia-io (github). Appending with it is really simple. I can either say something like this:

$ marginalia append 139 I did something just now

which would append a timestamp and the text "I did something just now" to note 139. I could also do this:

$ marginalia append 139

which would pop up my editor and let me type a longer form entry. I use this for appending things like SQL queries or longer rants.

Automatic Entries

In addition to manually adding things that I'm working on, I've added calls to marginalia append to various interesting places in my development tools. For example, I have a tool named git-qa that does a few interesting things including pushing to various git remotes and deploying to staging servers. I added a simple marginalia append to the bottom of this script with the branch that I pushed to QA. Thus, I have an automatic record of what I pushed when that I can look back at, even if I didn't bother to write any actual notes about it. Adding these automatic entries to my tools makes pulling together my daily standup a breeze.
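As a rough sketch (this is not the actual git-qa code), the tail end of such a script could look like the following. The note id, the message wording, and the MARGINALIA_CMD override are all assumptions for illustration; only the marginalia append command itself comes from the examples above.

```shell
#!/bin/sh
# log_qa_push: record which branch was just pushed to QA.
# $1 = note id, $2 = branch name
# MARGINALIA_CMD is a hypothetical override so the call can be dry-run.
log_qa_push() {
  ${MARGINALIA_CMD:-marginalia} append "$1" "Pushed $2 to QA"
}

# Dry run: swap in echo for the real client to see the arguments it would get.
MARGINALIA_CMD=echo
log_qa_push 139 my-feature-branch
```

In a real script you'd leave MARGINALIA_CMD unset and pass the current branch, for example from git rev-parse --abbrev-ref HEAD.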

Project-specific Journals

My work and home programming journals are one big use of Marginalia. The other thing I use it for quite frequently is to make project-specific journals and todo lists. For example, my todo list for Marginalia itself is over 5KB of text and has over 110 versions. I have a journal for my on-going Dokuen rewrite that has 24 versions and is almost 10KB of text.

Conclusion

To check out Marginalia, just go to http://www.marginalia.io and click the "Try for Free" button. If you register your email address and a password with the free trial you can use the API and command line tool as well. Give it a shot, I think you'll like it.

Marginalia: A web-based journaling and note taking tool

2012-09-03T17:12:47+00:00

I'd like to present my new webapp, Marginalia, a web based journaling and note taking tool. Notes are written in Markdown, and there are some simple shortcuts for appending timestamped entries at the end of a note, as well as a few email-based tools for creating and appending to notes. You should check it out. Look below the fold for technical details and the origin story.

Update 2013-10-19: Marginalia is shut down and open source on GitHub

Origin

For a very long time I've had the ridiculous problem of too many ideas. Basically, I would get an idea for something, be it a new app or a tiny implementation detail for work or something. This idea would circle around and around my head for hours, sometimes days, until every single detail was worked out seventeen different ways. Then, satisfied, I would promptly forget the entire thing when some other idea jumped out of nowhere.

Sometimes I would write these ideas down somewhere. That was great! I could fully hash out whatever it was on paper or in a random file somewhere. Maybe that file would even be in a directory with some code. Except, I would never remember where these papers or files ended up. Still problematic. Over the years I tried various different systems but none of them ever stuck. They were just more silos for me to forget about. The only system that sort of stuck was to email notes to myself.

In January I finally decided to write something that would fit how my brain works instead of trying to change my brain. The result is a Ruby on Rails app named Marginalia (to be completely honest, until Saturday it was creatively named "notes").

Technical Details

Marginalia is a Ruby on Rails app running on Heroku and Heroku's PostgreSQL database, along with a few addons and libraries:

MemCachier for caching
New Relic for error and performance tracking
A/Bingo for a/b testing
Stripe for credit card processing
Mailgun for email processing

I've been living in Marginalia for the last eight months and it's been a huge boon to my creativity and memory. I can flesh out ideas using whatever method I want and actually find it later on. If you want to try it out, just go to the home page and click the "Try for Free" button.

Task-oriented Dotfiles

2012-08-11T20:38:36+00:00

Recently I sat down and reorganized my dotfiles around the tasks that I do day-to-day. For example, I have bits of configuration related to ledger and some other bits related to Ruby development. In my previous dotfile setup, this stuff was all mixed together in the same files. I had started to use site-specific profiles (i.e. home vs work), but that led to a lot of copied config splattered all over. I wanted my dotfiles more organized and modifiable than that.

I borrowed the basic ideas from Zach Holman, who borrowed them from Ryan Bates. In fact, I stole their Rakefile and only made a few minor additions. Essentially, each "module", where a module is a unit of config related to a single task, has its own directory in my dotfiles repository. This directory can contain any number of files with names like foo.symlink. This will get symlinked to ~/.foo. In addition, each module can contain an init.sh and an init.el file. These get loaded by bash and emacs, respectively, at runtime. The emacs initialization code contains a bunch of clever things that allow me to require external emacs packages using el-get, as well as run code before and after el-get packages get initialized. The bash initialization code contains no such cleverness (yet). Each module can also contain a bin directory, which will get added to $PATH.

So, this is great, but what if I don't want to load my ledger configuration on my work computer? Or what if I have some work-specific module that I don't want to be loaded at home? That's where the ~/.modules file comes in. This file lists the modules that bash and emacs will load, in order. This file is not checked in, because it can and will be different between machines.
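The bash side of that loading step could be sketched like this. It's an illustration of the idea, not my actual init code: the load_modules function, the scratch directory, and the demo variable names are all made up, and the modules file here stands in for ~/.modules.

```shell
#!/bin/sh
# load_modules: for each module named in the list file, put its bin directory
# on PATH and source its init.sh, in the order listed.
# $1 = dotfiles checkout, $2 = modules list file (one module name per line)
load_modules() {
  while read -r module; do
    if [ -z "$module" ]; then
      continue
    fi
    if [ -d "$1/$module/bin" ]; then
      PATH="$1/$module/bin:$PATH"
    fi
    if [ -f "$1/$module/init.sh" ]; then
      . "$1/$module/init.sh"
    fi
  done < "$2"
  export PATH
}

# Demo: two modules exist, but only "ledger" is listed, so only it gets loaded.
dotfiles=$(mktemp -d)
mkdir -p "$dotfiles/ledger/bin" "$dotfiles/work"
echo 'LEDGER_DEMO_LOADED=yes' > "$dotfiles/ledger/init.sh"
echo 'WORK_DEMO_LOADED=yes' > "$dotfiles/work/init.sh"
printf 'ledger\n' > "$dotfiles/modules"

load_modules "$dotfiles" "$dotfiles/modules"
echo "ledger loaded: ${LEDGER_DEMO_LOADED:-no}, work loaded: ${WORK_DEMO_LOADED:-no}"
```

Because the list lives outside the repository, the same checkout can load a different set of modules on each machine.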

One other interesting thing I've done is set up an auto-update system. I have cron set to run a git fetch every minute or so, and then I have my bash prompt set to inform me if there are updates available or if the dotfiles repo is dirty. I don't have a one-button command to apply the updates yet, but it's something I'm considering.
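The prompt check can be approximated with two git commands. This is a sketch under assumptions rather than my real prompt code: the dotfiles_status function name and the marker words are invented, and it relies on cron having already run git fetch so the upstream ref is current.

```shell
#!/bin/sh
# dotfiles_status: print a short marker describing the state of a checkout.
# $1 = path to the dotfiles repository
# Prints "dirty" for uncommitted changes, "updates" when the fetched upstream
# is ahead of HEAD, both words when both apply, and an empty line otherwise.
dotfiles_status() {
  status=""
  if [ -n "$(git -C "$1" status --porcelain 2>/dev/null)" ]; then
    status="dirty"
  fi
  # Count commits on the upstream branch that HEAD does not have yet.
  behind=$(git -C "$1" rev-list --count 'HEAD..@{upstream}' 2>/dev/null || echo 0)
  if [ "${behind:-0}" -gt 0 ]; then
    status="${status:+$status }updates"
  fi
  printf '%s\n' "$status"
}

# Hypothetical use in a bash prompt (the path is an example):
#   PS1='$(dotfiles_status "$HOME/dotfiles") \$ '
```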

I owe Zach Holman a lot of credit here, but I think I've improved upon the initial design, at least for my needs, with the explicit modules list and the $PATH manipulation. I expect that my particular implementation won't be very useful for anyone else, but if you'd like to use it for inspiration, the Rakefile would probably be the easiest thing to copy.

Dokuen 0.0.8, Now with Linux Support

2012-05-29T19:00:02+00:00

When I released Dokuen last week I had no idea it would get as much press as it did. I'm excited that so many people want to give it a shot. To that end, ~~v0.0.6~~ ~~v0.0.7~~ v0.0.8 has rudimentary Ubuntu support, along with revised Mac support. See below for the changes.

Here's the list of changes:

Process Management

The first version of Dokuen used a LaunchDaemon to start up an instance of foreman for each application. This was fine but didn't scale very far. This new version manages processes itself, using foreman more as a library. Each process becomes its own daemon, launched by dokuen boot, dokuen scale, or dokuen deploy.

Port Management

Dokuen will now manage your app's ports for you, so you don't have to worry about it. If you're not using a wildcard CNAME you'll need to put entries in your hosts file for each app.

Revised Mac Install

There's no longer a need for a custom LaunchDaemon per app, just one global one that launches all of the application daemons.

Linux Support

I've included a rudimentary Ubuntu upstart script. All it does is run dokuen boot, just like on Mac. If you're not using Ubuntu, feel free to write up an init script and submit a pull request on github.

Questions? Comments? Email me. I'm also loitering in #dokuen on freenode if you feel like chatting.

Dokuen Update

2012-05-20T15:50:09+00:00

After writing my last blog post I started to build out another app using Dokuen and the pain really got to me. I've addressed the caveats that I listed at the bottom of that article and I think Dokuen is ready for tryouts. I wouldn't put anything mission critical on it, but that's not really what it's for anyway.

Please go try it out and let me know what you think. There are still some rough edges but it's entirely usable as is.

Dokuen, a Personal App Platform

2012-05-17T16:29:30+00:00

Dokuen (Japanese for "solo performance") is an amalgamation of open source components that I mashed together so I could run Heroku-style services on my shiny new Mac mini while retaining the paradigm of git push deployments and environment variables for configuration. Effectively, I wanted to be able to seamlessly deploy 12 factor applications in my local environment.

Update: I've rewritten Dokuen and released it as a gem. See this article for details.

Update 2: I've added linux support.

The whole idea started when I got a new mini and wanted to exploit it as much as possible. It's so low power and it's mostly just sitting around doing nothing, so it might as well run some interesting things. For example, I have a personal note-taking app that is currently running on Heroku, but storing that kind of data on a 3rd party server kind of makes me nervous. I have another app that contains all of my finances that I wouldn't ever want to live on another server, but up until now there was nowhere to put it other than my laptop.

Heroku is super cool, though, and David Dollar has extracted a lot of very interesting things from the Cedar platform that I've been itching to try. Dokuen is thus a learning lark wrapped in a good excuse. The best combination.

Components

Dokuen breaks down into two piles, platform and application. The platform consists of:

Gitolite is the core of the whole system. In this application I'm using it for its simple repo creation and configuration, as well as the ability to stick an arbitrary git hook in every repo.

Mason and Foreman are the two Heroku projects that I'm using. Mason consumes an application clone and one or more buildpacks and produces an application instance that can be run using Foreman. envdir, from the daemontools package, manages environment variables. Nginx proxies from a CNAME subdomain to the actual running application.

Applications are actually launched in a slightly round-about way. The pre-receive hook generates a launchd plist file and drops it in /Library/LaunchDaemons and then unloads/reloads it. This config file just runs foreman with the configured environment and concurrency settings. Foreman has the capability to generate these types of configs, but it felt more natural to use it to run the code directly.

The application side of things consists of:

Buildpacks are a really neat concept for a platform. They consist of a trio of scripts: detect says if the buildpack applies to the application in question, compile builds a runnable application instance (compiles, installs gems, runs setup.py, whatever), and release returns metadata about the application that the platform needs. I'm exploiting the sweat and tears that the Heroku devs obviously poured into these buildpacks for my own selfish needs in Dokuen, and they fit very very well.

astrails-safe is a small backup system that knows how to talk to both S3 and PostgreSQL, putting dumps from the latter into the former. I've cron'd it to run nightly.

I'm using jgit because it's been blessed with the ability to push and pull git repos to S3, which means it's convenient to use for git backups. Every repo in my gitolite install gets a post-receive hook that provides a world-class backup for pennies a month.

PostgreSQL doesn't really need explanation. It's awesome. Use it.

Caveats

There are some obvious flaws in this thing I put together in a day (shocker) that I'll be working on rectifying. First of all, way too much information is in the git config. I thought this was very clever at the time but it turns out it's not very flexible at all. Second, the code is terrible. I think the approach is fundamentally sound, but it's one big monolithic script right now. Third, there are a lot of places where things are hard-coded for my setup, especially around the Nginx and LaunchDaemon configs.

Dokuen is extremely rough right now. If I were you I wouldn't try to use it directly. The scripts that tie everything together are on github if you want to take a look, though, and perhaps derive some inspiration.

A Robust Reporting System for Ledger

2012-01-01T10:01:14+00:00

Note: you can find much more information about ledger on ledger-cli.org, including links to official documentation and other implementations

For the last five years I've kept my personal finances in order using the ledger system, a sophisticated command line program that consumes a lightly formatted text file. It's helped me repay debts and get everything in order, helping me financially absorb an injury last month that would have been extremely detrimental just a few years prior.

The stock ledger program is exclusively command-line oriented. For quick checks and grepping over output, this is fine. For some time, though, I've wanted a more graphical, more robust way of looking at my finances. I've also wanted a more familiar query language, since version 2.0's queries were somewhat limited and version 3.0's query syntax is not very well documented yet. Last year I wrote a simple system that pushed monthly reports out to static HTML files, which got me part of the way there, but I really wanted something more flexible. Something where I can just write an arbitrary query and have the output dumped to HTML.

Thus, I present Ledger Web. In Ledger Web, your ledger is kept in a text file, just the same as always, reports are ERB files, and queries are SQL. Ledger Web watches your ledger file and whenever it changes dumps it into a PostgreSQL database table. It's also quite customizable, letting you set hooks both before and after row insertion and before and after ledger file load.

Installation

Ledger Web installation is pretty simple. First make sure you have PostgreSQL version 9.0 or greater installed on your machine. Then, run these commands:

$ gem install ledger_web
$ createdb ledger
$ ledger_web

Then, open your web browser to http://localhost:9090/ where you'll see some simple example reports.

Example Report

Let's walk through a simple pair of reports that shows off most of Ledger Web's features. Yesterday I ran across this blog post which draws a comparison between a typical person's budget and a wooden ship, always springing leaks and at risk of sinking to the bottom. I decided to write a report that shows my expenses both summed by year and broken out into individual lines. First, the Leaky Ship report itself:

<% @query = query({:pivot => "Year"}) do %>
select
    account as "Account",
    xtn_year as "Year",
    coalesce(sum(amount), 0) as "Amount"
from
    accounts_years
    left outer join (
        select
            xtn_year,
            account,
            amount
        from
            ledger
    ) x using (account, xtn_year)
where
    account ~ '(Income|Expenses)'
    and xtn_year <= date_trunc('year', cast(:to as date))
group by
    account,
    xtn_year
order by
    account,
    xtn_year
<% end %>
<div class="page-header">
  <h1>Leaky Ship</h1>
</div>
<%= table(@query, :links => {/\d{4}-\d{2}-\d{2}/ =>
    '/reports/register?account=:0&year=:title'}) %>

It starts off with a database query, defined using a helper named query. It uses a table named ledger, which is where your ledger data will be dumped, as well as a view named accounts_years, which is the cross product of every account by every year. This makes sure that rows show up properly even if there's no data for that particular year. Also, it uses :pivot => "Year", which will pivot the report such that each xtn_year will become its own column.

The :to param in the where clause is automatically populated with the second date in the range at the top of all reports.

Next, it uses some basic Twitter Bootstrap HTML markup to display a nice title, and then uses the table helper to actually dump the query results to an HTML table. The :links option tells the table helper to link the values in any column whose title matches the regular expression /\d{4}-\d{2}-\d{2}/ to /reports/register?account=:0&year=:title, where :0 will get replaced with the value in column 0 (starting from the left, 0 indexed) and :title will be replaced by the title of the column.
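To make the substitution concrete, here's a small shell sketch of the same idea. It only imitates what the table helper does and is not Ledger Web's code: the function names are invented, the account and column title are example values, and I'm not assuming anything about how the real helper escapes URLs.

```shell
#!/bin/sh
# is_linked_column: does a column title look like a date, per the :links regex?
is_linked_column() {
  printf '%s\n' "$1" | grep -Eq '[0-9]{4}-[0-9]{2}-[0-9]{2}'
}

# expand_link: fill :0 and :title into the URL template.
# $1 = template, $2 = value of column 0, $3 = column title
# (Values containing "|" or "&" would need escaping; fine for a sketch.)
expand_link() {
  printf '%s\n' "$1" | sed -e "s|:0|$2|" -e "s|:title|$3|"
}

template='/reports/register?account=:0&year=:title'

# A pivoted year column with a date-like title gets a link...
if is_linked_column '2011-01-01'; then
  expand_link "$template" 'Expenses:Utils:Electric' '2011-01-01'
fi

# ...while the "Account" column itself does not.
is_linked_column 'Account' || echo "Account column is left unlinked"
```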

Here's a screenshot of what this report looks like (Note: this uses the Stan example ledger that I generated for my previous reporting system):

The register report that Leaky Ship links to is pretty trivial in comparison. Here's the source:

<% expect ['account', 'year'] %>
<% @query = query do %>
   select
       xtn_date as "Date",
       account as "Account",
       note as "Payee",
       amount as "Amount"
   from
       ledger
   where
       xtn_year = :year
       and account = :account
   order by
       xtn_date
<% end %>
<div class="page-header">
  <h1>Register</h1>
</div>
<%= table @query %>

The only thing new that this does is use the expect helper to ensure that account and year are query params. If they are not, expect throws an exception rather than showing bad data. Here's what this one looks like:

Both of these reports, as well as a few others, can be found in my Ledger Web configuration. My config also shows off some of the more advanced customizations you can do.

The README goes into much more detail on how the helpers work and the various config settings work. Please, install it and let me know what you think!

If you're looking for something that does some of this for you automatically, check out Personal Capital. I use it alongside my ledger files to help me track investments across all of my accounts, as well as help plan my retirement.

Program Your Finances: Automated Transactions

2011-12-18T18:32:27+00:00

Note: you can find much more information about ledger on ledger-cli.org, including links to official documentation and other implementations

I've been using Ledger for almost five years now to keep track of my personal finances. For three of those years I've lived with a roommate of one form or another. Part of living with a roommate is splitting up bills. Some people decide to do this by dividing the bills up between roommates. For example, Pete pays the electric and gas bills and Andrew pays the water and the cable. Other roommates decide to nominate one person to have all of the bills in their name and post the amounts due every month for everyone to see. This is what my girlfriend and I have been doing and it's been working great. All of the bills are in my name and I give her a summary every month and she hands me a check. Easy peasy.

Of course, being a complete and utter nerd means that I have to make this more complicated than it needs to be in the name of reducing the amount of work I have to do.

Automated Transactions

Ledger has an extremely handy feature named automated transactions. The basic idea is that you provide a template transaction and a pattern to match, and ledger will insert the filled-in template transaction every time the pattern matches. Here's an example:

= /Expenses:Utils:/
    $account                        -0.5
    Assets:Receivable                0.5

This instructs ledger to insert a transaction for 50% of the total transaction amount every time a transaction matches the given regexp (/Expenses:Utils:/). The template variable $account will be replaced with the matched account. So if we have this transaction:

2011/12/18 Electric Company
    Expenses:Utils:Electric          $50
    Assets:Checking

ledger will automatically insert this immediately following:

2011/12/18 Electric Company
    Expenses:Utils:Electric         $-25
    Assets:Receivable                $25

I use an automatic transaction identical to this one in my personal ledger file to split utilities with my girlfriend. From there I can run a simple report and copy and paste the results into an email once a month.
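The arithmetic ledger performs for each inserted posting is just the matched amount times the multiplier in the template. Here's a tiny sketch of that calculation (split_amount is a made-up helper, not a ledger command), using the $50 electric bill from above:

```shell
#!/bin/sh
# split_amount: multiply a posting amount by an automated-transaction multiplier.
# $1 = amount of the matched posting, $2 = multiplier from the template
split_amount() {
  awk -v amount="$1" -v mult="$2" 'BEGIN { printf "%.2f\n", amount * mult }'
}

echo "Expenses:Utils:Electric  $(split_amount 50 -0.5)"
echo "Assets:Receivable        $(split_amount 50 0.5)"
```

The two results cancel out, so the inserted transaction balances on its own, and my net utility expense ends up being half of the bill.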

Virtual Transactions

I wanted to mention another advanced ledger feature that I use every day. For various reasons I keep most of my money in my interest-paying checking account. I have most of it allocated away into various "funds", which are just fake buckets that only exist for me. It's the same idea as ING subaccounts, but implemented in ledger instead of at the bank.

I've implemented these buckets using ledger's virtual transaction feature. Basically, if you surround an account name in square brackets, ledger treats that portion of the transaction as virtual. Ledger will include this transaction in all reports unless you include the --real flag in your report command. Here's an example:

$ ledger bal checking
               $1000  Assets:Checking

Then, we insert this transaction:

2011/12/01 * Establish Emergency Fund
    [Funds:Emergency]                    $500.00
    [Assets:Checking]

and run some more reports

$ ledger bal checking funds
                $500  Assets:Checking
                $500  Funds:Emergency
--------------------
               $1000

$ ledger --real bal checking funds
               $1000  Assets:Checking

By our powers combined...

On their own, these two features are pretty useful. It's when you combine them that the awesome power of ledger starts appearing. As some of you may remember, I had a bit of a medical emergency a few weeks ago and being a citizen of these great United States I have private insurance, so of course I'm going to be paying a not-inconsiderable sum out of pocket. How much? Only time will tell. I can't live like that though, I have to put some kind of structure to it or I'll go crazy. So, I looked up my out of pocket maximum and carved out a portion of my emergency fund into a new medical fund:

2011/12/18 * Establish Medical Fund
    [Funds:Medical]                    $4,000
    [Funds:Emergency]

I also added an automatic transaction that will withdraw from the medical fund whenever I record a medical expense:

= /^Expenses:Medical/
    [Funds:Medical]                      -1.0
    [Assets:Checking]                     1.0

Putting it all together, adding a transaction like this:

2011/12/18 * Corner Drug Store
    Expenses:Medical:OTC               $15.00
    Assets:Checking

will result in these reports:

$ ledger reg funds:medical
11-Dec-01 Establish Medical Fund    [Funds:Medical]       $4000.00   $4000.00
11-Dec-18 Corner Drug Store         [Funds:Medical]        $-15.00   $3985.00

$ ledger reg checking
11-Nov-01 Checking Deposit          Assets:Checking      $10000.00  $10000.00
11-Dec-01 Establish Emergency Fund  [Assets:Checking]    $-5000.00   $5000.00
11-Dec-18 Corner Drug Store         Assets:Checking        $-15.00   $4985.00
                                    [Assets:Checking]       $15.00   $5000.00

$ ledger --real reg checking
11-Nov-01 Checking Deposit          Assets:Checking      $10000.00  $10000.00
11-Dec-18 Corner Drug Store         Assets:Checking        $-15.00   $9985.00

As you can see, the transaction for Corner Drug Store pulled $15 from Assets:Checking which was then automatically replaced from Funds:Medical. The virtual amount available in checking stays the same but the real amount goes down by $15 without any additional input. These two features combined let me spend directly from a virtual account while keeping track of everything for me.
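To make the interaction concrete, here is a rough Python sketch that mimics the behavior described above. It is not how ledger is implemented; the Posting class, apply_medical_rule, and balance are hypothetical names made up for illustration, and the journal simply replays the amounts from the example reports.

```python
from dataclasses import dataclass


@dataclass
class Posting:
    account: str
    amount: float
    virtual: bool = False  # True for [bracketed] accounts


def apply_medical_rule(postings):
    """Mimic the automated transaction: for each posting to Expenses:Medical,
    add a virtual pair that moves the same amount out of the medical fund
    and back into checking."""
    extra = []
    for p in postings:
        if p.account.startswith("Expenses:Medical"):
            extra.append(Posting("Funds:Medical", -1.0 * p.amount, virtual=True))
            extra.append(Posting("Assets:Checking", 1.0 * p.amount, virtual=True))
    return postings + extra


def balance(postings, account, real_only=False):
    """Sum postings for one account; real_only plays the role of --real."""
    return sum(
        p.amount
        for p in postings
        if p.account == account and not (real_only and p.virtual)
    )


# Only the postings that touch the accounts of interest are listed here.
journal = [
    Posting("Assets:Checking", 10000.00),         # Checking Deposit
    Posting("Funds:Emergency", 5000.00, True),    # Establish Emergency Fund
    Posting("Assets:Checking", -5000.00, True),
    Posting("Funds:Medical", 4000.00, True),      # Establish Medical Fund
    Posting("Funds:Emergency", -4000.00, True),
]
journal += apply_medical_rule([
    Posting("Expenses:Medical:OTC", 15.00),       # Corner Drug Store
    Posting("Assets:Checking", -15.00),
])

print(balance(journal, "Assets:Checking"))                  # virtual view
print(balance(journal, "Assets:Checking", real_only=True))  # like --real
print(balance(journal, "Funds:Medical"))
```

The three printed balances correspond to the final running totals in the three register reports.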

If you go to the Ledger website you can find the manual which has been recently greatly expanded and enhanced. There you'll see that the expression for an automated transaction can be much more advanced if you want it to be. Check it out.

Yet Another (not very) Static Blog Generator

2011-12-14T18:30:26+00:00

The very first post on this blog was about how I wanted a completely static blog and how it'll be great and wonderful and boy howdy was it ever. Over 500 lines of rather dense perl plus almost 20 separate template files because the kind-of-mustache that I decided to implement can't handle inlined templates for loops so I have to do everything as partials.

Needless to say, it isn't very fun to work on. It mostly does what I want but adding new things is pretty painful, as is changing any of the templates. Yesterday I decided that I would see what a Sinatra port would look like. Why Sinatra? It's fun, that's why. Ruby and Sinatra make writing new webapps easy and fun.

Details

The new version is called bugsplat.rb. It's 200 lines of ruby, which is actually more than I wanted but there's a lot of functionality packed in there. Here's the complete feature set:

Entries are written in Markdown and checked into the app repo
Entries have a MIME-style header
Entries can have a --fold-- marker that specifies which content should be on the index page
Supports blog posts and static entries that can optionally be linked from the side navigation
Reads all entries into memory at startup
Uses ERB for templates
Caches rendered pages in memory
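As an illustration of the entry format, here is a minimal Python sketch of splitting an entry into its header, the part above the fold, and the full body. This is not the bugsplat.rb code (which is ruby); the parse_entry function and the Title and Date header names are assumptions made up for the example.

```python
def parse_entry(text):
    """Split an entry into headers, the above-the-fold part, and the full body."""
    # MIME-style: header lines, a blank line, then the body.
    header_block, _, body = text.partition("\n\n")
    headers = {}
    for line in header_block.splitlines():
        key, _, value = line.partition(":")
        headers[key.strip().lower()] = value.strip()

    # Everything before --fold-- goes on the index page; the marker itself
    # is dropped from the full body.
    above_fold, marker, rest = body.partition("--fold--")
    full_body = above_fold + rest if marker else body
    return headers, above_fold.strip(), full_body.strip()


sample = """Title: Hello World
Date: 2011-12-14

Intro paragraph shown on the index page.

--fold--

The rest of the post."""

headers, summary, body = parse_entry(sample)
print(headers)
print(summary)
```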

The production site is hosted on Heroku and uses a unicorn setup extremely similar to FivePad's, without the background worker stuff.

Why not some other blog engine?

A while ago I tried porting to Jekyll but without heavy modification I wouldn't have been able to keep the URLs I've built up over the last year and a half. Also for some reason I couldn't wrap my head around liquid templates.

Wordpress or some other dynamic CMS? I could have done that, sure, but that would introduce other dependencies and I really like the emacs-centric workflow of writing markdown files and generating a site. A web-based CMS would have let me write from anywhere but then I'd have to write in the browser, which isn't my idea of fun.

Results

It's not a static site anymore but I think with the caching I have set up it's almost as fast as one. I could even transparently make it one by pre-rendering everything and stashing it in public using a rake task named assets:precompile that Heroku conveniently runs if it exists.

I don't think I'll do that, though. I like the flexibility that this setup gives me.

Re-using

I wouldn't recommend it. There's nothing that precludes someone else forking bugsplat.rb on github, deleting the entries, rewriting the templates, and running their own site, of course, but it wouldn't be trivial. If you actually want to use it on your own site, email me or leave a comment below and we'll work something out.

Another Tiny Webapp

2011-11-30T11:46:50+00:00

Literally ten minutes after hitting the publish button on my last post I took a little tumble and broke a rather important bone in my back, and now I'm on medical leave from work for awhile.

That doesn't stop me from doing fun things, though, so this morning I cooked up a tiny webapp using Sinatra, DataMapper, and Bootstrap that will help me keep track of when I take painkillers. It's called Painkiller Jane after the comic book character.

There's not much interesting going on here to be honest. Basically it's just one database table and an in-app configuration hash that lays out what pills are available, their dosage, and their cooldown period. I can click the buttons when the cooldown is over, but when it's not they tell me what time I can take the next dose.
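The cooldown check boils down to a few lines. Here is a Python sketch of the idea, not the actual Sinatra app; the PILLS hash, its placeholder names and values, and the two helper functions are all hypothetical.

```python
from datetime import datetime, timedelta

# Hypothetical in-app configuration. Names, doses, and cooldowns are
# placeholders, not the values the real app uses.
PILLS = {
    "pill_a": {"dose": "1 tablet", "cooldown": timedelta(hours=4)},
    "pill_b": {"dose": "2 tablets", "cooldown": timedelta(hours=6)},
}


def next_allowed(pill, last_taken):
    """Return the earliest time the next dose of `pill` may be taken."""
    if last_taken is None:
        return datetime.min  # never taken, so it is always allowed
    return last_taken + PILLS[pill]["cooldown"]


def can_take(pill, last_taken, now):
    """True when the cooldown is over, i.e. the button should be clickable."""
    return now >= next_allowed(pill, last_taken)


last = datetime(2011, 11, 30, 8, 0)
print(next_allowed("pill_a", last))
print(can_take("pill_a", last, datetime(2011, 11, 30, 11, 0)))
```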

The only other feature that I might add is a lockout so that it helps me manage which pill to take when because I'm alternating tylenol and advil.

Concurrency on Heroku Cedar

2011-11-27T18:52:36+00:00

I started a small product a few weeks ago called FivePad, a simple, easy way to organize your apartment search. It's basically the big apartment search spreadsheet that you and me and everyone we know has made at least three times, except FivePad is way smarter.

The initial versions of FivePad did everything in the web request cycle, including sending email and pulling down web pages. The other day I was about to add my third in-cycle process when I threw up my arms in disgust. The time had come to integrate resque, a great little redis based job queueing system. Except if I ran it the way Heroku makes easy, my costs would get a little bit out of control for a project that isn't making much money yet.

Backstory

First, a little backstory. Earlier in 2011 Heroku announced their new cedar stack, which is a much more general platform for running webapps than their previous platforms. Cedar lets you describe the processes you want to run using a Procfile. Your processes can use one of a large selection of languages, but FivePad is all ruby. Here's what a Procfile can look like:

web: bundle exec rails server
worker: bundle exec rake resque:work QUEUE=*
scheduler: bundle exec ruby ./config/clock.rb

This creates three different types of processes, web, worker, and scheduler. Heroku intends that you run one of each of these on three different dynos which all charge by the hour, but you get 750 hours free every month.

Options

The official way to do this, of course, is to just spin up multiple dynos. Heroku makes this extremely easy:

$ heroku scale web=2 worker=3

Bam. Done. Two web dynos and three worker dynos all running your code and talking to the same database. Gets to be a bit expensive for hobby/tiny projects, though.

Another option is to run multiple Heroku apps with a shared database. This is decent, in that you get multiple full-powered dynos for free. However, it's kind of a pain to manage. You have to deploy to multiple git repos whenever you do a deploy and you have to make sure all of your environment variables are synced. I've tried it, it's not very much fun.

Yet another option is to auto-scale your workers. The basic idea here is that when a job comes in, the resque client triggers a worker to start up in another dyno, process the job, and then shut down when there are no more jobs left. I tried using this for a long time with Remindlyo and experienced an annoying set of race conditions due to how long rails takes to start. Jobs would get lost and dropped kind of randomly.

Cheating With Style

The system that I've devised for FivePad was inspired by this post by Michael van Rooijen. There, he describes how best to run your rails app on Unicorn, a simple pre-forking rack server. Here's my Procfile:

web: unicorn -p $PORT -c ./config/unicorn.rb

Not much to it. Here's what that config/unicorn.rb file looks like:

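# Run three unicorn web workers and kill any request that takes over 30 seconds.
worker_processes 3
timeout 30

# PID of the resque worker; nil until the first fork.
@resque_pid = nil

# before_fork runs once per web worker, so ||= makes sure
# the resque worker is only spawned the first time through.
before_fork do |server, worker|
  @resque_pid ||= spawn("bundle exec rake " + \
  "resque:work QUEUES=scrape,geocode,distance,mailer")
end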

This starts out pretty simply. Three worker processes and a 30 second request timeout. But then there's that before_fork hook. This simply runs a specified rake task if and only if it hasn't been run before, immediately prior to forking off the next web worker. In this case, it runs the resque:work task, which is how resque processes jobs.

This will actually result in six processes in each web dyno:

1 unicorn master
3 unicorn web workers
1 resque worker
1 resque child worker when it actually is processing a job

This may be a bit much if your application is super heavy, but for FivePad it's working pretty well. Things are much faster now that all of the heavy duty stuff is done in the background, and scaling up to another dyno automatically scales the workers as well. One thing to consider in the future is to drop the web workers down to two and add another dyno, but I'm not going to do that until it actually has significant revenue coming in.

Another drawback of this is that if the worker falls over for some reason I'll have to restart the whole dyno, but the chances of that happening are pretty low. Resque forks off a child worker for every job it processes, which insulates the master worker from any problems with jobs.

Anyway, for now this is how FivePad is running. Scaling up will be simple in the future when it's necessary and I can control costs right now when that's really important.

Introducing FivePad

2011-11-17T17:17:12+00:00

For over two years I've been ruminating on an idea for a webapp that would help people coordinate apartment searches amongst roommates. Finding a place is pretty tough, and finding the right place for you and one or more other people is even harder. Emails, phone calls, spreadsheets, links, bookmarks. It's a mess. So, I built FivePad. More details after the fold.

You can add places by simply pasting in a Craigslist URL or typing in the street address, and FivePad takes care of building a page that shows you where it is in the city and the street view of the address. You can add notes to your places, too. If you're searching with roommates you can invite them and they'll get emails whenever you add stuff, and you'll get emails when they add stuff.

If the place you're looking at has an email address, you can send a note to that address with one click. When they respond their email will get attached to the right place.

The basic product is there but I have a list of features a mile long that I want to add. Expect more postings here as I add stuff :)

It's free to try, so give it a run and let me know what you think.

What sucks about apartment hunting?

2011-10-19T06:31:23+00:00

I want to brainstorm everything that sucks about searching for an apartment. Why is it always so painful? Does it have to suck so much? I have some ideas but I haven't done this in almost a year so the pain, it is not so fresh.

Here's what I've got so far:

Keeping track of everything. Losing phone numbers, did I call that one back, wait is that a duplicate or another unit in the same building?
The uncertainty. Will this landlord suck? How can I find out? Is the neighborhood ok?
Finding some place close. What's the commute time? Where is the nearest grocery store?
Coordinating with roommates or significant others. Will they like it? I guess I could email them...

In the past I've had a few solutions to this, as I'm sure everyone else has too. The first time I searched for a place I didn't really keep track of anything except in my email inbox. It was kind of a disaster. By the time I picked one place I had already lost the others, but didn't find out until after I called them back. The next time was a little better but still I think my roommate and I mostly communicated via email. The third time I built a tiny Rails site for my girlfriend and me to keep track of stuff, but I never really fleshed it out and we kind of just sat next to each other on the couch and shared craigslist links over IM.

What I'm looking for here is everything else that sucks about searching for a place and why, and what kind of solutions you've duct taped together to make it work. Websites that make any aspect of it easier would also be super helpful. Everybody who responds gets $2.00 of Remindlyo credit :)

Remindlyo is a Go

2011-10-08T17:24:56+00:00

Eight weeks ago I embarked, almost by accident, on one of the most interesting challenges that I've ever set up for myself. I've created something new, something that I don't think the world has had before, and that makes me feel good. So, if you're reading this, go check out Remindlyo, because it's as done as it's going to get for now.

Remindlyo, at its core, is about procrastination and forgetfulness. Do you forget to call someone important? Do you "forget" because you can't bring yourself to actually dial the phone? Remindlyo helps out with that, since the friction of hitting a single button on a keypad is so much lower than that of dialing the phone in the first place. You've already answered the phone, might as well keep the ball rolling.

For those of you who care about such things, here's the tech stack:

I'm doing this on my own without much feedback, so if you have any please share it. I would really appreciate anything you have to say.

Join the discussion on Hacker News.

Random Remindlyo Things

2011-09-25T15:49:33+00:00

I think everyone is due for an update on my remindlyo progress, but I don't really have an organized post. Here's a bunch of random thoughts instead.

WePay is awesome

Instead of the tried and true but well-known-to-be-horrible PayPal, I've decided to go with WePay for my payments integration. I made this decision last weekend and today I finally hit the "Submit for Approval" button on remindlyo's application page. The staging server made it trivial to write my integration layer, except that I found a bug when trying to test some things. What makes WePay so awesome is the fact that I was able to email some random person at WePay support and they put their team on the problem right away. Gives me great confidence in them having my back when actual production payment-related things go wrong.

GitHub and Heroku are great too

I've been using GitHub for a number of years now but this is the first project where I've extensively used a private repository. Seamless experience all around.

I also have no complaints about Heroku. Being able to get this thing up and running and scalable with basically no effort has been a lifesaver.

Rails? Eh. It's ok.

I've been building remindlyo in rails at the same time I've been working on another side project (Twitter Fiction Reader, go check it out) in sinatra, and while they're both ruby web development frameworks, they're worlds apart. Rails is a heavy, opinionated, all-the-batteries-you-will-ever-need-included framework, while sinatra is svelte and agile and fun. The one saving grace of Rails is that there's an awful lot of people out there struggling with it too, so it's easy to find different ways of going about things.

Launching

I'm pretty sure I'm close to launching this sucker. There are only a few things holding me back right now. One is WePay, but I'm assuming the people that approve applications don't work on Sunday so hopefully they'll get back to me tomorrow. The only other thing is the marketing site, which needs some love. I might just go buy a template from Theme Forest that includes the bits I need, because making attractive web pages is definitely not something I'm good at.

So, maybe next week? We'll see.

Quadrotor Update 3

2011-09-11T21:06:12+00:00

Quick update on the quadrotor. I went through all of the motor wiring and found that the motors were hooked up to the wrong terminals! This diagram plus watching the orientation of the machine on the configuration software helped me figure out where I had gone wrong. After that, I calibrated the throttle range on all of the speed controllers and then tried flying again.

Except, it still doesn't really work. I played around with it for quite awhile but one of the speed controllers is still basically either on or off. No proportional control at all. Also, I relocated the gyro and accelerometer board to dead center of the body, which helped a little.

And then, when I was holding it and playing with the throttle and trim on the transmitter, one of the motors stalled out somehow. I'm wondering if maybe the battery is low again. Going to throw it on the charger for awhile tomorrow and try again.

Quadrotor Update Part 2

2011-09-09T19:39:35+00:00

The new propellers came today. I ordered a pair of sets of 10x4.5 propellers from Amazon and promptly failed to get them to work with the motors. Turns out they're hard to mount! I ended up making a trip to the hardware store to pick up some O-rings, which are much more stable than the little rubber bands the motors came with. Here's what it looks like:

And here's the mess of wires close up:

Awesome, right? Right! I hooked up the battery, turned on the transmitter, and gave it a shot.

Aaaaaaaaand....

It promptly flipped over.

After a few more or less comical runs of this I read up a little bit, and it turns out sometimes this happens because the battery is low. Put it on the charger for awhile and then tried again with no luck. So then I put the small props back on and held the thing in my hand while revving it up, and it turns out one of the motors is way more powerful than the others. I'm not sure why, but I think maybe one of the speed controllers isn't configured right. Now to figure out how to calibrate them without taking everything apart.

In any case, I'm pretty happy about having it together and making some progress. Maybe by this time next week I'll actually have a flying machine!

Quadrotor Update

2011-09-05T11:34:33+00:00

After too many days of everything remindlyo I decided that today would be the day I finally got off my ass and finished putting together my quadrotor. There were a bunch of minor dramas that made me drop it for much too long, all detailed below the fold.

Drama number one was trying to get my Wii Motion Plus knockoff working. It never did, and in fact I'm pretty sure I completely fried it somehow. The software stopped reading the sensor, and some test software I wrote made it apparent that the sensor, even if it was responding to I2C protocol, was only responding with zeroes.

So, that sucked. I eventually pulled myself out of that deep black hole and ordered a replacement sensor from SparkFun. It was pretty expensive but it includes a three axis gyro and a three axis accelerometer on one tiny little board. Ordered, shipped, received. As I was soldering pins onto it I noticed a little 'VCC 3.3v' mark on it. Uh oh. My Arduino is 5v! Back into the deep black hole!

Literally months later I circled back and ordered this little part, a 5v to 3.3v logic level converter. Oh, and a 5v to 3.3v regulator. Oh actually two of each, in case I blew something up again. Took me awhile to get around to actually putting it together, which brings us to today. Soldered up the level converter, hooked everything up, put the propellers on, aaaaaaaaand....

It's too heavy.

The propellers flew off the motors before it moved even an inch above the floor.

Crap.

Anybody have suggestions for better propellers, or how to calculate what kind of propellers I need? As is, the whole thing is 445 grams including the battery.

Bootstrapping a Side Business - First Steps

2011-09-04T08:38:05+00:00

For the past few weeks I've been working on a little product that I'm calling remindlyo, which I'm hoping to turn into a secondary income stream. The basic idea is that you put events about the important people in your life, like birthdays, anniversaries, or what have you, into remindlyo. On the day of the event, remindlyo calls you to remind you and connects you to them, all on the same phone call. You can read more about it on the main remindlyo site. In this post I want to talk more about the why instead of the what.

Since I first figured out what a business does, I've wanted to get in on it. I think I was 12 when I first said to my mom "I want to start a business. I don't want a regular job." Of course, like any responsible mother, she replied "How about you go to college and get a few years experience before starting?" Not exactly satisfactory, but it was sound advice. Now, 15 years later, I'm in a position where I can actually follow through.

I've been aware of Patrick McKenzie for a few years now, but I recently came across his Greatest Hits list and over the course of maybe two days I devoured every single post. They're all great. I had already had an idea for a little project like remindlyo, but Patrick's posts inspired me to expand it into something that I could actually sell.

So then the problem was, what to sell? What can I build that people will buy? I've had a few ideas in the past, but then a few weeks ago my girlfriend kind of dropped the idea for remindlyo in my lap one day. When I told another developer at Emma about it he suggested a whole suite of improvements, which pretty much leads directly to the marketing site I have up and running today.

I can hear you asking "but what if people don't buy it?". That's the biggest fear, but also the most irrelevant, at least for me. I'm doing this first and foremost to learn how a business works from the inside. A very close second is the chance to use a bunch of technologies that I haven't played with until now, including Ruby on Rails, Twilio, and Heroku, as well as to learn how web marketing and the good kind of SEO work. The opportunity to gain an additional income stream is a somewhat distant third.

Program Your Finances: Vacation Tracking

2011-08-04T16:47:48+00:00

Note: you can find much more information about ledger on ledger-cli.org, including links to official documentation and other implementations

Recently my girlfriend and I visited the wonderful city of Vancouver, Canada. While out of country we tend to use my Schwab Investor Checking account because it carries no fees whatsoever, including currency conversions, and it refunds all ATM fees. Last year when we went to Ireland we just kept all of the receipts and figured it out when we got back, which was excruciatingly painful. Lost receipts, invisible cash transactions, ugh. It hurts to even think about it. This year, I decided to cobble together a simple system so we could track on the fly. Read on to see how it came together.

This system has the following moving parts:

Dropbox
Ledger
Nebulous Notes, a text editor for iOS that syncs with Dropbox
an always-online machine hooked up to Dropbox and capable of running a python script every once in a while

The workflow is pretty simple. Whenever we spent some money, I recorded it in a very simplified manner at the bottom of a ledger file that lives in Dropbox. Here's a few examples:

2011/07/23 Cash
    Cash  160.00 CAD
    ATM Fees  2.50 CAD
    Checking

2011/07/23 SkyTrain tickets
    Transit  5.00 CAD
    Cash

2011/07/24 Acme Cafe
    Food:Breakfast  29.40 CAD
    Checking

Note that, while simple, these are all valid ledger entries. They just have a different account structure than what my main ledger uses. I used a different file and an abbreviated account structure for a few reasons. First, using shorter account names means I didn't have to type as much on the iPhone screen. Second, this was an experiment, so having a separate file means I didn't have to worry about corrupting my main ledger. Third, loading up my main ledger on the phone slowed Nebulous Notes to a crawl, which wouldn't have been fun dealing with on the move.

Nebulous Notes also has a great macro system, so I was able to program a few interesting templates. "Go to the bottom of the file", Food, and Transit were all one button.

So, now that we have transactions going in and being synced, let's get a little fancy. This was a frugal vacation, so I wanted to see totals by account while we were out and about. Here's that python script:

#!/usr/bin/env python

import sys
import re

amount_by_account = {}

ledger_file = sys.argv[1]
summary_file = sys.argv[2]

# A posting line is indented and looks like "    Account  12.34 CAD".
# Account names may contain single spaces (e.g. "ATM Fees"); two or more
# spaces separate the account from the amount.
posting_re = re.compile(r"\s+([\w:]+(?: [\w:]+)*)\s{2,}([0-9.]+) CAD")

with open(ledger_file) as f:
    for line in f:
        match = posting_re.match(line)
        if match is None:
            continue

        account = match.group(1)
        amount = float(match.group(2))
        amount_by_account[account] = \
            amount_by_account.get(account, 0.0) + amount

# Write one line per account, sorted by name, followed by a grand total.
with open(summary_file, "w") as f:
    total = 0.0
    for account in sorted(amount_by_account):
        total += amount_by_account[account]
        f.write("{0:<20}{1:>10.2f}\n".format(account,
            amount_by_account[account]))

    f.write("------------------------------\n")
    f.write("{0:<20}{1:>10.2f}\n".format("Total", total))

Why python and not ledger itself? The machine that I had available to run this thing is a PowerPC Mac mini, which I've never been able to get ledger running properly on. So, python it is! Basically, it looks for lines in ledger_file matching the pattern of a ledger posting, totals up the amounts, and prints them out in alphabetical order to summary_file, which also lives on Dropbox. I had this on a one-minute cron schedule, so whenever I wanted to see our totals (and was in wifi range) I could just open summary_file in Nebulous and sync it up.

I could have used one of any number of iOS expense tracking apps, and that might have been the smarter way to go, but what fun would that be? Also, this system automatically backs itself up whenever Nebulous connects to Dropbox AND it lets me do another fun thing: import (almost) directly into my main ledger.

I say "almost" both because of the simplified account structure and because it's in Canadian dollars and my ledger is exclusively US dollars. Problematic! The easiest way to fix this was by applying the power of perl:

#!/usr/bin/perl

use strict;
use warnings;

my %rates = (
    '2011/07/23' => 1.0531,
    '2011/07/24' => 1.0531,
    '2011/07/25' => 1.0595,
    '2011/07/26' => 1.0596,
    '2011/07/27' => 1.0619,
    '2011/07/28' => 1.0615,
    '2011/07/29' => 1.0615,
);

my $current_date = undef;

while (<>) {
    if (/(\d{4}\/\d{2}\/\d{2})/) {
        $current_date = $1;
    }

    if (/(\s+)([\w\d: ]+)  (\d+(\.\d+)?)/) {
        $_ = sprintf(
            "%sExpenses:%s  \$%.2f\n",
            $1,
            $2,
            ($3 * $rates{$current_date})
        );
    }

    s/    Cash/    Expenses:Cash/;

    s/Checking/Assets:Schwab:Checking/;
    print $_
}

Oh perl. So useful. So ugly. For every line, if it looks like a date, save off the date. If it looks like a transaction line, do the currency conversion (rates are calculated by picking a sample transaction from the bank and backing into it). If it looks like a Cash or Checking account, replace it with the long form of the account name. Finally, print it back out. This got me most of the way there, but I had to tweak some of the amounts due to posting lag at the bank and using a different day's conversion rate. I also piped it through ledger to get the formatting cleaned up:

$ convert-vancouver-ledger.pl vancouver_ledger.txt | \
  ledger -f - print > final_ledger_file.txt
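The "backing into" a rate mentioned above is just a division. Here is a small Python sketch, assuming for the sake of illustration that the Acme Cafe charge of 29.40 CAD posted at the bank as $31.15; implied_rate and to_usd are hypothetical helper names, not part of the scripts above.

```python
def implied_rate(cad_amount, usd_posted):
    """Back into the CAD-to-USD rate from one transaction on the statement."""
    return round(usd_posted / cad_amount, 4)


def to_usd(cad_amount, rate):
    """Convert a CAD amount using a rate found with implied_rate."""
    return round(cad_amount * rate, 2)


rate = implied_rate(29.40, 31.15)
print(rate)
print(to_usd(29.40, rate))
```

The resulting rate is what would go into the %rates hash for that day. Amounts converted with another day's rate can be off by a cent or two, which is where the manual tweaking comes in.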

A few quick adjustments and sanity checks later and I was able to copy and paste into my main ledger. I also added some metadata so I could get a report for my girlfriend so she could see what we spent and write me a check for half. Frugal, remember? The final transactions ended up looking like this:

2011/07/23 * Cash
    ; :vancouver:
    Expenses:Cash                            $168.51
    Expenses:ATM Fees                          $2.63
    Assets:Schwab:Checking

2011/07/23 * SkyTrain tickets
    ; :vancouver:
    Expenses:Transit                           $5.27
    Expenses:Cash

2011/07/24 * Acme Cafe
    ; :vancouver:
    Expenses:Food:Breakfast                   $31.15
    Assets:Schwab:Checking

I think it turned out pretty well. I spent less than an hour the day after we got home getting everything cleaned up and copied into my main ledger. When we next go out of the country, I'm sure I'll use this same system. I'll probably put the summarizer script in Dropbox, though, so I can tweak it if necessary.

Program your Finances: Reporting for Fun and Profit

2011-07-09T08:14:55+00:00

Note: you can find much more information about ledger on ledger-cli.org, including links to official documentation and other implementations

Another note: I've written a new version of this that is much more dynamic and flexible named Ledger Web.

Last year I wrote what ended up being the most popular article on this blog ever, Program Your Finances: Command-line Accounting. That post went over how I track and report on my finances using a program called Ledger along with a few helper scripts. Recently I expanded that toolset quite a bit and wanted to show how keeping meticulous track of your finances can give you superpowers. Read on for the gory details.

Stan the Example Man

Talking about personal finances is kind of a tricky thing. If you want to give anything more than a cursory treatment of the subject you have to have some data but the closest source of data to hand is always your own. Some people have decided to talk publicly about their data but I'm not quite ready to do that. Instead, I've written a little python tool to generate a plausible but random history when given a simple json config file. Here's a super simple example:

[
    {
        "payee": "Kettleman Bagels",
        "dow": 3,
        "postings": [
            ["Expenses:Food:Breakfast", [7.20, 7.80]],
            ["Assets:Checking"]
        ]
    }
]

This says, "Every Thursday, buy breakfast at Kettleman Bagel Company. It should cost between $7.20 and $7.80." The dow key is the day number, where 0 is Monday and 6 is Sunday. The postings array gives a list of Ledger postings that should be inserted for this entry. The first element is the account name, the second is one of: a single float representing the amount in dollars; empty, meaning that this entry should be the balance of all the other entries; or an array of arguments to pass to Python's random.triangular function. There are a bunch more options that I won't get into here but you can see in the github repo.
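To show how a config like this could turn into ledger entries, here is a minimal sketch. It is not the real generate.py; the generate and render_amount functions and the output layout are assumptions, and only the payee, dow, and postings keys described above are handled.

```python
import random
from datetime import date, timedelta

config = [
    {
        "payee": "Kettleman Bagels",
        "dow": 3,
        "postings": [
            ["Expenses:Food:Breakfast", [7.20, 7.80]],
            ["Assets:Checking"],
        ],
    }
]


def render_amount(spec):
    """A float is used as-is; a list is passed to random.triangular."""
    if isinstance(spec, list):
        return random.triangular(*spec)
    return float(spec)


def generate(config, start, end):
    """Yield one ledger-formatted entry per matching day in [start, end]."""
    day = start
    while day <= end:
        for entry in config:
            if day.weekday() == entry["dow"]:  # weekday(): Monday is 0
                lines = ["{0} * {1}".format(day.strftime("%Y/%m/%d"), entry["payee"])]
                for posting in entry["postings"]:
                    if len(posting) == 1:
                        # No amount: this posting balances the others.
                        lines.append("    " + posting[0])
                    else:
                        amount = render_amount(posting[1])
                        lines.append("    {0:<40}${1:.2f}".format(posting[0], amount))
                yield "\n".join(lines)
        day += timedelta(days=1)


entries = list(generate(config, date(2011, 7, 1), date(2011, 7, 31)))
print(entries[0])
```

Running it over July 2011 produces one entry for each Thursday of the month, each with a different random breakfast price.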

Using generate.py and this config, I've created a ledger file for a gentleman who we'll call Stan. Why "Stan"? Because he's the man, that's why. Stan is an unattached twenty-something software developer living in Portland, Oregon. He has a car, a moderately sized student loan, and a pretty decent apartment in a so-so area of town. He's been tracking his expenses for almost four years using Ledger, and he's pretty good at it now. (For the curious, Stan is loosely based on me. Simplified in places, exaggerated in others, cheerfully optimistic in salary.)

Reporting? What's that mean?

Collecting all of this data wouldn't be worth a whole lot if I couldn't analyze it in various ways. Ledger lets me look at things lots of really interesting ways, but sometimes it's a little bit too low level. Too nitty gritty. Too miss-forest-for-the-trees. Sometimes I want to step back and get a bigger view of where my financial life has been, and were I can expect it to lead, and maybe where I should make some changes. When a business wants to do this, they create a series of financial reports. Lots of businesses are compelled to do this by the SEC because they're public corporations, but every well-run business will create these reports regularly to help them keep on track.

Well, I'm kind of a business, right? I do work and receive money as the result of that work. I have short and long term debt, investments, equity, assets, etc etc. My sole motivation isn't profit, of course, but in a lot of other respects I try to run my finances as if they were a business. To that end, I've made a series of tools that produce a suite of reports that are fairly similar to what a business would want. From top to bottom we have:

Balance sheet A snapshot of important accounts and a general idea of "net worth" over time.
Net worth chart A monthly overview of the "Total" line from the balance sheet for all available months.
Income Statement A monthly breakdown of income, expenses, and liability payments.
Burn Rate Given Stan spends the "Burn" column on average every month for the trailing 12 months and assuming he'll spend about that same amount going forward, his savings will last him "Months" months. He'll run out of money sometime in February 2012.
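The burn rate arithmetic is simple enough to sketch in a few lines of Python. This is an illustration under assumptions, not the query from run_reports.py: the function name burn_rate is hypothetical and the dollar figures in the usage example are invented, chosen only so the result lands on the February 2012 date mentioned above.

```python
from datetime import date


def burn_rate(monthly_expenses, savings, today):
    """Return (average monthly burn, months of runway, run-out month)."""
    trailing = monthly_expenses[-12:]          # trailing 12 months only
    burn = sum(trailing) / len(trailing)
    months = savings / burn
    # Advance the calendar by the whole number of months of runway.
    month_index = today.year * 12 + (today.month - 1) + int(months)
    run_out = date(month_index // 12, month_index % 12 + 1, 1)
    return burn, months, run_out


# Invented numbers: $2,000 a month for a year against $14,000 in savings.
burn, months, run_out = burn_rate([2000.0] * 12, 14000.0, date(2011, 7, 1))
print(burn, months, run_out)  # 2000.0 7.0 2012-02-01
```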

But Ledger is a command line program!

Ledger is a command-line program, that's true. I couldn't go directly from my ledger file to pretty html reports with charts and tables, so I invoked two of my favorite chainsaws to hack this out: PostgreSQL and python. PostgreSQL is a wonderfully powerful database that happens to be open source and community driven, and also very easy to use. Python is, well, it wouldn't have been my first choice until pretty recently, but now that I've started using it perl has kind of dropped off my radar. It's pretty great.

Here's the outline of how this thing works:

1. Start maintaining a ledger file
2. Create a PostgreSQL database with the ledger schema
3. Export the ledger to csv using ledger csv and load it into PostgreSQL using load_ledger.sh
4. Run some sort-of complicated queries and dump them into HTML tables using run_reports.py
5. Style the html tables using jquery.datatables and build a chart using jqplot

When I started this I knew I wanted a SQL database. I chose PostgreSQL in particular over sqlite partly out of familiarity, but also because it handles dates so well. Date is a top-level data type in postgres, instead of having to do weird things with strings like in sqlite.

Why a SQL database instead of just futzing with stuff in python data structures? Because in SQL I can express a rotated dataset pretty easily, whereas in python it would have been a lot of code. See run_reports.py for examples of this. Also, it lets me index the hell out of the tables, build summary tables with weird conditions, and still be able to do neat queries.

Neat Queries You Say?

Honestly, with a lot of work these reports could have been expressed using straight ledger without involving the database at all. It would have been nastier and terser and kind of weird, but I could have done it.

Here's a query that ledger would not have been able to do as far as I know, however:

select
    xtn_month,
    sum(case when pay_period = 1 then amount else 0 end) as pp1,
    sum(case when pay_period = 2 then amount else 0 end) as pp2
from
    aggregated_accounts
where
    account ~ 'Expenses'
    and account !~ 'Taxes'
    and account !~ 'Interest'
    and xtn_month >= '2011-01-01'
group by
    xtn_month
order by
    xtn_month;

 xtn_month  |   pp1   |   pp2   
------------+---------+---------
 2011-01-01 |  418.79 | 1249.39
 2011-02-01 |  477.18 | 1146.11
 2011-03-01 |  432.92 | 1316.65
 2011-04-01 |  439.95 | 1274.56
 2011-05-01 |  385.60 | 1417.73
 2011-06-01 |  547.77 | 1193.86
 2011-07-01 |  189.75 |       0

Being able to group by completely arbitrary things in ledger has been a pain point for me since I started using it. In this case, I'm grouping by pay_period, a column that has this definition in aggregated_accounts:

CASE
    WHEN (
        xtn_date >= '2010-12-05'
        and extract('day' from xtn_date) between 1 and 14
     ) THEN 1
    WHEN (
        xtn_date < '2010-12-05'
        and (
            extract('day' from xtn_date) between 1 and 6
            or extract('day' from xtn_date) between 22 and 31
        )
    ) THEN 1
    ELSE 2
END as pay_period
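For checking that CASE expression outside the database, the same rule can be written as a small Python function. This is only a mirror of the SQL above for illustration; the names CUTOVER and pay_period (as a Python function) are not part of the actual tools.

```python
from datetime import date

# The date on which the pay schedule changed, taken from the CASE expression.
CUTOVER = date(2010, 12, 5)


def pay_period(xtn_date):
    """Return 1 or 2, following the same rules as the SQL CASE expression."""
    day = xtn_date.day
    if xtn_date >= CUTOVER and 1 <= day <= 14:
        return 1
    if xtn_date < CUTOVER and (1 <= day <= 6 or 22 <= day <= 31):
        return 1
    return 2


print(pay_period(date(2011, 1, 10)))   # 1
print(pay_period(date(2011, 1, 20)))   # 2
print(pay_period(date(2010, 11, 25)))  # 1
```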

The "Burn" calculations are another example. Before I had the data in postgres I had an extremely messy shell script that invoked ledger, date, and dc to calculate it, and if anything broke it all fell down with a weird error.

Drawbacks

The only drawback right now is that the postgres/python setup can't handle differing commodities very well. I track my investment accounts in ledger right alongside my transactional accounts, and ledger has the ability to go download price quotes for the holdings in those accounts whenever you want and do various calculations on them, but the way I'm doing the CSV export right now doesn't do any of that.

Conclusion

With this setup, I'm able to keep my financial data in a simple, easy to use format and retain the ability to do quick checks on it using ledger. In addition, I can do complicated queries that would get extremely nasty in straight ledger. It's really the best of both worlds. I've put the tools on GitHub if you want to check them out and maybe install them and try them out.

Quadrotor Motors Are Alive!

2011-04-24T14:48:36+00:00

I found some time today to work on my quadrotor project some more. A few weeks ago I got one motor mounted and spinning, just using the RC receiver and transmitter. Today, I mounted the motors and set up a little test program on the arduino to make them spin. Check it out:

Test program and more info after the fold.

Among other things, this entailed soldering the speed controllers to the motors, mounting the motors, putting more headers on the arduino, and figuring out the basic wiring. Here's the test program:

#include <Servo.h>

#define ARMING_SPEED 900
#define ZERO_SPEED 1300
#define MAX_SPEED 1850

Servo front;
Servo right;
Servo back;
Servo left;

void write_speed(int in_speed) {
  front.writeMicroseconds(in_speed);
  right.writeMicroseconds(in_speed);
  back.writeMicroseconds(in_speed);
  left.writeMicroseconds(in_speed);
}

void setup() {
  front.attach(10);
  right.attach(11);
  back.attach(12);
  left.attach(13);
  write_speed(ARMING_SPEED);
  delay(10000);
}

int speed = 0;
void loop() {
  for(speed = ZERO_SPEED; speed < MAX_SPEED; speed++) {
    write_speed(speed);
    delay(20);
  }
  for(speed = MAX_SPEED; speed > ZERO_SPEED; speed--) {
    write_speed(speed);
    delay(20);
  }
}

All this does is set up four instances of the built-in Servo class on four different pins. After attaching the servo objects to pins, it sets them all to ARMING_SPEED, which is really just a speed that the speed controllers recognize as the throttle being completely off. Then it waits for 10 seconds and starts sweeping from ZERO_SPEED (idle but running) to MAX_SPEED (could be up to 2000 but the propellers have a tendency to fall off at that speed) and back down again.

One note about these speeds. The way an RC receiver controls a servo is via PWM, "pulse width modulation". The receiver sends out a train of pulses to the servo, roughly 20 milliseconds apart, and the width of each pulse carries the position. A width of 1000 microseconds indicates "full left", a width of 2000 microseconds indicates "full right", and 1500 microseconds "centered". A speed controller uses the same protocol, except it can't reverse direction and the range is a little bigger. 1300 is about idle, 2000 is full power, 900 is "safe".
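To show how a throttle setting maps onto those pulse widths, here is a small sketch. It is written in Python purely for illustration (the real thing would live in the Arduino sketch), and the function name throttle_to_pulse and the 0.0 to 1.0 throttle scale are assumptions, not part of the test program above.

```python
# Pulse widths in microseconds, matching the #defines in the test program.
ARMING_SPEED = 900
ZERO_SPEED = 1300
MAX_SPEED = 1850


def throttle_to_pulse(throttle):
    """Map a throttle in [0.0, 1.0] to a pulse width in microseconds."""
    throttle = min(1.0, max(0.0, throttle))  # clamp out-of-range input
    return int(round(ZERO_SPEED + throttle * (MAX_SPEED - ZERO_SPEED)))


print(throttle_to_pulse(0.0))  # 1300
print(throttle_to_pulse(0.5))  # 1575
print(throttle_to_pulse(1.0))  # 1850
```

Clamping the input keeps a bad value from ever commanding more than MAX_SPEED, which matters when the propellers are attached.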

The next step is to get the frame together the rest of the way and mount the electronics. Oh, and add in the Wii Motion Plus and Nunchuck boards to get the six axis IMU running. That's for another day, though.

ProcLaunch v1.2

2011-04-15T12:58:59+00:00

Just a few bug fixes this time:

When you send proclaunch SIGHUP, it will send all of the profiles their respective stop signals and then wait for them to shut down. You can tell proclaunch to stop without waiting by sending SIGHUP again.
You can pass the --log-path command line option to change where proclaunch writes its log. By default this is $profile_dir/error.log.

Get it from github! These changes were generously sponsored by Zipline Games, who are using proclaunch to launch lua mongrel2 handlers as part of their Moai Cloud platform.

ProcLaunch Improvements and v1.1

2011-03-04T16:31:23+00:00

ProcLaunch has learned a bunch of new things lately. I've fixed a few bugs and implemented a few new features, including:

A --log-level option, so you can set a level other than DEBUG
Kill profiles that don't exist
Instead of killing the process and restarting, proclaunch can send it a signal using the reload file
Instead of always sending SIGTERM, the stop_signal file can contain the name of a signal to send when proclaunch wants to stop a profile
Pid files are properly cleaned up after processes that don't do it themselves
You won't get two copies of proclaunch if one is already running as root

Get version 1.1 from github! Thanks a bunch to Matt, who hunted down the bugs and helped me figure out the features.

(Also, I added highlight.js syntax highlighting. Hope you like it!)

I Soldered Something!

2011-02-15T17:57:31+00:00

The Arduino is a cool little development board, actually a series of them, that makes it a snap to get up and running with embedded development. I've wanted to get my hands on one for a while but I haven't really had an application. That is, I didn't until I saw this:

And then I did some research and found this:

This is an r/c quadcopter, a four bladed helicopter that uses an Arduino running the MultiWii software wired up to some knock-off Wii sensors to stabilize itself. The concept is similar to an F-22 or F-117, in that the thing is completely unstable and would probably fall out of the sky without computer control. This is then connected to a four channel r/c receiver, controlled by a normal r/c transmitter.

So.

Cool.

Of course I immediately started scheming to get one of these up and running. The biggest problem was that I had no tools, nor an Arduino, nor any r/c equipment, nor materials to put this thing together. Naturally, I turned to SparkFun to set me up. They're a great online store that has all kinds of useful bits, including most everything electronics-wise that I'll need to get this project off the ground.

The box arrived today!

And here's what was inside it:

(click the picture to see Flickr notes)

Basically I needed everything, so I got a cheap but solid soldering iron, solder, wires, headers, a brass "sponge" for cleaning the iron, and of course a pair of Arduino Pro Mini 16MHz 5v as well as the appropriate programming cable.

After unboxing I installed the Arduino software, the USB driver, and tried getting the simple blink example to install on one of the Arduinos without soldering on some headers. Wouldn't program very reliably, so I broke down and actually heated up the iron and melted some stuff. Here's the result:

Most of the joints are fine, but TX0 didn't get much solder through the hole so I'm going to have to watch it. In any case, it works. I got the blink example working and then wrote up a stupid little S-O-S blinking program and installed it. Pretty lame in the grand scheme of things, but I got it all running, including the soldering, in less than an hour. Extremely gratifying and a great start.

Now to buy sensors, motors, propellers, r/c equipment, and batteries, build the frame, etc etc. I'm thinking strongly about refactoring and rewriting large chunks of MultiWii, but that will come after I get it flying with the stock code.

ProcLaunch v1.0

2010-09-23T19:39:47+00:00

I kind of started ProcLaunch as a lark. Can I actually do better than the existing user space process managers? It turns out that at least a few people think so. I've gotten a ton of great feedback from thijsterlouw, who actually filed bug reports and helped me work through a bunch of issues. ProcLaunch even has some tests now!

As of today, I'm releasing ProcLaunch v1.0, which you can download from the github downloads page. Interesting changes from the initial version:

Moved to an explicit state machine

In the first version there were a lot of edge cases where proclaunch would have a seemingly random sleep, or some other weird thing. I've removed all of the edge cases by creating an explicit state machine. Profiles have a _status() attribute, which is always one of stopped, starting, running, or stopping. The only sleep() is at the end of the main loop.
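A rough sketch of what such a state machine might look like, in Python rather than proclaunch's perl. Only the four status names come from the description above; the class layout, the tick method, and the exact transition rules are guesses for illustration and are not lifted from the proclaunch source.

```python
STOPPED, STARTING, RUNNING, STOPPING = "stopped", "starting", "running", "stopping"


class Profile:
    """Hypothetical profile that is always in exactly one of four states."""

    def __init__(self, name):
        self.name = name
        self.status = STOPPED

    def tick(self, should_run, pid_alive):
        """Advance at most one transition; called once per main-loop pass."""
        if self.status == STOPPED and should_run:
            self.status = STARTING
        elif self.status == STARTING:
            # Assumed rule: a start that never produced a live pid falls back.
            self.status = RUNNING if pid_alive else STOPPED
        elif self.status == RUNNING:
            if not pid_alive:
                self.status = STOPPED
            elif not should_run:
                self.status = STOPPING
        elif self.status == STOPPING and not pid_alive:
            self.status = STOPPED
        return self.status


# Profiles live in a dict keyed on name and are never replaced once created.
profiles = {"sleeper": Profile("sleeper")}
for should_run, alive in [(True, False), (True, True), (False, True), (False, False)]:
    print(profiles["sleeper"].tick(should_run, alive))
```

The loop prints starting, running, stopping, stopped, one per line. Because every pass only inspects the current status, there is nothing to sleep on inside the transitions themselves.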

The main motivation for this change is that the old version was just plain bad design. Every iteration of the main loop would create a whole new set of Profile objects, overwriting the old list. Awp, but what happens to profiles that should stop? Let's keep track of their pids and keep trying to kill them over and over until they finally die. But what happens if proclaunch dies before those pids die? Do they just live forever, the eternal zombies of a daemon gone wrong?

The new design eliminates both the repeated kill and the overwriting. Now, profiles are kept in a hash keyed on name and are never replaced after creation. Profiles that get stopped are put in the stopping state, where proclaunch checks up on the pid every second until it finally dies. The profile then moves to stopped, ready to be restarted.

Improved logging

Log lines have a static format: <Timestamp> <Log Level> <Tag> <Message>. <Tag> is either ProcLaunch or the name of the profile. If a message mentions a pid, it will always be stated as pid <PID>. This change should make it easier to grep through the logs and automatically parse them for monitoring through nagios or what-have-you.
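As an example of the kind of automatic parsing this enables, here is a minimal Python sketch. It assumes the timestamp, level, and tag contain no whitespace, which the format above does not actually promise, and the sample log line is made up rather than copied from real proclaunch output.

```python
import re

# Assumption: timestamp, level, and tag are each a single whitespace-free token.
LINE_RE = re.compile(
    r"^(?P<timestamp>\S+) (?P<level>\S+) (?P<tag>\S+) (?P<message>.*)$"
)
PID_RE = re.compile(r"\bpid (\d+)\b")


def parse_line(line):
    """Split one log line into its fields; return None if it does not match."""
    match = LINE_RE.match(line.rstrip("\n"))
    if match is None:
        return None
    record = match.groupdict()
    pid = PID_RE.search(record["message"])
    record["pid"] = int(pid.group(1)) if pid else None
    return record


# Invented sample line, for illustration only.
print(parse_line("2010-09-23T19:39:47 INFO sleeper started with pid 1234"))
```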

Please check it out and beat it up. If you notice any issues, don't hesitate to submit an issue or email me.

Perl with a Lisp

2010-08-22T14:22:36+00:00

Browsing around on hacker news one day, I came across a link to a paper entitled "A micro-manual for Lisp - Not the whole truth" by John McCarthy, the self-styled discoverer of Lisp. One commenter stated that they have been using this paper for a while as a code kata, implementing it several times, each in a different language, in order to better learn that language. The other day I was pretty bored and decided that maybe doing that too would be a good way to learn something and alleviate said boredom. My first implementation is in perl, mostly because I don't want to have to learn a new language and lisp at the same time. The basic start is after the jump.

Building a lisp seems to center around two key decisions. First, how do you represent your core data structure? A two-element array? A struct? Something a little more complex? Second, what are your scoping rules? Lexical? Dynamic? Global? After that, everything else is gold plating. Substrate-language interop, how you represent scopes, how to get closures right, macros, etc, all can be determined later.

I've chosen to write this first implementation in perl. I know perl pretty well but more importantly I don't know lisp very well at all. I've done a little elisp hacking, but not much. I certainly don't know how all of the pieces fit together quite yet. This first post is really more about getting the fundamental data structure and list-manipulation routines and the reader down. Later posts will elaborate on eval and friends, as well as closures, scoping, and perl interop.

Data Structure

Lisp represents most things fundamentally in terms of what's known as a cons cell. This is some sort of object that has two slots for other objects, be they primitives or other cons cells. Being a good little modern perl programmer, I've chosen to implement this as a small Moose-based class:

package Cell;

use Moose;

use overload
    'bool' => sub { return !shift->is_nil() },
    'fallback' => 1
;

has 'car'    => (is => 'rw');
has 'cdr'    => (is => 'rw');
has 'is_nil' => (is => 'ro', default => 0);

1;

Using Moose, we define an object with two read-write slots named car and cdr. This is due entirely to historical precedent: car is the first element in the pair, cdr is the second. is_nil is there to allow us to define a fixed nil value later on. The overload allows us to use a Cell in a boolean context. Anything that doesn't have is_nil set is true.

Fundamental Functions

Now that we've got the data structure done, let's define a few fundamental functions to work with it.

our $NIL = Cell->new(is_nil => 1);
sub nil
{
    return $NIL;
}

our $T = "t";
sub t
{
    return $T;
}

sub equal
{
    my ($a, $b) = @_;
    return t if $a eq $b;
    return nil;
}

Notice how $NIL is just hanging out there. It's the only Cell that will ever have is_nil set. We return the reference to the singleton from the nil function. t is the opposite. We just return the atom t. equal exploits perl's built-in string comparison operator eq to compare two things.

Now, the good stuff. List manipulation:

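use Carp qw(confess);        # confess() is used by car and cdr below
use List::Util qw(reduce);   # reduce() is used by list below

sub cons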
{
    my ($thing, $list) = @_;
    return Cell->new(car => $thing, cdr => $list);
}

sub list
{
    reduce { cons($b, $a) } (nil, reverse @_);
}

sub car
{
    my $thing = shift;
    confess "Argument to car must be a list"
        unless ref($thing) && ref($thing) eq 'Cell';
    return defined($thing->car()) ? $thing->car() : nil;
}

sub cdr
{
    my $thing = shift;
    confess "Argument to cdr must be a list"
        unless ref($thing) && ref($thing) eq 'Cell';
    return defined($thing->cdr()) ? $thing->cdr() : nil;
}

cons creates new Cells, setting their car and cdr as appropriate. The list function is a pure convenience thing to make setting up singly-linked lists easy. car and cdr do a small amount of error checking and call out to the given Cell's car() and cdr() methods.

Functions also defines some functions that will be used later, as well as some things that can walk lists and trees made from cons cells and do something with them. It implements a list_string function which will be imported as the (print) function, once we have symbol tables and function importing defined.

There are a bunch of tests for these functions in 01listmanipulation.t.

Reader

Lisp's parser is referred to as the reader. Generally you interact with it using the (read) function, which pulls off of the input stream and returns the next parsed form as an AST. This reader consists of a hand-rolled recursive descent parser in Read.pm that implements these constraints:

Numbers consist only of numeric characters and decimal points.
Symbols start with [a-zA-Z] and can contain anything within that range, as well as numbers, the ':' character, underscores, and dashes.
String literals start and end with the '"' character. Escaping is not implemented yet.
Lists start with '(', end with ')', and contain one or more whitespace-delimited things.
Whitespace is skipped.

This most basic of readers is only 113 lines of perl, but it can parse a string of characters that look like lisp and turn it into a tree of cons cells, ready to be evaluated. Tests and examples can be found in 02_read.t.
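To give a feel for how little a reader with those constraints needs, here is a minimal sketch of one. It is in Python, not perl, and it is not a port of Read.pm: it tokenizes first and then parses, it returns plain Python lists instead of cons cells, and it accepts empty lists. The names tokenize, parse, read, and Symbol are all made up for this example.

```python
import re

# One token per match: parens, numbers, symbols, or unescaped string literals.
TOKEN_RE = re.compile(r"""
    \s*(?:
        (?P<open>\()
      | (?P<close>\))
      | (?P<number>[0-9.]+)
      | (?P<symbol>[a-zA-Z][a-zA-Z0-9:_-]*)
      | "(?P<string>[^"]*)"
    )""", re.VERBOSE)


class Symbol(str):
    """A symbol, kept distinct from a string literal."""


def tokenize(text):
    """Split text into (kind, value) pairs, skipping whitespace."""
    text = text.rstrip()
    tokens, pos = [], 0
    while pos < len(text):
        match = TOKEN_RE.match(text, pos)
        if match is None:
            raise SyntaxError(f"unexpected character at position {pos}")
        tokens.append((match.lastgroup, match.group(match.lastgroup)))
        pos = match.end()
    return tokens


def parse(tokens):
    """Consume one form from the front of tokens by recursive descent."""
    if not tokens:
        raise SyntaxError("unexpected end of input")
    kind, value = tokens.pop(0)
    if kind == "open":
        items = []
        while tokens and tokens[0][0] != "close":
            items.append(parse(tokens))
        if not tokens:
            raise SyntaxError("missing ')'")
        tokens.pop(0)  # discard the ')'
        return items
    if kind == "close":
        raise SyntaxError("unexpected ')'")
    if kind == "number":
        return float(value) if "." in value else int(value)
    if kind == "symbol":
        return Symbol(value)
    return value  # string literal


def read(text):
    """Parse the first form found in text."""
    return parse(tokenize(text))


print(read('(define x-1 (add 1 2.5 "hi there"))'))
```

The nested lists it returns play the same role as the tree of cons cells: something an evaluator can walk.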

Well, that's all for now. It's a good start, but doesn't really deal with any of the interesting bits yet. Next up: (eval).

Managing Your Processes with ProcLaunch.

2010-08-08T12:30:37+00:00

Edit 2010-08-08: ProcLaunch now has a CPAN-compatible install process. See below for details.

I finally got the chance to work some more on proclaunch, my implementation of a user space process manager, like runit or mongrel or god. I wrote up a big overview of the currently available options previously, but in summary: all of the existing options suck. They're either hard to set up, have memory leaks, have a weird configuration language, or are just plain strange. The only viable option was procer, and even that was just sort of a tech demo put together for the Mongrel2 manual.

That's why I started putting together proclaunch. I need some of the features of runit, namely automatic restart, with none of the wackiness, and I wanted it to be easy to automatically configure. I also wanted it to be standalone so I wouldn't have to install a pre-alpha version of Mongrel2 just to manage my own processes.

What of it?

Grab the latest version off of github, unpack it, and run this in the unpacked directory:

$ perl Build.PL
$ ./Build
$ ./Build install

If everything went smoothly you'll have proclaunch somewhere in your path. Now, fire it up:

$ mkdir -p /path/to/some/state/directory
$ sudo proclaunch \
    --debug \
    --foreground \
    /path/to/some/state/directory \
    example_profiles/

If everything goes according to plan, you'll see a bunch of debug info scroll past showing that it scanned the profiles directory, found one called sleeper, and kicked it off. Then, every five seconds you'll see it rescan. If you look in your process list for sleep you'll see bash happily kicking off a sleep 10 in an infinite loop as the nobody user. Now, run this:

$ sudo kill `cat /path/to/some/state/directory/proclaunch.pid`

You should still see the sleep going on, but proclaunch shouldn't show up anywhere. If you launch proclaunch again, you'll see it start up but never start sleeper, since it's already running. This may seem really mundane, but you can't make runit behave this way without some major hacks. Oh, and to actually make proclaunch kill everything before dying, kill it with -HUP:

$ sudo kill -HUP `cat /path/to/some/state/directory/proclaunch.pid`

Now for the automatic restart. Change something about the profiles directory:

$ touch example_profiles/

In the log you should see that proclaunch saw something changed and rescanned immediately. Now change something about sleeper:

$ touch example_profiles/sleeper

Within a few seconds, proclaunch will notice that something happened and restart sleeper. Specifically, it will send sleeper's pid a SIGTERM, wait up to 7 seconds for it to actually die, and then send it a SIGKILL. Now something a little more drastic:
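The stop sequence (polite signal, bounded wait, then the hammer) looks roughly like this when sketched in Python. This is an illustration for POSIX systems, not proclaunch's perl: the function names and the polling interval are assumptions, and the child-reaping step is there only so the sketch also behaves when the target happens to be a direct child rather than a daemon.

```python
import os
import signal
import time


def pid_alive(pid):
    """Return True if a process with this pid still exists."""
    try:
        # Reap the process if it is our own child, so a zombie is not
        # mistaken for a live process.
        reaped, _ = os.waitpid(pid, os.WNOHANG)
        if reaped == pid:
            return False
    except ChildProcessError:
        pass  # not our child; fall through to the signal-0 probe
    try:
        os.kill(pid, 0)  # signal 0 checks for existence without signalling
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # exists, but is owned by another user
    return True


def stop_process(pid, timeout=7.0, poll_interval=0.1):
    """Send SIGTERM, wait up to timeout seconds, then fall back to SIGKILL."""
    os.kill(pid, signal.SIGTERM)
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if not pid_alive(pid):
            return "terminated"
        time.sleep(poll_interval)
    os.kill(pid, signal.SIGKILL)
    return "killed"
```

Calling stop_process(pid) returns "terminated" if the process exited within the grace period and "killed" if it had to be forced.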

$ sudo mv example_profiles/sleeper example_profiles/sleeper2

proclaunch will notice that sleeper is gone, tell it to stop, then start sleeper2 since it obviously isn't running. You can use this to set up really simple deploys, especially if you're deploying with Capistrano. Just commit your profiles directory to version control and point proclaunch at that directory in the current symlink, making sure that the pid_file is within the deploy directory somewhere. Within 5 seconds of your deploy, proclaunch will see that the inode on the profiles directory changed.

What is a profile, anyway?

If you look in the sleeper directory, you'll see this set of files:

run
pid_file
user

run is a small script that proclaunch expects to execute and have it return in short order, having backgrounded itself and written its pid to the path contained in pid_file. This forms the core of both proclaunch and procer. Really simple to set up and automate, since there isn't any complicated config file to manage. The user file is special to proclaunch, and tells it what user to start run as. By default, proclaunch will start run as root, which is generally not what you want. procer can do some fun things that proclaunch can't do yet, like manage dependencies between profiles. If there's any demand I'll work on adding that but I don't currently need it.
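Because a profile is just a directory of small files, reading one takes only a few lines. Here is a Python sketch of the idea, under assumptions: the function name load_profile and the dictionary it returns are invented for this example and say nothing about how App::ProcLaunch::Profile is actually structured.

```python
from pathlib import Path


def load_profile(profile_dir):
    """Read a profile directory laid out as run / pid_file / user."""
    profile_dir = Path(profile_dir)
    run = profile_dir / "run"
    if not run.is_file():
        raise FileNotFoundError(f"{run} is missing")
    # pid_file holds the path where the run script will write its pid.
    pid_file = (profile_dir / "pid_file").read_text().strip()
    # user is optional; without it the profile runs as root.
    user_file = profile_dir / "user"
    user = user_file.read_text().strip() if user_file.is_file() else "root"
    return {
        "name": profile_dir.name,
        "run": str(run),
        "pid_file": pid_file,
        "user": user,
    }
```

Pointing it at the sleeper directory would give back the profile name, the run script path, the pid file location, and the user to run as.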

A small digression into Mac OS X

Initially I wanted to use Privilege::Drop from CPAN to drop privileges when spawning profiles. It's a really clean pure perl module that has no dependencies other than perl itself. It even does a bunch of sanity checking to ensure that the privileges you dropped to are specifically what you wanted to drop to. However, on OS X with perl 5.10, it seems that you can't drop a large number of auxiliary groups that Privilege::Drop doesn't know about, at least not in the way that it's currently written. That's why the code for dropping privileges is inlined in App::ProcLaunch::Profile. It still checks to make sure that the group you tried to drop to is in the list, but it doesn't assert the list matches exactly what you wanted to do.

Blog Generator Updates

2010-08-06T23:04:47+00:00

I've made some small changes to the way bugsplat.info is generated. First, I refactored publish.pl quite extensively. Instead of being a huge mess of spaghetti-perl, it's nicely factored out into functions, each one doing as little as possible. It got a little longer, but I think it's worth the tradeoff in readability.

Second, I added self-generated shortlinks. Each post on the site has an internal id, which is actually a monotonically increasing sequence number. The short link for a post is http://bugsplat.info/<id>. For this post, it's http://bugsplat.info/13. These are implemented as mod_rewrite rules in .htaccess which are generated using a template, just like every other piece of content on the site.

Third, I wrote a new convenience script named next-entry.pl, the idea for which I shamelessly stole from technosorcery. Basically, it'll prompt me for a post title using bash's read function, then generate a URL and some date strings, as well as comb through the entries/ directory to find the highest id, then increase it by one. It writes all this to a file and then opens emacsclient right at the correct spot to start typing an entry.

Fourth, I worked on the CSS a little bit. Hopefully it looks a little snazzier than it did before.

Daemons are Our Picky, Temperamental Friends

2010-08-01T18:09:00+00:00

Modern web applications are complicated beasts. They've got database processes, web serving processes, and various tiers of actual application services. The first two generally take care of themselves. PostgreSQL, MySQL, Apache, Nginx, lighttpd, they all have well-understood ways of starting and keeping themselves up and running.

But what do you do if you have a bunch of processes that you need to keep running that aren't well understood? What if they're well-understood to crash once in a while and you don't want to have to babysit them? You need a user space process manager. Zed Shaw seems to have coined this term specifically for the Mongrel2 manual, and it describes pretty accurately what you'd want: some user-space program running above init that can launch your processes and start them again if they stop. Dropping privileges would be nice. Oh, and it'd be cool if it were sysadmin-friendly. Oh, and if it could automatically detect code changes and restart that'd be nifty too.

There are quite a few of these things out there, and as Zed points out all of them suck to various degrees. Here's a list of just a few that I've come across.

runit

We actually use runit at work quite a bit. It's... interesting. Essentially you control it through specially-laid-out directories full of named pipes and control files and whatnot. The learning curve is rather steep, especially since it cannot control things that are already daemons, which flies in the face of everything Unix. It's also bizarrely difficult to get started, since it can't daemonize itself.

God

God is a process manager written in ruby. You configure everything with an internal ruby DSL and it takes care of the rest. It'll even kill things when they start taking up too much memory, which is nice, and it looks pretty extensible as far as adding new conditions. It also has a really nice notifications system, with built-in emailing and twittering and campfiring, if that's your thing. Unfortunately, it also looks kind of complicated. You have to have ruby loaded, you have to write your config in ruby, and its way of loading configs is sort of weird. Oh, and it has memory leaks.

bluepill

Bluepill was written in reaction to god's shortcomings. It's also written in ruby, it's got a ruby DSL, but some things are slightly different. Mostly it's similar to God but without the memory leak, and without the nice notification support.

monit

The industrial-sized solution, monit seems to compete in the same space as Nagios, except with process management tacked on. Big web interface, mostly for whole-system management. I haven't personally tried it.

supervisord

Written in python, supervisord looks more like what we're looking for. It's specifically written for tracking application-level processes. I haven't personally tried it but I've heard nice things. However, the config system looks pretty intimidating, and it doesn't look to have a nice system for managing dynamic configs.

procer

Procer is what started me on this whole adventure. After struggling with runit for almost an entire week, procer was a breath of fresh air. It is structured in the same way as runit, as a directory full of directories full of files. The most basic config is just a directory containing a run script that daemonizes and writes a pid to the path that the pid_file file contains. Procer can also handle dependencies between services, which is nice if process A just has to be running for process B to even start.

Of all of these, procer seems like the easiest to understand and get going with. However, it's sort of a side project inside of the mongrel2 effort and was written specifically for the manual. It doesn't really handle the code changing underneath it. You have to kill off your processes and let procer restart them for you. Also it depends on a core library from mongrel2, which doesn't really make it suitable for other uses.

That being said, I started rolling my own user space process manager yesterday. It's called proclaunch, and it's heavily inspired by procer. Right now it's mainly just a toy. It can launch and restart processes and maintain pid files, but it has no idea how to drop privileges or restart when something changes. Written in core perl with no external dependencies, it should eventually be suitable at least for my specific use cases, and hopefully it will be for yours too.

Data Mining "Lost" Tweets

2010-06-02T17:45:00+00:00

Note: this article uses the Twitter V1 API which has been shut down. The concepts still apply but you'll need to map them to the new V2 API.

As some of you might know, Twitter provides a streaming API that pumps all of the tweets for a given search to you as they happen. There are other stream variants, including a sample feed (a small percentage of all tweets), "Gardenhose", which is a statistically sound sample, and "Firehose", which is every single tweet. All of them. Not actually all that useful, since you have to have some pretty beefy hardware and a really nice connection to keep up. The filtered stream is much more interesting if you have a target in mind. Since there was such a hubbub about "Lost" a few weeks ago I figured I would gather relevant tweets and see what there was to see. In this first part I'll cover capturing tweets and doing a little basic analysis, and in the second part I'll go over some deeper analysis, including some pretty graphs!

Capturing

Let me preface: I have never watched a single episode of "Lost". When it started I had way too much stuff going on to pay attention to television and since then I've sort of consciously stayed away. I pass no judgements on anyone who is a fan or not, or who is evil or not.

The streaming API is pretty easy to work with. You basically give it a comma separated list of search terms and it will give you any and all tweets that match those terms. For example, if you were to run this command:

$ curl -q http://stream.twitter.com/1/statuses/filter.json\?track=bpcares \
    -uYourTwitterName:YourTwitterPass

you would get a stream of semi-humorous tweets about the oil spill.

I wrote a little perl wrapper around curl which will automatically stop capturing after a given number of hours or once it has captured a given number of megabytes, whichever comes first. It will also reconnect when the stream dies for any reason. To capture a workable number of tweets, I launched this script on May 23rd at 4:14pm PDT like this:

$ capture-tweet-stream.pl 24 10000 ~/data/lost-finale-tweets.txt \
    'lost,locke,jack,sawyer,smokemonster,theisland,jacob,shepard'

This means: capture any tweets matching those terms for 24 hours or 10 GB (10,000 megabytes), whichever comes first.

A little analysis

For a while as I was running the capture I was tailing the output file and would pause the output whenever a gem of a tweet scrolled past, just so I could retweet it. Here's my favorite:

I hope Dexter shows up on Lost and kills them all. #FuckLost
— Ed Battes (@EdBattes) May 24, 2010

I happen to be a fan of Dexter, and would have gladly paid money for a crossover. Anyway.

If you want to play along the data is on my dropbox and the code is all on github. First, let's get an idea of how much raw data we're working with. Twitter sends carriage-return separated JSON blobs. Awk to the rescue!

$ gzcat lost-finale-tweets.txt.gz | awk 'BEGIN{RS="\r"}{n+=1}END{print n}'
779750
$

Almost 780,000 tweets. Tweeps were busy! Ok, so what were they saying? A normal approach would be to run through all of the tweets and count up occurrences of each word, but because there's so much output I can't do it on my laptop or I'd run out of memory. Instead, here's a map and two-stage reduce process. The map is a fairly small perl script that everyone and their mother can pretty much write from memory, the word count mapreduce example:

#!/usr/bin/env perl

use strict;
use warnings;
use JSON::XS qw/ decode_json /;
use Try::Tiny;

$/ = "\r";
binmode(STDIN, ':utf8');
binmode(STDOUT, ':utf8');

while(<>) {
    my $obj;
    try {
        $obj = decode_json($_);
    } catch { };

    next unless $obj;
    my $text = $obj->{text};
    next unless $text;
    $text =~ s/[^\w\d#\s]//g;
    my @w = split(/\s+/, lc $text);
    for my $i ( 0 .. $#w ) {
        print_if_all(1, $w[$i]);
        print_if_all(2, @w[$i..$i+1]);
        print_if_all(3, @w[$i..$i+2]);
    }
}

sub print_if_all
{
    my $n = shift;
    @_ = grep { $_ } @_;
    print join(' ', @_) . "\t1\n" if @_ == $n;
}

This one has a few modifications, though. First, it removes all punctuation except '#' and lowercases everything. Second, it will count each individual word as well as each two and three word phrase in the tweet. We can run it like this:

$ gzcat lost-finale-tweets.txt.gz | ./stem.pl | split -l 1000000 - output/out.txt
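If perl isn't your thing, here is roughly the same map step as a Python sketch. It is an approximation rather than a port: the function names ngrams and map_stream are made up for this example, and it keeps a literal "0" token that the perl version would drop.

```python
import json
import re
from typing import Iterator


def ngrams(text: str, max_n: int = 3) -> Iterator[str]:
    """Yield every 1- to max_n-word phrase from one tweet's text."""
    # Drop everything except word characters, '#', and whitespace, then lowercase.
    cleaned = re.sub(r"[^\w#\s]", "", text).lower()
    words = cleaned.split()
    for i in range(len(words)):
        for n in range(1, max_n + 1):
            # Only emit a phrase if all n words actually exist.
            if i + n <= len(words):
                yield " ".join(words[i:i + n])


def map_stream(raw: str) -> Iterator[str]:
    """Turn carriage-return separated JSON blobs into 'phrase<TAB>1' lines."""
    for blob in raw.split("\r"):
        try:
            obj = json.loads(blob)
        except ValueError:
            continue  # skip partial or empty records
        text = obj.get("text") if isinstance(obj, dict) else None
        if not text:
            continue
        for phrase in ngrams(text):
            yield f"{phrase}\t1"
```

Each tweet turns into a pile of "phrase, tab, 1" lines, which is exactly the shape the reduce step wants.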

The reduce happens in two phases, both using this even smaller perl script that just sums the output from the first one:

#!/usr/bin/env perl

use strict;
use warnings;

my %sum;
binmode(STDIN, ':utf8');
binmode(STDOUT, ':utf8');

while(<>) {
    chomp;
    my ($key, $num) = split(/\t/, $_);
    $sum{$key} += $num;
}

print join("\t", $_, $sum{$_}) . "\n" for keys %sum;

Which we run like this:

$ find output -type f -exec ./sum.pl {} \; | ./sum.pl | sort -t $'\t' -k 2,2nr > stems.txt

Sort of like a poor man's Hadoop, no? No, you're right. Not really. But it gets the job done, and that's what counts.
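To see why summing twice works, here is the reduce as a small Python sketch (the names sum_counts and format_counts are invented for this example). The first pass sums each chunk on its own, and the second pass sums those partial sums, which gives the same totals as summing everything at once but never needs the whole data set in memory.

```python
from collections import Counter
from typing import Iterable, List


def sum_counts(lines: Iterable[str]) -> Counter:
    """Add up the numeric column of 'key<TAB>count' lines, per key."""
    totals: Counter = Counter()
    for line in lines:
        key, _, num = line.rstrip("\n").partition("\t")
        if num:
            totals[key] += int(num)
    return totals


def format_counts(totals: Counter) -> List[str]:
    """Write totals back out in the same 'key<TAB>count' shape."""
    return [f"{key}\t{n}" for key, n in totals.items()]


# Phase one: reduce each chunk separately. Phase two: reduce the partial sums.
chunk_a = ["lost\t1", "finale\t1", "lost\t1"]
chunk_b = ["lost\t1", "the end\t1"]
partial = format_counts(sum_counts(chunk_a)) + format_counts(sum_counts(chunk_b))
final = sum_counts(partial)
ranked = sorted(final.items(), key=lambda kv: -kv[1])
```

Because the output format of the reducer is the same as its input format, you can stack as many phases as you like.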

Ok, so now we have our word counts. Here are the top 27 words and phrases that people mentioned in these tweets after removing really common English words:

lost    104181
#lost   53188
finale  25322
watching    11204
de lost 10475
tonight 9588
final   9107
lost finale 9101
series  9000
watch   8105
series finale   7487
the lost    7444
jack    5747
episode 5507
lost series 5062
end 4806
lost series finale  4696
watching lost   4179
the lost finale 3631
final de lost   3519
the end 2981
to watch    2964
watching the    2920
spoiler 2804
love    2768
the finale  2765
#lost finale    2579

In amongst all the tiny junk words, we have some really nice indicators that we can use in the next phase to filter to just the tweets that are actually talking about Lost the TV show vs. their lost kitten named Mittens. Interestingly, the phrase "you all everybody" only showed up 67 times. Sad.

Iterating Elements in boost::tuple, template style

2010-05-30T18:15:00+00:00

In my day job I use a mix of perl and C++, along with awk, sed, and various little languages. In our C++ we use a lot of boost, especially simple things like the date_time libraries and tuple. Tuple is a neat little thing, sort of like std::pair except it lets you have up to 10 elements of arbitrary type instead of just the two. One of the major things that it gives you is a correct operator<, which gives you the ability to use it as a key in std::map. Very handy. One tricky thing, though, is generically iterating over every element in the tuple. What then?

It's easy to get at individual elements when you know how many there are and what their types are:

typedef tuple<int, string, bool> delicious_tuple;
delicious_tuple foo(1, "hi", false);

// get<N>(tuple_type) gives you a reference to the Nth element
cout << get<0>(foo) << endl
     << get<1>(foo) << endl
     << get<2>(foo) << endl;

But what if you don't know those things? A really common situation where this comes up is serialization, where you have a diverse set of tuples and you don't want to write a whole bunch of glue code. tuple overloads operator<< and operator>> for ostreams and istreams, which by default write and read a parenthesized, space-separated text form:

delicious_tuple foo(2, "there", true);
cout << foo << endl; // prints "(2 there 1)"

Sometimes that just doesn't cut it, though. If you want to serialize to JSON or XML or something, you have to be able to generically get at each element. You could write a macro using the boost preprocessor or just by itself, but that's kinda lame. You could dig into the guts of tuple, which is actually just a compile-time set of cons cells, but that gets complex. Let's break out a little template metaprogramming and see where we get:

template<typename tuple_type, typename F, int Index, int Max>
struct foreach_tuple_impl {
    void operator()(tuple_type & t, F f) {
        f(boost::get<Index>(t), Index);
        foreach_tuple_impl<tuple_type, F, Index + 1, Max>()(t, f);
    }
};

template<typename tuple_type, typename F>
void foreach_tuple_element(tuple_type & t, F f)
{
    foreach_tuple_impl<
        tuple_type,
        F,
        0,
        boost::tuples::length<tuple_type>::value - 1
    >()(t, f);
}

Simple, right? Let's start at the bottom. foreach_tuple_element takes any old tuple and any old function as arguments. It then instantiates a foreach_tuple_impl with those arguments, as well as two additional template arguments. First, a 0, which is the index to start iterating at. Second, the length of the tuple minus one, which we'll get to in a second. foreach_tuple_impl calls f with the value at index Index using boost::get<Index>(t) and then recursively calls itself with Index + 1. Great! Done! Time for a beer and a bratwurst and a happy Memorial Day!

Compile that, though, and you'll notice a little problem. Namely that the compiler will never actually finish. It'll spin faster and faster, spewing an infinite stream of error messages to stderr. In order to actually stop the recursion you'll need to add one more functor:

template<typename tuple_type, typename F, int Max>
struct foreach_tuple_impl<tuple_type, F, Max, Max> {
    void operator()(tuple_type & t, F f) {
        f(boost::get<Max>(t), Max);
    }
};

This gets called when Index and Max are the same number and does not recurse. Now you can use foreach_tuple_element like so:

struct print_element
{
    template<typename T>
    void operator()(const T & t, const int index)
    {
        cout << index << ": " << t << endl;
    }
};

...

delicious_tuple strawberry(10, "chocolate", true);
foreach_tuple_element( strawberry, print_element() );

// prints this:
//    0: 10
//    1: chocolate
//    2: 1    (bools print as 1 unless std::boolalpha is set)

The fact that it's a recursive solution involving templates might scare some people off, but because the recursion is expanded at compile time and tuples are guaranteed to have at most 10 elements it's pretty safe to say you're not going to blow the stack. Is there a better way to do this? Probably. This was a fun diversion to go with my Sunday morning bagel and coffee, though.

Note: this was inspired by a forum post in this German c++ forum but since I can't read German I had to puzzle it out and I thought I'd share.

Everyone Needs Goals

2010-05-27T20:37:00+00:00

Creating actionable information out of raw data is sometimes pretty simple, requiring only small changes. Of the few feature requests that I've received for Calorific, most (all) of them have been for goals. Always listen to the audience, that's my motto!

With the latest version you can set up goals like this:

- goals: 
    - kcal:    2200
    - protein: [ 100, 200 ]

- 2010-05-27 breakfast:
    - 1000 kcal
    - 25 protein

- 2010-05-27 lunch:
    - 850 kcal
    - 50 protein

- 2010-05-27 lunch:
    - 500 kcal
    - 50 protein

This example is super simplified, of course, but you can see how it works. Create an entry with the special name goals, with one component for each nutrient you have a goal for. The value of each component is either a single number, which will be taken as a maximum, or a two element range.

Right now these are displayed by changing the color of the values on aggregate reports (daily and weekly). Red means "outside the range" and green means "inside the range".

$ calorific
2010-05-27 <total>                2350 kcal
                                   125 prot

Colors are done using Term::ANSIColor, which is included in core perl. Adding them was fairly easy, because of a simple function colored, which takes a scalar or an arrayref of scalars and a color argument and returns the scalar wrapped in the correct ANSI codes. Future display options could be displaying how much of each nutrient you have left for the day, maybe in a little progress bar type thing. Feature requests and comments are welcome, as always.
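The goal check itself boils down to a couple of lines. Here it is as a Python sketch rather than the actual perl; within_goal and colorize are names invented for this example, and it assumes that both ends of a range are inclusive and that a single-number maximum has an implied minimum of zero.

```python
from typing import Sequence, Union

Goal = Union[float, Sequence[float]]


def within_goal(total: float, goal: Goal) -> bool:
    """A bare number is a maximum; a two-element sequence is a range."""
    if isinstance(goal, (int, float)):
        low, high = 0, goal  # assumption: a maximum implies a floor of zero
    else:
        low, high = goal
    return low <= total <= high  # assumption: both ends are inclusive


def colorize(total: float, goal: Goal) -> str:
    """Wrap the value in ANSI green (inside the goal) or red (outside)."""
    green, red, reset = "\033[32m", "\033[31m", "\033[0m"
    color = green if within_goal(total, goal) else red
    return f"{color}{total}{reset}"
```

With the sample file above, 2350 kcal against a maximum of 2200 comes out red and 125 protein against a range of 100 to 200 comes out green.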

Program your Finances: Command-line Accounting

2010-05-23T15:15:00+00:00

Note: you can find much more information about ledger on ledger-cli.org, including links to official documentation and other implementations

About three years ago I was in some serious financial straits. I had just started my first job out of college that I had moved across the country for and had to bootstrap almost my whole life. This meant buying furniture, buying a car, outfitting a kitchen, etc. Every two weeks I would get a salary deposit, and within two weeks it would be almost completely gone from my checking account. I actually bounced a rent check or two in there. After the second time that happened I vowed it wouldn't happen again and started keeping track of every penny that I spent using a program called ledger. This was, in hindsight, exactly what I needed to get myself back on track. Actually seeing money moving in and out of my accounts forced me to modify my behavior. At the time, Mint wasn't around, but I don't think it would have helped nearly as much. Forcing myself to actually type out the transactions was the key to changing behavior.

Ledger is almost the most boring, austere accounting program you could think of. There are no pretty graphs, no online interaction, no GUI of any sort. It's basically a command-line driven calculator with a lot of specializations that make it ideal for tracking finances, and that austerity is exactly what makes it such a good fit for someone who spends a lot of time inside a text editor. It's very easy to script around and it has a very rich query language that lets you get at the data that you want with a minimum of fuss. It's very much the inspiration for Calorific.

The basic idea is that you write down all of your financial transactions in a text file with an easy-to-master syntax and then run the ledger command on them to generate reports. Here's a simplified extract from my ledger file:

2010/05/20 * Opening Balances
  Assets:Checking                          $500.00
  Liabilities:Amex                         $-10.00
  Equity

2010/05/21 * Salary
  Assets:Checking                        $1,000.00
  Expenses:Taxes:Federal                   $250.00
  Expenses:Taxes:State                     $100.00
  Expenses:Taxes:Social Security            $80.00
  Expenses:Insurance:Medical                $20.00
  Expenses:Insurance:Dental                  $2.00
  Income:Salary                         $-1,452.00

2010/05/21 Rent
  Expenses:Rent                            $600.00
  Assets:Checking

2010/05/21 Pacific Power
  Expenses:Utils:Electric                   $61.75
  Assets:Checking

2010/05/21 * AT&T Wireless
  Expenses:Cell Phone                       $88.46
  Assets:Checking

2010/05/22 NW Natural
  Expenses:Utils:Gas                        $20.31
  Assets:Checking

2010/05/22 Pizzicato
  Expenses:Food:Lunch                        $7.90
  Assets:Checking

2010/05/23 Comcast
  Expenses:Cable                            $60.00
  Liabilities:Amex

This is actually a complete ledger file (you can download it here) that illustrates a few key points. First, ledger is a double-entry accounting system. Every entry has at least one from and at least one to. Generally, the first line of the entry is where the money goes to, and it's a positive amount, with the second line being where the money comes from. If you leave off the amount of one of the lines ledger will automatically fill it in and make the entry balance. If you have an accounting background you can think of from and to in terms of debits and credits, but ledger doesn't force that. Second, accounts have a hierarchical namespace, which we can see like this:

$ ledger -f ledger.sample.txt -s bal
     $721.58  Assets:Checking
    $-490.00  Equity
   $1,290.42  Expenses
      $60.00    Cable
      $88.46    Cell Phone
       $7.90    Food:Lunch
      $22.00    Insurance
       $2.00      Dental
      $20.00      Medical
     $600.00    Rent
     $430.00    Taxes
     $250.00      Federal
      $80.00      Social Security
     $100.00      State
      $82.06    Utils
      $61.75      Electric
      $20.31      Gas
  $-1,452.00  Income:Salary
     $-70.00  Liabilities:Amex

This arrangement of accounts helps to maintain some sanity when dealing with lots of accounts, and it jibes with the basic accounting equation: assets = liabilities + equity + (income - expenses). You'll notice that accounts just appear when you use them, sort of like variables in perl without use strict;. This is both a blessing and a curse, because sometimes it's not obvious that you're misspelling things until you run reports and they look funny. If you use emacs, the risk of messing up is mitigated by the bundled ledger.el major mode, which sets up tab completion for you.
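The two ideas here, filling in the missing amount so an entry balances and rolling sub-accounts up into their parents, fit in a short Python sketch. This is only an illustration of the concepts, not how ledger is implemented; balance_entry and rollup are hypothetical names, and amounts are plain Decimals with no currency handling.

```python
from collections import defaultdict
from decimal import Decimal
from typing import Dict, List, Optional, Tuple

Posting = Tuple[str, Optional[Decimal]]  # (account, amount or None if left off)


def balance_entry(postings: List[Posting]) -> List[Tuple[str, Decimal]]:
    """Fill in at most one missing amount so the entry sums to zero."""
    known = sum((amt for _, amt in postings if amt is not None), Decimal(0))
    missing = [acct for acct, amt in postings if amt is None]
    if len(missing) > 1:
        raise ValueError("only one amount may be left off per entry")
    if not missing and known != 0:
        raise ValueError("entry does not balance")
    return [(acct, amt if amt is not None else -known) for acct, amt in postings]


def rollup(postings: List[Tuple[str, Decimal]]) -> Dict[str, Decimal]:
    """Total each account and every parent above it in the hierarchy."""
    totals: Dict[str, Decimal] = defaultdict(Decimal)
    for account, amount in postings:
        parts = account.split(":")
        # 'Expenses:Utils:Gas' also counts toward 'Expenses:Utils' and 'Expenses'.
        for depth in range(1, len(parts) + 1):
            totals[":".join(parts[:depth])] += amount
    return dict(totals)
```

Since every entry sums to zero, the top-level accounts always sum to zero as well, which is a handy sanity check.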

Again using the example file, we can run some more detailed reports. For example, here's our checkbook register:

$ ledger -f ~/Documents/blog/static/ledger.sample.txt -r reg checking
2010/05/20 Opening Balances     Liabilities:Amex             $10.00       $10.00
                                Equity                      $490.00      $500.00
2010/05/21 Salary               Expenses:Taxes:Federal     $-250.00      $250.00
                                Expenses:Taxes:State       $-100.00      $150.00
                                Ex:Ta:Social Security       $-80.00       $70.00
                                Ex:Insurance:Medical        $-20.00       $50.00
                                Ex:Insurance:Dental          $-2.00       $48.00
                                Income:Salary             $1,452.00    $1,500.00
2010/05/21 Rent                 Expenses:Rent              $-600.00      $900.00
2010/05/21 Pacific Power        Ex:Utils:Electric           $-61.75      $838.25
2010/05/21 AT&T Wireless        Expenses:Cell Phone         $-88.46      $749.79
2010/05/22 NW Natural           Expenses:Utils:Gas          $-20.31      $729.48
2010/05/22 Pizzicato            Expenses:Food:Lunch          $-7.90      $721.58

Ledger will abbreviate account names as necessary when printing to make it fit in 80 columns. If you have a wider terminal you can pass the -w option to make it fit to 132 columns.

The power of ledger really comes into focus when you have more data available. One of the most interesting reports that I run gives me an idea of how I'm doing month-to-month by showing how much my assets have changed (negative numbers are better, in this case): ledger -MAn reg income expenses liabilities. The -M option groups transactions by month, -A will show the running average in the second column. By default it will show the running total. -n will group all transactions together, instead of showing one subtotal for each account. It's sort of boring with the sample file, though:

$ ledger -f ~/Documents/blog/static/ledger.sample.txt -MAn reg income expenses
2010/05/01 - 2010/05/23        <Total>                     $-161.58     $-161.58

In any of these examples you can change the output format to suit your needs. There are a lot of options here that are detailed in the manual (pdf), but here's one example. I have a little program in my bin directory called transpose, which takes three-column pipe-separated data and turns it into tab-separated values ready to be inserted into a spreadsheet. The first column is the row, the second column is the column, the third is the value to put in that cell. We can tell ledger to output, for example, a basic expense report formatted for transpose like this:

$ ledger -f ~/Documents/blog/static/ledger.sample.txt -F '%A|%D|%t\n' -M reg income expenses
Expenses:Cable|2010/05/01|$60.00
Expenses:Cell Phone|2010/05/01|$88.46
Expenses:Food:Lunch|2010/05/01|$7.90
Expenses:Insurance:Dental|2010/05/01|$2.00
Expenses:Insurance:Medical|2010/05/01|$20.00
Expenses:Rent|2010/05/01|$600.00
Expenses:Taxes:Federal|2010/05/01|$250.00
Expenses:Taxes:Social Security|2010/05/01|$80.00
Expenses:Taxes:State|2010/05/01|$100.00
Expenses:Utils:Electric|2010/05/01|$61.75
Expenses:Utils:Gas|2010/05/01|$20.31
Income:Salary|2010/05/01|$-1,452.00

With more data, this lets you easily compare month-to-month where you are spending money.
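My transpose program isn't shown here, but the idea is easy to sketch. The Python below is a guess at the behavior described above, not the real script: it assumes rows and columns come out in the order they are first seen, that the first output line is a header of column names, and that missing cells are left empty.

```python
from typing import Dict, Iterable, List


def transpose(lines: Iterable[str], sep: str = "|") -> str:
    """Pivot 'row|column|value' lines into a tab-separated grid."""
    cells: Dict[str, Dict[str, str]] = {}
    columns: List[str] = []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue
        row, col, value = line.split(sep, 2)
        cells.setdefault(row, {})[col] = value
        if col not in columns:
            columns.append(col)
    # Header row: an empty corner cell followed by the column names.
    out = ["\t".join([""] + columns)]
    for row, by_col in cells.items():
        out.append("\t".join([row] + [by_col.get(c, "") for c in columns]))
    return "\n".join(out) + "\n"
```

Fed the ledger output above, each account becomes a row and each month becomes a column, ready to paste into a spreadsheet.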

If you want to pull your financial life together but don't want to spend money on something like Quicken or trust Mint with your account credentials, I highly encourage you to try out ledger in addition to the other open source solutions like gnucash. On the other hand, if you're looking for something that does all this for you automatically, check out Personal Capital. I use it alongside my ledger files to help me track investments across all of my accounts, as well as help plan my retirement.

Building Battle Bots with Clojure

2010-05-16T23:00:00+00:00

Once in a while at Rentrak we have programming competitions, where anyone who wants to, including sysadmins and DBAs, can submit an entry for whatever the problem is. The previous contest involved writing a poker bot which had to play two-card hold'em, while others have involved problems similar in spirit to the Netflix Prize. This time we chose to build virtual robots that shoot each other with virtual cannons and go virtual boom! We'll be using RealTimeBattle, which is a piece of software designed specifically to facilitate contests of this sort. It's kind of like those other robot-battle systems, except instead of requiring you to write your robot in their own arbitrary, broken, horrible language, this lets you write your bot in any language that can talk on stdin and stdout.

Based on my previous entries the natural choice would be perl, right? I thought about it, actually. Started stubbing something out. Wrote some code to emulate enums and it worked on the first try, which brought to light the fact that I hadn't learned a new language in quite a long time and by not using a new language I was missing a golden opportunity. So, which language? The only real constraint that we, the Happy Fun Robot Times Killing Group, decided on was that it had to be easily installable on Ubuntu, which leaves the field pretty much wide open. Ruby? Already know it in passing. Python? Haven't done much with it for a few years but I don't think it's changed that much. Lisp? Hm. Intriguing. Clojure looks interesting, and it's a good chance to figure out multithreading.

The RealTimeBattle system is conceptually pretty simple. Your robot is a little doughnut-shaped thing that can go forward, backward, accelerate, brake, and turn. In addition, it has a big cannon and a radar system, both of which can rotate independent of the bot itself. The radar is the only sensor you can rely on, although in some configurations you'll get coordinates relative to your start position every few game ticks.

When the game starts, the system will start up your bot in a child process and attach to stdin and stdout, so from the bot's point of view it's just talking a simple text protocol. In perl, talking this protocol would be a trivial combination of while(<>){ } and print, but in clojure it seems to be a mite more complicated:

(loop []
  (let [in (read-line)]
    (if (not (nil? in))
      (do
        (println in)
        (recur)))))

Just writing that bit took me down about a dozen false starts, but I learned a whole lot about clojure in the process so I'm pretty sure it was worth it.

Ok, so now this little bot can listen, let's make it talk. RealTimeBattle has a command that your bot can send to the server to make it print out something in the message log. We can wrap that in a function like so:

(defn message [m & rest]
  (println (str "Print " m rest)))

and call that like this:

(message "Hi there my name is Botty McBotterson!")

The two other basic commands that I've implemented so far are Initialize, which will get sent when the system is ready to find out what name your bot has, and GameOption, which tells you all kinds of information about the environment that the bot lives in. Here's the whole program as it stands:

(def game-option-types [
  :robot_max_rotate
  :robot_cannon_max_rotate
  :robot_radar_max_rotate
  :robot_min_acceleration
  :robot_max_acceleration
  :robot_start_energy
  :robot_max_energy
  :robot_energy_levels
  :shot_speed
  :shot_min_energy
  :shot_max_energy
  :shot_energy_increase_speed
  :timeout
  :debug_level
  :send_robot_coordinates])

(def options (ref {}))

(defn message [m & rest]
  (println (str "Print " m rest)))

(defn robot-initialize [[first-round]]
  (if first-round
    (println "Name kabot")))

(defn robot-set-option
  [[option-number value]]
  (let [option-key (get
                    game-option-types
                    (Integer/parseInt option-number))
        option-val (Double/parseDouble value)]
    (dosync
     (alter options (fn [opts] (assoc opts option-key option-val))))
    (message (deref options))))

(defn process-input [m]
  (let [tokens        (seq (.split m " "))
        function-name (first tokens)
        args          (next tokens)]
    (message (str function-name " " args))
    (cond
      (= function-name "Initialize") (robot-initialize args)
      (= function-name "GameOption") (robot-set-option args)
      :else (message (str function-name " not implemented")))))

(loop []
  (let [in (read-line)]
    (if (not (nil? in))
      (do
        (process-input in)
        (recur)))))

This is pretty trivial at the moment. My basic design is to have the main thread deal with all of the I/O and updating a global state object, while another thread deals with analyzing this state and figuring out what to do. I haven't decided on any concrete strategies yet but for the first contest it'll probably be pretty stupid.

A few fun things to note: clojure provides very simple interop with Java classes and methods. For example, (.split m " ") calls the split method on m, which is actually just a Java String. The result of that is a String[], which isn't too useful in clojure so we immediately wrap it in a seq, which is sort of like a lazy cons list. Another example of this really trivial interop is the number parsing done in robot-set-option. I figured this out only after about an hour of thrashing about trying to figure out why passing a string as a vector index wasn't DWIMing like it does in perl. This is another example of why I need to do this project in another language. Perl has rotted my brain.

By the way, if there are things that I'm doing in this code that aren't idiomatic clojure, please correct me. I just started learning today, after all. I found a pretty good tutorial which has guided me through basic types and stuff, but shortly I'll be branching beyond that into threading and agents and other fun things that it doesn't cover very well.

Actionable Information

2010-05-12T08:53:00+00:00

Let's pretend, just for a second, that you want to make some money on the stock market. Sounds easy, right? Buy low, sell high, yadda yadda blah blah blah. Except, how do you know when to buy and when to sell? Not so easy. Being a nerd, you want to teach your computer how to do this for you. But where to start? I discovered a few months ago that there are services out there that will sell you a data feed that literally blasts every single anonymous transaction that happens on any market in the US in real time. They'll also sell you access to a historical feed that provides the same tick-level information going back for several years.

So, ok, you've got a whole lot of raw data. All kinds of fun problems come from having a huge glug of raw data, especially when you're getting blasted more of it every day. Where to store it, how to store it, how to index it so you can get at certain segments quickly, etc. Let's pretend that you've solved all of those and it's time to get to the meat of this exercise: figuring out when to buy. You write a little program that searches through your historical data looking for signs of business cycles in various sized companies and gives you the top five that you should buy and how long you should probably hold onto them. That list is actionable information that you created out of raw data that you can use to make some money. Maybe. Hopefully. If the world doesn't end. Again.

Ok, that's a pretty small example. Let's do something bigger. Let's pretend that you're Google. You have truckloads of cash just lying around waiting for you to do something with it. Hey, data centers are cool, why not build some new ones! But where?

You know immediate things, like where your users are coming from and where the bottlenecks in your network are that prevent them from looking at your sweet sweet ads. Now remember, you're Google. You have a large chunk of all accumulated human knowledge at your fingertips. In addition to that stuff that every good company would know, you also know, somewhere deep down in your giant cache, things like where the zoning codes are favorable, where you have private fiber connections to and from, where you can get cheap electricity, voting patterns, histories of war, riots, and famine for every location on the planet. Lots of data. So, you write some Sawzall programs that go out and mine all this data and give you back likely locations, ranked by 10-year projected return on investment, and then you build at the top five places. Done. Easy.

In my admittedly limited time as a professional developer I've learned that probably close to 2/3 of my job is figuring out ways to suss out actionable information from vast quantities of low-level data. Be it displaying graphs or maps on a web page in the most understandable way, or trolling through a billion television remote clicks to determine who watched the Today Show this morning, it all boils down to providing some information to someone that they can act on.

In my personal life I need to have actionable information once in a while too. Before today Calorific could only tell me exactly what I ate in its entirety or daily totals. That's somewhat useful, but sometimes I want to know what my weekly averages are, or limit the daily or detail reports to just a couple of days. To address those issues I added --begin and --end filters which will limit any report to just that day range. Specifying just one will leave the other end of the range open. Calorific parses dates using DateTime::Format::Natural, which means it does the right thing with basically any date format you throw at it, including relative dates like yesterday or 3 days ago. Also, I added a weekly report which prints daily averages for each week in the day range. This is the new default, which actually I'm not really sure about. Easy to change.
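The filtering and averaging logic is small enough to sketch in Python. This is an illustration of the idea rather than Calorific's perl, and it bakes in some assumptions that may not match the real thing: both ends of the range are inclusive, weeks start on Monday, and the average only counts days that actually have entries. The names in_range and weekly_daily_average are made up for this example.

```python
from collections import defaultdict
from datetime import date, timedelta
from typing import Dict, Iterable, List, Optional, Tuple

Entry = Tuple[date, float]  # (day, kcal)


def in_range(day: date, begin: Optional[date] = None,
             end: Optional[date] = None) -> bool:
    """True if day falls in the range; a missing bound leaves that side open."""
    return (begin is None or day >= begin) and (end is None or day <= end)


def weekly_daily_average(entries: Iterable[Entry],
                         begin: Optional[date] = None,
                         end: Optional[date] = None) -> Dict[date, float]:
    """Average the daily totals within each week, keyed by that week's Monday."""
    daily: Dict[date, float] = defaultdict(float)
    for day, kcal in entries:
        if in_range(day, begin, end):
            daily[day] += kcal
    weeks: Dict[date, List[float]] = defaultdict(list)
    for day, total in daily.items():
        monday = day - timedelta(days=day.weekday())
        weeks[monday].append(total)
    return {monday: sum(t) / len(t) for monday, t in sorted(weeks.items())}
```

Passing only begin or only end gives the open-ended behavior described above.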

The next features on the docket are goals, which will let you set goal ranges for each base nutrient, an option to show you each day total or week average against the goal, and a summary report that shows you how much of each nutrient you've eaten today and how close you are to your goals. Stay tuned!

Moose vs Mouse and OOP in Perl

2010-05-09T08:00:00+00:00

After using Calorific for a month two things have become very clear. First, I need to eat less. Holy crap do I need to eat less. I went on to SparkPeople just to get an idea of what I should be eating, and it told me between 2300 and 2680 kcal. I haven't implemented averaging yet, but a little grep/awk magic tells me I'm averaging 2793 kcal per day. This is too much. So. One thing to work on.

Second, in the morning after I come back from lifting, when I sit down to enter my breakfast, add two lines to my calories file:

- 2010-05-07 breakfast:
    - 1 workout breakfast (blues)

and then type calorific in my shell, it takes ages to start up. Literally several seconds on a cold cache. I was pretty sure that this was due to the fact that I use Moose to help me define the four classes that compose Calorific. Now, Moose is great. Before writing Calorific I had only used a really old version of Class::MethodMaker or Class::Struct to build classes. That or build them myself, which is always fun (FUN FACT blessed array refs are wicked fast if you can get away with them). Moose is sort of a revelation. In the simplest case, you can say

package Foo::Bar;

use Moose;

has [qw/ baz blah frob /] => (is => 'rw');

1;

And you have yourself a fully functional class with three properties with read-write accessors. Pretty snazzy. However, you can get way more advanced:

package Calorific;

use Moose;

has 'filename' => (
    is       => 'ro',
    required => 1,
);

has 'recipes'  => (
    is      => 'ro',
    traits  => [ 'Hash' ],
    isa     => 'HashRef',
    lazy    => 1,
    default => sub { {} },
    handles => {
        get_recipe => 'get',
        set_recipe => 'set',
    },
);

has 'entries' => (
    is      => 'rw',
    traits  => [ 'Array' ],
    isa     => 'ArrayRef',
    lazy    => 1,
    default => sub { [] },
    handles => {
        add_entries    => 'push',
        filter_entries => 'grep',
        num_entries    => 'count',
        all_entries    => 'elements',
        sorted_entries => 'sort',
    },
);

1;

This is directly from Calorific. It defines three properties: a read-only simple scalar named filename which is required to be present in the call to new(), a recipes property which contains a hash ref and gets two accessors, get_recipe and set_recipe, which you call like this:

$calorific_instance->set_recipe('foo', 'bar');
$calorific_instance->get_recipe('foo'); # returns 'bar'

In addition, it sets up one more property named entries which contains an array ref and defines five accessors. There are actually more accessors defined than the code uses, but they're basically free so why not? You can see what they do and their calling conventions in the Moose::Meta::Attribute::Native::Trait::Array docs.

Ok, so Moose is great! Except, it's slow. Way slow. Wicked slow, especially on a groggy cache like my laptop has when I rudely wake it up in the morning and demand it actually do something for me for once. Geeze.

HOWEVER, there's a neat little project called Mouse, which has the lofty goal of emulating all of the sugar of Moose without any of the fat. Meaning, it doesn't pay nearly as large a compile-time penalty as Moose does while retaining most of its meta-y goodness. I ran one little command on the source tree yesterday evening and bam, just like that, everything was three times as fast.

$ find . -name '*.pm' | xargs perl -pi -e 's/Moose/Mouse/g'

Actually I had to install MouseX::NativeTraits from CPAN before everything worked but that's just details.

Anyway, the moral of the story is that Moose is great and makes building classes really easy and all, but if you care about startup speed and not so much about delving into meta classes and such, Mouse should be your go-to class. And in fact, you don't have to make that choice. There's another project called Any::Moose, which will load Mouse unless you declare you want Moose, which can be set with an environment variable. Pretty neat.

Calorific, a Simple Calorie Tracker

2010-04-08T19:00:00+00:00

I'm a nerd. I write software for a living. I spend a lot of my day either sitting in a chair in front of a computer, or lying on my couch using my laptop. I'm not what you'd call... athletic. I did start lifting weights about six months ago but that's really just led to gaining more weight, not losing it. A few years back I started counting calories and I lost some weight, and then stopped counting calories and gained it all back. Time to change that.

Now, I could use one of the many, many online calorie trackers. They're all ok and they have the advantage of letting you enter data whenever and wherever you are, but most of them have ads, using a web interface is kind of slow, and staring at ads sucks. Also, the reports you can generate from them are always a bit limited. What if I want to see a monthly average of how many calories I ate as snacks? Or how many calories I shoved down my gullet from fast food? Or maybe I want to track another nutrient, like grams of protein. Doing all of this through a limited web interface would be tricky, to say the least. There has to be a better way.

I've been using this program called ledger for more than three years now to keep track of my finances. The idea is that you maintain a text file that contains all of your transactions in a really simple format, and then you can run basically arbitrary reports on it. I always have emacs open, so maintaining that file is a snap. I'd like to maintain my calorie history in the same way, using a lightly formatted text file. I actually tried to use ledger for this purpose but the syntax just wasn't right. What I really wanted was a way to build up foods from simpler foods, and have those be built from other, simpler foods, all the way down to calories. Something like this:

1 cup milk             = 100 kcal
1 scoop protein powder = 65 kcal
1 protein shake =
    1.5 cup milk,
    2 scoop protein powder

2010-04-08 breakfast
    1 protein shake

I danced around this format for quite a while, trying to parse it line-wise and trying to parse it with Parse::RecDescent and treetop, and nothing ever really fit. Then, I punted. What's a lightweight, human-readable format that already has a parser built? Why, YAML of course! Here's the same thing as a YAML snippet:

- 1 cup milk: 100 kcal
- 1 scoop protein powder: 65 kcal
- 1 protein shake:
    - 1.5 cup milk
    - 2 scoop protein powder

- 2010-04-08 breakfast:
    - 1 protein shake

The basic idea revolves around the concept of a recipe. Essentially, a recipe is a count, a label, and a bunch of components that can also be recipes. "100 kcal" is actually a recipe all by itself. Entries are just recipes that have a date instead of a count. At run-time, we resolve all the labels into recipes and then recursively get the values. Ideally everything will resolve down to a handful of base units, like "kcal" or "g protein", but if something doesn't resolve it'll get included right into the output.
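
To make the recursive part concrete, here's a minimal sketch of that resolution step in plain Perl. This is not Calorific's actual code. It assumes the YAML has already been parsed into a hash of label to component list, and the names %recipes, parse_component and resolve are made up for illustration.

```perl
use strict;
use warnings;

# Hypothetical, already-parsed recipes: label => list of "count label" components.
my %recipes = (
    'cup milk'             => ['100 kcal'],
    'scoop protein powder' => ['65 kcal'],
    'protein shake'        => ['1.5 cup milk', '2 scoop protein powder'],
);

# Split a component like "1.5 cup milk" into (1.5, 'cup milk').
sub parse_component {
    my ($text) = @_;
    my ($count, $label) = $text =~ /^\s*([\d.]+)\s+(.+?)\s*$/
        or die "cannot parse component '$text'";
    return ($count, $label);
}

# Recursively total a recipe down to base units.
# Returns a hash ref of unit => amount. A label with no recipe is
# treated as a base unit and passed straight through.
sub resolve {
    my ($count, $label) = @_;
    my $components = $recipes{$label};
    return +{ $label => $count } unless $components;

    my %totals;
    for my $component (@$components) {
        my ($c, $l) = parse_component($component);
        my $resolved = resolve($c, $l);
        $totals{$_} += $count * $resolved->{$_} for keys %$resolved;
    }
    return \%totals;
}

my $shake = resolve(1, 'protein shake');
printf "%s: %g\n", $_, $shake->{$_} for sort keys %$shake;    # prints "kcal: 280"
```

A label with no recipe behind it, say "mystery bar", just comes back as itself, which is the "included right into the output" behaviour described above.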

So, ok, now I just need a program to analyze this stupid thing and print me some reports. That's where Calorific comes in. It's a little application (<500 lines, actually) that parses that YAML file and prints out either a detail or daily report. I have some big plans for it, including a report that gives the monthly average of daily totals, options to limit the date range you want to report, and 30 day moving averages. Installation instructions are in the readme file, if you'd like to try it out.
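
The monthly average of daily totals, for instance, could look roughly like the sketch below. It assumes the daily totals have already been computed and are keyed by ISO date. The numbers are made up for illustration and monthly_average is a hypothetical name, not something Calorific ships.

```perl
use strict;
use warnings;

# Made-up daily kcal totals keyed by ISO date, purely for illustration.
my %daily_total = (
    '2010-03-30' => 2000,
    '2010-03-31' => 2400,
    '2010-04-01' => 1800,
    '2010-04-02' => 2200,
    '2010-04-03' => 2300,
);

# Average the daily totals within each calendar month.
# Takes a hash ref of date => total, returns a hash ref of 'YYYY-MM' => average.
sub monthly_average {
    my ($daily) = @_;
    my (%sum, %days);
    for my $date (keys %$daily) {
        my $month = substr $date, 0, 7;    # 'YYYY-MM'
        $sum{$month}  += $daily->{$date};
        $days{$month} += 1;
    }
    my %average = map { ( $_ => $sum{$_} / $days{$_} ) } keys %sum;
    return \%average;
}

my $avg = monthly_average(\%daily_total);
printf "%s: %.0f kcal/day\n", $_, $avg->{$_} for sort keys %$avg;
```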

Adding RSS and Other Things

2010-03-29T20:34:00+00:00

Someone at work today demanded that I add an RSS feed, so here you go: atom. It didn't take very much to hack it in. Basically, all I had to do was install a few more CPAN modules, specifically DateTime::Format::Natural, DateTime::Format::W3CDTF, and finally XML::Atom::SimpleFeed. The first is so I can put natural-looking dates in my entries and still be able to get full-fledged DateTime objects out of them, and the second is to save me the pain of writing out the Atom format's preferred datetime format. Also, I get neat date formatting in blog entries almost for free with the CLDR syntax.
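
For reference, the datetime format Atom wants looks like 2010-03-29T20:34:00Z. Here's a hand-rolled sketch that produces it using only the core POSIX module. It is a stand-in to show the shape of the output, not what DateTime::Format::W3CDTF does internally, and atom_timestamp is a hypothetical name.

```perl
use strict;
use warnings;
use POSIX qw(strftime);

# Format an epoch time as a UTC timestamp in the shape Atom expects.
sub atom_timestamp {
    my ($epoch) = @_;
    return strftime('%Y-%m-%dT%H:%M:%SZ', gmtime($epoch));
}

print atom_timestamp(0), "\n";    # prints 1970-01-01T00:00:00Z
```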

Another thing to notice: File::Slurp instead of my own read_file_contents and write_file_contents. It works just as well as mine, except it's more sensitive to list vs scalar context.
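
To show what "sensitive to list vs scalar context" means, here's a small sketch using wantarray. The slurp_file helper is hypothetical. It is neither File::Slurp's implementation nor my old read_file_contents, just the same idea in a few lines of core Perl.

```perl
use strict;
use warnings;
use File::Temp qw(tempfile);

# Hypothetical helper: returns the whole file as one string in scalar
# context, or a list of lines in list context.
sub slurp_file {
    my ($path) = @_;
    open my $fh, '<', $path or die "cannot open $path: $!";
    my @lines = <$fh>;
    close $fh;
    return wantarray ? @lines : join '', @lines;
}

# Write a small scratch file so the example is self-contained.
my ($out, $path) = tempfile(UNLINK => 1);
print {$out} "first\nsecond\n";
close $out;

my $text  = slurp_file($path);    # scalar context: one string
my @lines = slurp_file($path);    # list context: one element per line
printf "%d characters, %d lines\n", length($text), scalar(@lines);
```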

Yet Another Static HTML Blog

2010-03-28T22:15:00+00:00

I'm a strict believer in learning by doing. It's how I learn best. In the spirit of learning, then, here's how I built the engine that powers this blog.

Right away I decided that there's no point in having a database to back this thing. The only useful thing that a database brings to the table is comments, and those are way more hassle than they're worth. Better to leave the comments at reddit or hacker news, where they already know how to deal with spam. Not having to worry about a database freed me up to worry about more important things, like how to put text on the screen. I'm most familiar with perl at the moment so I decided that the best way to build it would be a client-side script that generates some static html.

Current features:

Absolutely no database
Generates fully static html
Automatically ships it to my server
A really cheesy template system because I didn't want to learn Template::Toolkit just yet
Archives for everything, and only show the last 10 entries on the front page
Static pages (although currently there aren't any)
Markdown parsing for entries

If you want to see the source for it (including all the entries), it's on github, but I warn you it's kind of lame. The template system in particular is not really what I want it to be yet. It's non-recursive, so publish.pl basically acts like the top-level template. I'll probably end up converting it to Template::Toolkit at some point.
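
To give a feel for what "non-recursive" means here, below is a rough sketch of a single-pass template function. It is not the engine's actual code. The {{name}} placeholder syntax and the render function are invented for illustration.

```perl
use strict;
use warnings;

# Hypothetical single-pass renderer: replaces {{name}} placeholders with
# values from a hash ref. Substituted values are not scanned again, so a
# template cannot pull in another template by itself.
sub render {
    my ($template, $vars) = @_;
    $template =~ s/\{\{(\w+)\}\}/exists $vars->{$1} ? $vars->{$1} : ''/ge;
    return $template;
}

# The driver script plays the role of the top-level template: it renders
# the inner piece first, then hands the result to the outer layout.
my $entry = render('<h2>{{title}}</h2>', { title => 'Hello' });
my $page  = render('<body>{{content}}</body>', { content => $entry });
print "$page\n";    # prints <body><h2>Hello</h2></body>
```

Because nothing is expanded twice, all the nesting has to be done by hand in the calling script, which is the job publish.pl ends up doing.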