Forum: Ruby String Equations (#112)

Announcement (2017-05-07): is now read-only since I unfortunately do not have the time to support and maintain the forum any more. Please see and for other Rails- und Ruby-related community platforms.
4299e35bacef054df40583da2d51edea?d=identicon&s=25 James Gray (bbazzarrakk)
on 2007-02-08 21:10
(Received via mailing list)
This quiz is much more difficult than it looks.  There are an infinite
number of
combinations, even for small word sets, because each term can be

To handle this, most brave souls who solved the quiz used some matrix
transformations from linear algebra for solving a system of linear
This makes it possible to find solutions in reasonable time, but I had
to drag
out the math textbooks to decode the solutions.

Let's take a look into one such solution by Eric I.:

  require 'mathn'

  CompactOutput = false

  # calculate the least common multiple of one or more numbers
  def lcm(first, *rest)
    rest.inject(first) { |l, n| l.lcm(n) }

  # ...

This should be pretty easy to digest.  The mathn library is pulled in
here to
get more accurate results in the calculations the code will be doing, a
selects the desired output mode, and a shortcut is defined for applying
over an Array of numbers.

The next method is where all the action is, so let's take that one

  # ...

  # Returns nil if there is no solution or an array containing two
  # elements, one for the left side of the equation and one for the
  # right side.  Each of those elements is itself an array containing
  # pairs, where each pair is an array in which the first element is the
  # number of times that word appears and the second element is the
  # word.
  def solve_to_array(words)
    # clean up word list by eliminating non-letters, converting to lower
    # case, and removing duplicate words! { |word| word.downcase.gsub(/[^a-z]/, '') }.uniq!

    # ...

The first comment does a good job of describing the result this method
eventually produce, so you may want to glance back to it when we get
that far.

The first set of operations is the word normalization process right out
of the
quiz.  This code shouldn't scare anybody yet.  (Just a quick side note
it is possible to use delete("^a-z") here instead of the gsub() call.)

One more easy bit of code, then we will ramp things up:

    # ...

    # calculate the letters used in the set of words
    letters =
    words.each do |word|
      word.split('').each { |letter| letters[letter] = true }

    # ...

This code just makes a list of all letters used in the word list.  (Only
keys() of the Hash are used.)  To see what that's for, we need to dive
into the

    # ...

    # create a matrix to represent a set of linear equations.
    column_count = words.size
    row_count = letters.size
    equations = []
    letters.keys.each do |letter|
      letter_counts = []
      words.each { |word| letter_counts << word.count(letter) }
      equations << letter_counts

    # ...

This code build the matrix we are going to work with to find answers.
column in the matrix represents a word and each row a letter.  The
numbers in
the matrix then are just a count of the letter in that word.  For
example, using
the quiz equation this code produces the following matrix:

            l   m
            d   a r
            e   r i
          l m   v d
          o o t o d
        a r r o l l
      i m d t m o e
  v | 0 0 0 1 0 1 0
  l | 0 0 1 1 0 1 1
  a | 0 1 0 0 0 1 0
  m | 0 1 0 1 1 1 0
  d | 0 0 1 1 0 0 2
  o | 0 0 1 2 1 2 0
  e | 0 0 0 1 0 0 1
  r | 0 0 1 1 0 1 1
  t | 0 0 0 1 1 0 0
  i | 1 0 0 0 0 0 1

If you glance back at the code now, it should be pretty clear how it
builds the
matrix as an Array of Arrays.

Now we're ready to manipulate the matrix and this is the first chunk of
that does that:

    # ...

    # transform matrix into row echelon form
    equations.size.times do |row|
      # re-order the rows, so the row with a value in then next column
      # to process is above those that contain zeroes
      equations.sort! do |row1, row2|
        column = 0
        column += 1 until column == column_count ||
          row2[column].abs != row1[column].abs
        if column == column_count : 0
        else row2[column].abs <=> row1[column].abs

      # figure out which column to work on
      column = (0...column_count).detect { |i| equations[row][i] != 0 }
      break unless column

      # transform rows below the current row so that there is a zero in
      # the column being worked on
      ((row + 1)...equations.size).each do |row2|
        factor = -equations[row2][column] / equations[row][column]
        (column...column_count).each do |c|
          equations[row2][c] += factor * equations[row][c]

    # ...

Now you really don't want me to describe that line by line.  Trust me.
let me sum up what it does.

This code transforms the matrix into row echelon form, which says that
rows in the matrix have entries in further left columns and that the
significant entry in a row is preceded only by zeros.  That sounds
scarier than
it is.  Here's the transformed matrix (without the labels this time):

  1  0  0  0  0  0  1
  0  1  0  1  1  1  0
  0  0  1  2  1  2  0
  0  0  0 -1 -1 -2  2
  0  0  0  0 -1 -2  3
  0  0  0  0  0 -2  2
  0  0  0  0  0  0  0
  0  0  0  0  0  0  0
  0  0  0  0  0  0  0
  0  0  0  0  0  0  0

The why behind this transformation is that its the first step in solving
for our

On to the next bit of code:

    # ...

    # only one of the free variables chosen randomly will get a 1, the
    # rest 0
    rank = { |row| row.any? { |v| v != 0 }}.size
    free = equations[0].size - rank
    free_values =, 0)
    free_values[rand(free)] = 2 * rand(2) - 1

    # ...

This bit of math uses the rank of the matrix to determine the free
variables it
will solve for.  Free variables are just placeholders for substitutions
in our
system of equations.  More concretely, they are where one or more words
will be
inserted in our string equations.

Setting a single value to one, as the comment mentions, is basically
to to work with one word at a time.  That's why the others are zeroed

One more variable is prepared:

    # ...

    values =[0].size)  # holds the word_counts

    # ...

As the comment explains, this will eventually be the counts for each

OK, here's the last big bit of math:

    # ...

    # use backward elimination to find values for the variables; process
    # each row in reverse order
    equations.reverse_each do |row|
      # determine number of free variables for the given row
      free_variables = (0...column_count).inject(0) do |sum, index|
        row[index] != 0 && values[index].nil? ? sum + 1 : sum

      # on this row, 1 free variable will be calculated, the others will
      # get the predetermined free values; the one being calculated is
      # marked with nil
      free_values.insert(rand(free_variables), nil) if free_variables >

      # assign values to the variables
      sum = 0
      calc_index = nil
      row.each_index do |index|
        if row[index] != 0
          if values[index].nil?
            values[index] = free_values.shift

            # determine if this is a calculated or given free value
            if values[index] : sum += values[index] * row[index]
            else calc_index = index
            sum += values[index] * row[index]
      # calculate the remaining value on the row
      values[calc_index] = -sum / row[calc_index] if calc_index

    # ...

This elimination is the second and final matrix transform leading to a
The code works through each row or equation of the matrix, determining
for the free variables.

Again this process is much more linear algebra than Ruby, so I won't
bother to
break it down line by line.  Just know that the end result of this
process is
that values now holds the counts of the words needed to solve quiz.
counts belong on one side of the equation, negative counts on the other.

This is the code that breaks down those counts:

    # ...

    if values.all? { |v| v } && values.any? { |v| v != 0 }
      # in case we ended up with any non-integer values, multiply all
      # values by their collective least common multiple of the
      # denominators
      multiplier =
        lcm(* { |v| v.kind_of?(Rational) ? v.denominator : 1
})! { |v| v * multiplier }

      # deivide the terms into each side of the equation depending on
      # whether the value is positive or negative
      left, right = [], []
      values.each_index do |i|
        if values[i] > 0 : left << [values[i], words[i]]
        elsif values[i] < 0 : right << [-values[i], words[i]]

      [left, right]   # return found equation
      nil  # return no found equation

  # ...

Assuming we found a solution, this code divides the words to be used
into two
groups, one for each side of the equation.  It divides based on the
positive and
negative counts I just explained in values and the end result was
described in
that first comment at the top of this long method.

With the math behind us, the rest of the code is easy:

  # ...

  # Returns a string containing a solution if one exists; otherwise
  # returns nil.  The returned string can be in either compact or
  # non-compact form depending on the CompactOutput boolean constant.
  def solve_to_string(words)
    result = solve_to_array(words)
    if result
      if CompactOutput do |side|
 { |term| "#{term[0]}*\"#{term[1]}\"" }.join(' + ')
        end.join(" == ")
      else do |side|
 { |term| (["\"#{term[1]}\""] * term[0]).join(' + ')
            join(' + ')
        end.join(" == ")

  # ...

This method just wraps the previous solver and transforms the resulting
into the quiz equation format.  Two different output options are
controlled by
the constant we saw at the beginning of the program.

Here's the final piece of the puzzle:

  # ...

  if __FILE__ == $0  # if run from the command line...
    # collect words from STDIN
    words = []
    while line = gets
      words << line.chomp

    result = solve_to_string(words)

    if result : puts result
    else exit 1

This code just brings in the word list, taps the solver to do the hard
work, and
sends back the results.  This turns the code into a complete solution.

My thanks to all of you who know math so much better than me.  I had to
use math
books and my pet math nerd just to breakdown how these solutions worked.

Tomorrow we return to easier problems, pop quiz style...
This topic is locked and can not be replied to.