Hash

johnsmith · December 5, 2007, 3:41pm

hi

im writing a class which so far:

reads a text file
and prints out the the number of Tags it encounters

however now i want to create an empty hash every time it encounters a
tag
line, and if it encounters any other field, put the field and the
related
value in the hash, using the field name as the key

im unsure of how to go about this
any help or pseudo code would be greatly appreciated
my code is provided below

thanks

.txt file

Tag: ref1
Type: Book
Author: Little, S R

Tag: ref2
Type: Journal
Author: Smith, J

Tag: ref3
Type: Conference Paper
Author: Williams, M

ruby file:

require ‘getoptlong’

opts = GetoptLong.new(
[’–style’, ‘-n’, GetoptLong::NO_ARGUMENT ],
[’–database’, ‘-i’, GetoptLong::REQUIRED_ARGUMENT]
)

$linecount = 0

opts.each do |opt, arg|
case opt
when ‘–style’
require arg
when ‘–database’
end
end

process options

File.open(‘reference.txt’).each do |line|
if line =~ /^tag:/i
$linecount += 1
end
end
puts $linecount

johnsmith · December 5, 2007, 4:21pm

2007/12/5, Johnathan S. [email protected]:

related
Tag: ref1
puts $linecount
–
Posted via http://www.ruby-forum.com/.

Why do you repost the same question when you got answers already? Are
those answers not propagated to ruby-forum?

Cheers

robert

johnsmith · December 5, 2007, 5:09pm

On Dec 5, 9:26 am, Johnathan S. [email protected] wrote:

you could point out my errors id be extremly grateful
puts $linecount
  linecount += 1
  hash[m[1]] = m[2].chomp
end
end

–
Posted viahttp://www.ruby-forum.com/.

If you don’t understand or don’t find an answer sufficient, say so in
the same thread, no one will be mad. But if you start a new one, then
people following the old one aren’t helped if you get the answer in
the new one.

Assuming that your text file is always in the format you listed, this
may be what you’re after…

database = {}
File.open(‘reference.txt’) { | handle |
tags = handle.read.split(“\n\n”)
for tag in tags
key, type, author = tag.split(“\n”)
database[key[5…-1]] = [type[6…-1], author[9…-1]]
end
}
p database

Given your sample data, this will output the following…

{“ref1”=>[“Book”, “Little, S R”], “ref2”=>[“Journal”, “Smith, J”],
“ref3”=>[“Conference Paper”, “Williams, M”]}

And the number of tags seen is just “database.length”.

Regards,
Jordan

johnsmith · December 5, 2007, 4:26pm

Why do you repost the same question when you got answers already? Are
those answers not propagated to ruby-forum?

hi

i wasnt sure i had recieved the answers i was looking for. I was
provided with a few different bits of code which im very grateful for
but I was unable to get working. one of the solutions provided by the
previous threat i will post below. Im not provided with any output at
all. I was to maintain the tag count as well as creating the hash. if
you could point out my errors id be extremly grateful

thanks

require ‘getoptlong’

opts = GetoptLong.new(
[’–style’, ‘-n’, GetoptLong::NO_ARGUMENT ],
[’–database’, ‘-i’, GetoptLong::REQUIRED_ARGUMENT]
)

$linecount = 0

opts.each do |opt, arg|
case opt
when ‘–style’
require arg
when ‘–database’
end
end

process options

File.open(‘reference.txt’).each do |line|
if line =~ /^tag:/i
$linecount += 1
end
end
puts $linecount

linecount = 0
results = []
hash = {}
File.open(‘reference.txt’).each do |line|
m = line.match /^(\w+):\s*([\w+,\s]+)$/
unless m
results << hash unless hash.empty?
hash = {}
else
linecount += 1
hash[m[1]] = m[2].chomp
end
end

johnsmith · December 5, 2007, 9:50pm

On Dec 5, 2:28 pm, Johnathan S. [email protected] wrote:

  $linecount += 1
end
end
puts $linecount

^ This part is pointless now. $linecount above is equal to
database.length in the code below.

–
Posted viahttp://www.ruby-forum.com/.

It should have been author[8…-1], sorry about that.

I suggest that you go through one of the many ruby guides/tutorials
listed here: Documentation

Regards,
Jordan

johnsmith · December 5, 2007, 9:28pm

hi,

thanks, i’ve got it to print out the hash
although I’m getting a wierd output where its taking the first letter
off the surname e.g:

ruby main.rb
3
{“ref1”=>[“Book”, “ittle, S R”], “ref2”=>[“Journal”, “mith, J”]

can you see in my code where im going wrong?
also, is it possible to print the different refrences on new lines?

ill provide my code below
thanks

File.open(‘reference.txt’).each do |line|
if line =~ /^tag:/i
$linecount += 1
end
end
puts $linecount

File.open(‘reference.txt’) { | handle |tags =
handle.read.split("\n\n")
for tag in tags
key, type, author = tag.split("\n")
database[key[5…-1]] = [type[6…-1], author[9…-1]]
end
}
p database

johnsmith · December 5, 2007, 10:00pm

cheers.

yer i have been reading many tutorials but was unable to grasp what I
was trying to achieve.

one for thing i was wondering:

if i was to extend the some of the information in the .txt file
e.g:

Tag: ref1
Type: Book
Author: Smith, J
Publisher: New Books
Chapter: 12

could i adapt the code so “any other information” is put into the hash
rather than having set information like author or type?

thanks

johnsmith · December 5, 2007, 11:16pm

On Dec 5, 2:00 pm, Johnathan S. [email protected] wrote:

Author: Smith, J
Publisher: New Books
Chapter: 12

could i adapt the code so “any other information” is put into the hash
rather than having set information like author or type?

Yes, both of the pieces of code I wrote for you originally handle this
case just fine. Please read through the code and understand it. You
asked for pseudo-code originally, and I actually gave you fully-
functioning code. Now you just need to understand how to use it and
its output.

johnsmith · December 5, 2007, 11:36pm

On Dec 5, 3:19 pm, MonkeeSage [email protected] wrote:

On Dec 5, 4:02 pm, Phrogz [email protected] wrote:

Yes, both of the pieces of code I wrote for you originally handle this
case just fine. Please read through the code and understand it. You
asked for pseudo-code originally, and I actually gave you fully-
functioning code. Now you just need to understand how to use it and
its output.

Sorry about that Phrogz. I didn’t read the other thread, I just
glanced at it briefly the other day. I didn’t realize you had already
answered this fully.

Absolutely no need for an apology. Helping people isn’t a competition
of who can answer first. And, if I choose to stop helping someone
because I’ve done as much as I’m willing to do, there’s certainly no
etiquette that says you shouldn’t step in.

I’m glad you helped Johnathan get further towards his solution that I
did, and certainly I don’t care that you used your own code versus
mine.

But I appreciate the sentiment.

johnsmith · December 5, 2007, 11:22pm

On Dec 5, 4:02 pm, Phrogz [email protected] wrote:

e.g:
Yes, both of the pieces of code I wrote for you originally handle this
case just fine. Please read through the code and understand it. You
asked for pseudo-code originally, and I actually gave you fully-
functioning code. Now you just need to understand how to use it and
its output.

Sorry about that Phrogz. I didn’t read the other thread, I just
glanced at it briefly the other day. I didn’t realize you had already
answered this fully.

Regards,
Jordan

johnsmith · December 5, 2007, 11:49pm

hey guys

sorry for the whole mix up! I appreciate all the help I’ve received
especially as im new to ruby! im also new to the forum and should have
tried to stick it our with the first threat! so i hope i havn’t offended
any of you by this.

that aside, i really hope you can help me resolve this problem.

Jordan, the code you gave me worked successfully. However, it just deals
with the the two types i gave you, author and type.Its partially my
fault for not explaining properly becuase i hope it could deal with any
information after the tag. is this possible to achieve?

again sorry for the mix up
cheers

johnsmith · December 6, 2007, 1:12am

On Dec 5, 4:49 pm, Johnathan S. [email protected] wrote:

hey guys

sorry for the whole mix up! I appreciate all the help I’ve received
especially as im new to ruby! im also new to the forum and should have
tried to stick it our with the first threat! so i hope i havn’t offended
any of you by this.

Nah! Any mistake you might make, we’ve already done that and worse
(probably several times).

that aside, i really hope you can help me resolve this problem.

Jordan, the code you gave me worked successfully. However, it just deals
with the the two types i gave you, author and type.Its partially my
fault for not explaining properly becuase i hope it could deal with any
information after the tag. is this possible to achieve?

Sure it is. The code I gave you can be expanded, but it was mainly
just for you to get started with, because it is fragile (as you saw
from my mis-post). Even with the correct “author[8…-1]”, if you
accidentally leave out the space between the “Author: Name” line, it
will still clip the first letter off the name ([number…number] means
to grab a smaller section from a larger container, and -1 has the
special meaning of the last item; so author[8…-1] means to get the
characters from number 8 to the end).

What you really want is something more flexible, that will allow for
those kinds of mistakes and still do the right thing in almost every
case. That is a regular expression. It lets you give it a pattern
which has certain rules, and matches a string in different ways based
on those rules. And this is just what Phrogz’ code is doing:

we’ll get rid of linecount since you can

get that information directly from your hash

and we’ll store the hashes as subhashes

underneath a main one, keyed by “Tag:” value

database = {}

File.open(‘reference.txt’) { | handle |

last_tag = nil # we’ll use this to keep track of
# the last tag we’ve seen, see below

handle.each { | line |

# the following regexp says to match:
#   ^          = starting at the first char
#   (\w+)      = 1 or more "word" characters (a-z / 0-9)
#   :          = followed by a literal colon
#   \s*        = then zero or more "space" characters
#  ([\w+,\s]+) = 1 or more "word/space/," characters
#  $           = with nothing else until the end of line
#
# any string meeting that criteria is a match and m is
# the match data, however if the string doesn't match
# e.g., a blank line, m is nil

m = line.match(/^(\w+):\s*([\w+,\s]+)$/)

if m # if m is a match (i.e., not nil)

  if m[1] == 'Tag' # we want to add a key to the hash
                   # since this is a "Tag:" line
                   # when we used the () in the regexp
                   # that told ruby we only care about
                   # those chars, and we can access
                   # them in the match object with m[n]
                   # i.e, m[1] means whatever part of the
                   # line matched (\w+)

    last_tag = m[2].chomp
    database[last_tag] = {} # make a subhash as the value
                            # now we have, e.g.,
                            # database["ref1"] = {}
  else

    # it's not a "Tag:" line, so we add it as key, value
    # to the subhash we created for the last tag key we
    # created in the hash

    database[last_tag][m[1].downcase] = m[2].chomp
    # e.g., database["ref1"]["type"] = "Journal"

  end
end

}
}

now print it

print "# of Tags: ", database.length, “\n\n”
print "Contents: ", database.inspect, “\n\n”

puts database[“ref2”][“author”]

========

This outputs:

of Tags: 3

Contents: {“ref1”=>{“author”=>“Little, S R”, “type”=>“Book”},
“ref2”=>{“author”=>“Smith, J”, “type”=>“Journal”},
“ref3”=>{“author”=>“Williams, M”, “type”=>“Conference Paper”}}

Smith, J

again sorry for the mix up
cheers

–
Posted viahttp://www.ruby-forum.com/.

No worries

HTH,
Jordan

johnsmith · December 6, 2007, 1:54am

although i have one conern at the end of the output it prints J, Smith
randomly

oh wait…iv solved it
guess it shows i shud actually have a go at it properly

cheers

johnsmith · December 6, 2007, 1:46am

thanks mate.

i really appreciate it. and its making sense!
although i have one conern at the end of the output it prints J, Smith
randomly

can this be avoided?

thanks again