What is this syntax: \001\002?

Hi,

On p. 131-132 in “Programming Ruby 2d.”, it talks about writing binary
data to files, but there’s no explanation of the syntax:

str1 = “\001\002\003”

I looked in the index of the book, I poured over “The Ruby Language”
chapter, and I also couldn’t find any escape character that was “\0”.
Is that Ruby’s unicode syntax? How about a ding dang explanation Mr.
Thomas?

7stud – wrote:

Hi,

On p. 131-132 in “Programming Ruby 2d.”, it talks about writing binary
data to files, but there’s no explanation of the syntax:

str1 = “\001\002\003”

This is an octal escape sequence. Use it to insert an arbitrary byte
into a string.

The escape code is \n, \nn or \nnn where each n is from 0 to 7. In octal
000 is 0 and 777 is 255.

best,
Dan

Le 12 septembre à 13:10, 7stud – a écrit :

I looked in the index of the book, I poured over “The Ruby Language”
chapter, and I also couldn’t find any escape character that was “\0”.
Is that Ruby’s unicode syntax? How about a ding dang explanation Mr.
Thomas?

You should look in the 3rd part, chapter 22, under “The Basic Types”.
Table 22.2, page 306 (in my PDF edition).

It says :

Table 22.2. Substitutions in double-quoted strings
\a Bell / alert (0x07) \nnn Octal nnn
\b Backspace (0x08) \xnn Hex nn
\e Escape (0x1b) \cx Control-x
\f Formfeed (0x0c) \C-x Control-x
\n Newline (0x0a) \M-x Meta-x
\r Return (0x0d) \M-\C-x Meta-control-x
\s Space (0x20) \x x
\t Tab (0x09) #{code} Value of code
\v Vertical tab (0x0b)
--------------------------------------------------------------

So, your example uses the octal representation.

Fred

On 9/12/07, 7stud – [email protected] wrote:

Hi,

On p. 131-132 in “Programming Ruby 2d.”, it talks about writing binary
data to files, but there’s no explanation of the syntax:

str1 = “\001\002\003”

I looked in the index of the book, I poured over “The Ruby Language”
chapter, and I also couldn’t find any escape character that was “\0”.
Is that Ruby’s unicode syntax?

Hi
That’s the way of expressing octal numbers.

F. Senault wrote:

Le 12 septembre � 13:10, 7stud – a �crit :

I looked in the index of the book, I poured over “The Ruby Language”
chapter, and I also couldn’t find any escape character that was “\0”.
Is that Ruby’s unicode syntax? How about a ding dang explanation Mr.
Thomas?

You should look in the 3rd part, chapter 22, under “The Basic Types”.
Table 22.2, page 306 (in my PDF edition).

Ah. I didn’t realize there was a second column on the right.

Thanks everyone.

Daniel L. wrote:

The escape code is \n, \nn or \nnn where each n is from 0 to 7. In octal
000 is 0 and 777 is 255.

777 is 255 ?

Daniel L. wrote:

Daniel L. wrote:In octal

000 is 0 and 777 is 255.

No it very much isn’t. 377 is what I was going for there. Thank you for
being too polite to say…

:frowning:
Dan

Try again.

A byte is coded on 8 bits, so the maximum is 255 hence the strange
results below :

irb> 8 * 8 * 7 + 8 * 7 + 7
=> 511
irb> “\777”.inspect
=> “\377”
irb> “\777”[0]
=> 255
irb> “\377”[0]
=> 255

\777 is invalid and is truncated to \377.

Daniel L. wrote:In octal

000 is 0 and 777 is 255.

No it very much isn’t. 377 is what I was going for there. Thank you for
being too polite to say…

:frowning:
Dan

Gaspard B. wrote:

=> 255

\777 is invalid and is truncated to \377.

How I represent a number eight?

sysdebug(main):024:0> “\008”[0]
=> 0

This returned 0!

Regards,

On 12.09.2007 13:45, F. Senault wrote:

| \f Formfeed (0x0c) \C-x Control-x
| \n Newline (0x0a) \M-x Meta-x
| \r Return (0x0d) \M-\C-x Meta-control-x
| \s Space (0x20) \x x
| \t Tab (0x09) #{code} Value of code

\v Vertical tab (0x0b)

So, your example uses the octal representation.

Does anyone know a good reason for outputting 8 bit byte characters
as octals in String#inspect?

see string.c (1.8.6):

[…]
else {
sprintf(s, “\%03o”, c & 0377);
rb_str_buf_cat2(result, s);
}
[…]

why not make it:

        sprintf(s, "\\x%02x", c & 0377);

I can’t understand why it is desirable to introduce yet another base
that is rarely used outside of chmod and od (without options). We
learned decimals, we got used to binary and even hex, but why do we
need octals just for these control characters when there is no
benefit compared to the hex representation (note: both take 4 chars
to display)?

  • Matthias

Jonas Roberto de Goes Filho (sysdebug) wrote:

irb> “\377”[0]

This returned 0!
8 doesn’t exist in octal, so it interprets the 8 as a separate
character.

irb(main):009:0> “\007”.length
=> 1
irb(main):010:0> “\008”.length
=> 2

On 12.09.2007 17:28, Rob B. wrote:

On Sep 12, 2007, at 8:48 AM, Matthias Wächter wrote:

I can’t understand why it is desirable to introduce yet another base
that is rarely used outside of chmod and od (without options). We
learned decimals, we got used to binary and even hex, but why do we
need octals just for these control characters when there is no
benefit compared to the hex representation (note: both take 4 chars
to display)?
This is pure history.

Well, I understand that in history it was a popular format for
various reasons. But Ruby is now, and now we have 8 bit chars (aka
Bytes) and use hex nearly exclusively for them. Why not so in
String#inspect?

Or is the reason similar to the question why Float#to_s uses 15
instead of 17 significant bits to display? Pure convention? A (core)
developer rolling a dice somewhen? :wink:

  • Matthias

% cheat string
string:
Ruby 1.8 String Literal
[…]
|---------±-----------------|
| Format | Translation |
|---------±-----------------|
| \nnn | Octal nnn |
| \xnn | Hexadecimal 0xnn |
| \cx | Control-x | Control-x == (?x & 0b10011111)
| \C-x | Control-x |
| \M-x | Meta-x | Meta-x == (?x | 0b10000000)
| \M-\C-x | Meta-control-x |
| \x | x |
| #{expr} | Value of expr |
|---------±-----------------|
[…]

Steve

On Sep 12, 2007, at 8:48 AM, Matthias Wächter wrote:

| \e Escape (0x1b) \cx Control-x
Does anyone know a good reason for outputting 8 bit byte characters

  • Matthias

This is pure history. The use of octal notation predates hex by a
long time. In my “The C Programming Language” [1978], 0ddd and 0xddd
were both valid integer literals, but only octal \ddd was valid for
character constants.

In fact, there’s a table of internal sizes for data types on various
machines which includes the Honeywell 6000 having a 9-bit char type
and 36-bit types for short, int, long, and float and 72-bit doubles.
For that machine, octal representation of bit patterns makes perfect
sense.

-Rob

Rob B. http://agileconsultingllc.com
[email protected]