RCRS

RCR 2: Icon-like suspend or "inside-out yield" (dave, 2001-07-30 15:51:23)

Status: Rejected

Icon has the suspend command (see for example) which allows a function to produce a result in the way that return does, but it suspends execution so that if the function is called again in the same context execution can continue from where the suspend left off. It could be considered to be an inside-out yield. This is used to create generators rather than iterators. It is achieved by leaving the state of the function on the stack. I suspect this may give a more functional style of programming than Ruby's iterators, but messing around with the stack in this way may be considered messy. It might be possible for the function to carry its state with it in some other way like closures do. previously discussed in ?

Comments

Possible reason for the rejection. (HughSasse, 2001-08-02 06:50:45)

The "same context" in Icon depends on re-entry from within the same statement. This might occur when the statement goes through backtracking: Icon is goal directed and statement succeed or fail, and statements with generators in them backtrack if they don't succeed at first. Ruby does not use this model. It is a powerful model but difficult to get used to. Without this model, conditions for re-entering the generator from where it left off are rather difficult to determine. Adding goal directed execution to Ruby would be a huge change.

Reason for rejection

No reason given

RCR 3: Shortcut for instance variable initialization (dave, 2001-07-30 15:54:41)

Status: Rejected

I'm constantly creating small classes, and many of them start something like:

  class Msg
    def initialize(name, type, txt)
      @name, @type, @txt = name, type, txt
    end
    # ...
  end

So, could we change parameter passing slightly so that if an instance method has a formal parameter starting with an '@' sign, the incoming value is assigned directly to the corresponding instance variable? Using this scheme I could write the above as:

  class Msg
    def initialize(@name, @type, @txt)
    end
  end

Reason for rejection

No reason given

RCR 4: Cut operator for short-circuiting method chains (dave, 2001-07-30 15:57:10)

Status: Rejected

The idea of this is that methods such as sort!() would be ideal to use in a chain, except for its sometimes returning nil. I proposed that an operator (which I called cut, vaguely after the Prolog cut) would break the chain in the case where it was used with a nil receiver. Otherwise it would just pass on the receiver.-- HughSasse previously discussed - [] [] [] []

Reason for rejection

It smells like Objective-C. I liked the idea, but we should use exceptions for this purpose if the language provide it.

RCR 6: Infix 'function composition' operator (dave, 2001-07-30 16:02:26)

Status: Rejected

I'd sure like an infix 'function composition' operator for "chaining" proc's but I'm not sure this is important or generally useful to include in the language as opposed to requiring a file with the code:

  class Proc
    def *(other)
      raise unless arity == other.arity
      proc {|*a| self.call(other.call(*a))}
    end
  end

and then, as a side-note being able to do:

  f = lambda{|x| 2*x}
  g = lambda{|x| x+1}
  h = f * g
  h.call 1         # => 4

Why not throw in some syntactic sugar for the "lambda{|x| ...}" stuff à la Haskells "x -> 2*x" while we're at it!

Comments

Re: Infix 'function composition' operator (Stephan, 2001-08-01 02:13:31)

Having somewhat of a mathematical background, I'd really like to see this in my now favourite language.
I'll vote for this one.

Reason for rejection

A very few people would use this feature.

RCR 8: Alternate variable declaration (dave, 2001-07-30 16:04:34)

Status: Rejected

Another way to declare that something is a variable. Like "let" in Lisp or "my" in Perl. By itself this is useful as a way of telling Ruby that I don't care if there is already something in scope called foo, this foo is private.

Comments

use iterate { &lti, j> ... } notation (joe, 2001-08-29 12:31:29)

I like Dave's idea from:

Possibly we could use a different set of delimiters for local block
parameters:

iterate { |i, j| ... } # as now

iterate { ... } # 'i' is always block local

I think Matz liked it too:

Reason for rejection

No. Variable declaration is evil from the eyes of rubyists.

RCR 9: Pragma to disallow variable creation with just an assignment (dave, 2001-07-30 16:05:40)

Status: Rejected

request that there be a pragma to (lexically) disallow declaring a variable by just assigning to it. Combining this with the previous RCR could protect users from typos ala perl's use strict "vars"

Comments

We need to flesh this out a bit... (Dave, 2001-08-10 00:18:30)

If the pragma is enabled, what is the proposed scheme for declaring a variable, and how will these variables be scoped?

Will this pragma be file-local, or global? (I'm thinking it will have to be file-local, or all hell will break loose with required libraries).

If the pragma is enabled, and 'a' isn't known to be a variable, will the interpreter be free to try self.a and self.a= methods (as appropriate)?

Warn about variables used only once (Moxon, 2001-10-14 07:43:58)

Perl has the -w switch, which enables certain warnings. Especially the warning about variables which are used only once helped me finding lots of typos. Though one might repeatedly mistype a variable name (e.g. when using emacs dynamic expansion [type the beginning of a word and tell emacs search to the buffer for completions]), it was a common mistake to me. Maybe this is an useable approach, too.

Reason for rejection

I'd take warning approach.

RCR 10: Keep track of implicitly called methods (dave, 2001-07-30 16:06:43)

Status: Rejected

request to keep track of what methods have been implicitly called and have a function you can call that checks that they all resolve properly. If you make a habit of adding this at the end of classes, this will catch the common error where a method will blow up later due to a typo. (There is, of course, no good way to test for typos in explicit method calls. That wouldn't stop it from being a useful feature though.) a la perl's use strict "subs"

Comments

I'm not sure I see this one (Dave, 2001-07-30 17:25:56)

Could someone explain how this could work? This seems to require knowledge that just isn't available. Wouldn't the better approach be to have an option that required local variables to be declared before use?

Vague; some issues not considered (matju, 2001-08-31 07:31:12)

Too vague; no actual mechanism proposed.

I think the poster may be talking about methods with implicit receivers. In that case you have more information on the methods it may call or not; but the poster doesn't take care of (or even mention) Object#method_missing, method-undef, forward references, and how his feature would interact with those.

I don't know what Perl's strict-subs have to do with it, I don't remember anything it did for me, and IIRC, strict-subs don't do anything when using Perl's method-lookup.

Reason for rejection

Need to be more concrete to accept.

RCR 11: Allow "#{foo} bar" to be used as "#foo bar" (dave, 2001-07-30 16:07:42)

Status: Rejected

Would like to be able to use #foo in place of #{foo} (Just to stop people complaining that interpolation requires 2 extra characters.) ala perl's "$foo bar"

Comments

That might introduce some ambiguities (Dave, 2001-07-31 23:04:14)

If 'fred' is a method taking a parameter, would Ruby be expected to parse



  "hello #fred and barney"

as a call to fred? If so, what would be passed as the parameter? And how would it know to do it?

If the Perl people complain it takes an extra two characaters, point out that the stuff between the {}'s is an arbitrary expression.

#foo would be only for variables (anonymous, 2001-08-08 03:30:12)

Just specify that #foo may only be used for variables, which probably is the most common use.

If you need arbitrary expressions you have to use #{foo}. With this rule there are no ambiguities.

marko

Reason for rejection

I feel it's too confusing.

RCR 12: New method Enumerable#hashify(value) (dave, 2001-07-30 16:08:49)

Status: Rejected

Return a hash based on iterating through the receiver (an enumerable), either setting every value to the parameter "value" or setting the values via a block. previously discussed - [] [] [] [] [] [] [] [

Comments

This would be useful (Dave, 2001-07-31 17:15:10)

This is an idiom I find myself using fairly often, so having it built-in would be nice.--Dave

Reason for rejection

Hash[*a.flatten] would work for this purpose.

RCR 13: Make Ruby XML-ready "out of the box" (dave, 2001-07-30 16:10:22)

Status: Rejected

All future Ruby distributions (tgz, InstallShield?) ought to be "XML-ready" out of the box. I think XML is going to be the "sort-of next big thing"--i.e. not that big, but big enough to matter. (Where's my marketing hat. Ahh, here it is. "Why dork around with Perl, Python, or Tcl when Ruby supports XML out of the box?")

Comments

here here (ianm74, 2001-07-30 16:20:50)

This would be great. XML is becoming so pervasive by now. No language should be without it.

A must-have if we can resolve all the issues (Dave, 2001-07-30 17:22:43)

On the face of it, this is one of those non-brainers: XML support is pretty much a prerequisite nowadays. However, the devil, as always, lurks in the details. In particular: how do we provide XML support? Do we link in expat? If so, do we distribute expat sources? Can we distribute just binaries under Windows? Or should we use NQXML, which is a more elegant solution, but one that opens us up to criticisms of being non-standard and (possibly) slow.

I'm not saying "don't do it". I think it's a key idea. We just have to make sure we get it right. -- Dave

XML and Web Services (Rich_Kilmer, 2001-07-30 17:38:23)

If Ruby could support the emerging Web Services environment consisting of XML/HTTP, Simple Object Access Protocol (SOAP), Web Services Description Language (WSDL), and UDDI...that would make Ruby really allow Ruby to pass any other language.

Imagine in two lines of Ruby code "publishing" a Ruby object for any other language to use across the network in the same way Michael Neumann's XML-RPC Ruby library does.

That would be power, and one no other language currently offers "out of the box"

BTW: We need an XML parser AND generator framework.

expat (anonymous, 2001-08-01 08:08:21)

If expat was part of Ruby distribution,
the author of NQXML might have used
it as the base for his module instead
of completely write it from ground up,
even for an excise,
of course, I am imaginating. Making
expat a base for XML in Ruby so that
other standard modules can be written
on top of can solve many issues, such
as maitenance and performance, NQXML
should be able to adapt to use it.

XML support essential (xen, 2001-08-05 23:54:30)

I think XML is going to be the "sort-of next big thing" --i.e. not that big, but big enough to matter.

With respect, you should think again. W3C wants XML to replace HTML, M$ is using XML as its data protocol in .NET, MacOSX uses XML for its config files. Beyond that all sorts of publishers and data providers are moving to XML or (XML compliant SGML) for preparation, tracking and publication. All the signs are that XML is going to be the data format (and markup language) of the 2000s-10s.

Where I work we have had to begin dealing with information being delivered in XML. Although its a perl shop, I have pursuaded the manager to let me use python for 'prototype development purposes'. Although I prefer ruby, the excellent XML support in python made the case for python arguable.

All future Ruby distributions (tgz, InstallShield?) ought to be "XML-ready"

Absolutely! And what this means is that ruby must have acess to some parser, perhaps expat (but why not Xerces?), but most importantly of all the standard APIs (ie SAX and DOM) will need to be supported (though there is nothing wrong with having some unique ruby-xml API in addition the the standard ones).

How about data-binding (anonymous, 2001-08-11 13:04:44)

Has anyone implemented XML data binding for Ruby? With Ruby's powerful meta-programming facilities I can imagine a very powerful API for XML processing that maps attributes and elements into Ruby attributes, methods and iterators.

Data-binding and Ruby (Rich_Kilmer, 2001-08-12 14:43:07)

I was able to use the XMLParser (expat) parser and write a generic ParsedXMLElement class that leveraged Ruby's method_missing method to capture calls to allow the following syntax:

&lt;tag1 foo="bar"&gt;
  &lt;tag2 this="that"/&gt;
  &lt;tag2 this="the other"&gt;mydata&lt;/tag2&gt;
&lt;/tag1&gt;

becomes:

xmle.element_tag -> tag1
xmle.attr_foo -> bar
xmle.tag2[0].attr_this -> that
xmle.tag2[1].element_data -> mydata
xmle.tag2.count -> 2
xmle.tag2.each {|element| puts element.element_data} -> nil, mydata

Notice I preface the attributes of a tag with attr_ and the sub-tags with the actual name. The array index (tag[0]) is optional if there is only one. The xml element's direct data is element_data. Its worth trying to build this into some type of library to deal with XML that's Ruby flavored.

fix to previous post... (Rich_Kilmer, 2001-08-12 14:45:30)

becomes...

xmle.element_tag -> tag1
xmle.attr_foo -> bar
xmle.tag2[0].attr_this -> that
xmle.tag2[1].element_data -> mydata
xmle.tag2.count -> 2
xmle.tag2.each {|element| puts element.element_data} -> nil, mydata

wave goodbye to expat (xen, 2001-10-01 23:42:48)

how do we provide XML support? Do we link in expat? If so, do we distribute expat sources? Can we distribute just binaries under Windows? Or should we use NQXML

What we should to is not to waste time using expat at all. We should use a professional quality XML parser. IMHO this should be Xerces (from xml.apache.org) because of MSXML's platform specificity and 'extensions' to the standard.

The approach taken with Perl, to create Xerces.pm, an API to Xerces-C (resulting in much faster parsing), would seem to be the obvious way to go.

full DOM (tobi, 2001-10-09 15:04:42)

I think it would be a lot of fun to have a full featured and fully conformant XML DOM implementation. (one in Ruby, one in C for Ruby(an existing implementation could be wrapped))
100% DOM2 seems to be a lot of fun, especially http://www.w3.org/TR/DOM-Level-2-Core/
and http://www.w3.org/TR/DOM-Level-2-Traversal-Range/
Perhaps some XML experts would help.

Xerces, expat, REXML ... why should I know? (fmitchell, 2001-10-24 10:23:44)

We could borrow from Python and/or Java and provide a standard API for creating and using an XML parser, and then adapt non-conforming to this API.

How to find a parser implementation, and which concrete parser to instantiate if implementations are found, are tricky problems. Java uses system properties and a default in the JAXP implementation, but I'm not sure how Python's xml.sax.xmlreader locates parsers.

Defining that API in a way that suits all parties is left as an exercise for the reader.

XML in the standard lib of Ruby (tobi, 2001-11-13 17:50:16)

a proposal for a strategy and for requirements:

Reason for rejection

1.8 will ship with REXML.

RCR 14: New method, instance_eval + value passing (dave, 2001-07-30 16:11:21)

Status: Rejected

matz has proposed that there should be a method with functionality similar to instance_eval but with the ability to pass values to the evaluating block. matz suggests that the behavior contradicts the name and role of instance_eval, and so a new method with a new name should be implemented. instance_yield is noted as being one possible name.

Reason for rejection

"instance_yield" did not make concensus. I guess we need a better name.

RCR 18: Move numeric iterators out of Integer and in to Numeric (dave, 2001-07-30 16:17:52)

Status: Rejected

Is there any reason that #step, #upto and friends are in Integer, not Numeric? []

Comments

Possibly because... (Dave, 2001-07-30 17:32:30)

If they were in Numeric, then we'd be wondering what



  Math::PI.times { puts "Hello" }

  Complex::I.times { puts "Goodbye" }

would output.

Reason for rejection

Numeric may not be repeatable.

RCR 19: Method additions to Numeric (dave, 2001-07-30 16:19:32)

Status: Rejected

Request to add the following methods:

Numeric#odd?
Numeric#even?
Numeric#negative?
Numeric#positive?

previously discussed - [] []

Comments

Would this work for all kinds of numbers? (anonymous, 2001-08-01 07:32:37)

What about complex numbers? Is -3-2i positive or negative, even or odd?

Non numeric method? (matz, 2001-08-02 02:38:30)

I don't think they can be defined for Numeric in general.

odd?, even? can be defined for integers; positive? and negative? can be defined for integers and floats.

Even though it's possible, I still suspect if they are worthy enough to add, or not.

Reason for rejection

I don't think they are methods for Numeric, maybe for Integer. If someone want to see these methods for Integers, submit new RCR.

RCR 21: New method: Exception#message=(aString) (dave, 2001-07-30 16:21:04)

Status: Rejected

This would allow setting or modification of message of an existing Exception object.

Comments

We don't need this. (matz, 2001-08-01 03:08:54)

if we have RCR#21 (exception clone in Exception#exception).

Plus, it makes exception mutable, so it must be handled carefully.

Reason for rejection

we don't need this, I think.

RCR 25: unparsed string literals (eli, 2001-08-08 11:50:24)

Status: Rejected

From a discussion that arose on the ruby-talk mailing list; it might be useful to have a string literal sequence that is free of any backslash escapes, largely when using strings for regular expression matching. Python's version: r'this string is unparsed' Perl's version: q{this string is unparse} The backslashes will appear in the resulting string rather than being escaped. The %q syntax could be borrowed for this: %r/This is unparsed./ Where / can be anything. This avoids the rather unpretty: gsub(////, "") If I understand the ruby parser, this wouldn't be a particularily hard feature to add, and since it only occurs at parse time, wouldn't be hurting anybody. Pretty low priority, though.

Comments

suggestions for enhancements (anonymous, 2001-08-09 01:05:25)

%r is taken for regexps, of course. What about %u indicating unparsed or %l for literal text.

In python there is the possibility to include the delimiting character by doubling it: "r'this doesn''t seem right'" I would favor to leave out this special handling, it is not that neccesary.

Maybe omit the detection of nested parenthesis that is possible in the other %-notations too ("%q{nesting {really} works}") and make the parser really just return literally what is written.

marko schulz

Re: suggestions for enhancements (HughSasse, 2001-08-09 05:52:30)

Marko Schultz wrote:

Maybe omit the detection of nested parenthesis that is possible in the other %-notations too
("%q{nesting {really} works}") and make the parser really just return literally what is written.

But that is the whole point of detecting nesting, isn't it? You get what you wrote rather than internal close-brackets being interpreted as the end of the string. And you don't have to quote them with backslash. Maybe I misunderstood you -- what would removal of nesting detection allow you to do that you cannot do now?

Reason for rejection

No need for another string literal.

RCR 27: Text similarity methods in String (aj, 2001-08-17 09:37:14)

Status: Rejected

Ruby is short of methods to compare two strings to determine how similar they are. (At least) two different approaches comes to mind: soundex or the Levensthein string distance. Incidently, I have made a which enables these methods in string.c.

Comments

Internationalisation (anonymous, 2001-08-17 12:30:10)

Is the soundex algorithm specific to English or can it work with any language?

Re: Internationalisation (aj, 2001-08-17 13:39:04)

Soundex have only mapping mappings for english characters A to Z (at least in my implementation). Their values are specific to english pronunciation.

Re: Internationalisation (anonymous, 2001-08-17 20:43:27)

... however the Levenshtein algorithm does not suffer from these limitations.

Sounds perfect for a mixin (anonymous, 2001-08-17 23:00:11)

I would rather see this as a mixin, written in Ruby. No need to clutter the String class.
A single interface could work for multiple methods (soundex, levensthein, etc.), and apps would mix in the one they want to use.

Even better... a Behaviour (anonymous, 2001-08-18 10:15:29)

Different classes in the same app might want to mix different string comparison methods into the String class, causing name clash.

An even better approach would be to lexically scope the mixed in behaviour, using Dave Black's Behaviours package.

Reason for rejection

It's too English (or western languages) centric.

RCR 31: Kernel#method_var: simple hidden per-method variables (green, 2001-08-27 11:15:42)

Status: Rejected

After seeing a million examples of overriding methods in classes by aliasing an existing method to something with a more obscure name, defining over that method and then calling the alias of the old method in the new method's code, I convinced myself there HAD to be a better way. In that vein, I've implemented the ability to "attach" an anonymous variable to a given method.

It works like this:

module MyStringComparison
    def self.enabled
        String.class_eval <<-'END'
            old = instance_method(:<=>)
            def <=>(other)
                c1, c2 = [self, other].collect {|s|
                    s.gsub(/[^[:alnum:]]*/, '').upcase
                }
                method_var.bind(c1).call(c2)
            end
            define_method_var(:<=>, old)
        END
        begin
            yield
        ensure
            String.class_eval <<-'END'
                define_method(:<=>, method_var(:<=>))
            END
        end
    end
end

irb(main):003:0> "Abc...?" <=> "A B C!"
1
irb(main):004:0> MyStringComparison::enabled {"Abc...?" <=> "A B C!"}
0
irb(main):005:0> "Abc...?" <=> "A B C!"
1

As you can see, not only does it leave the String class unchanged, it also does not provide any visible evidence of the : method being overridden, and the method overriding can be recursed without worrying about how exactly to do aliasing, defing, and undefing without conflicting upon names for old method storage.

Pretty simple and generalized, eh? :) You could essentially attach any variable you would like to a method in this fashion, but it's most useful for keeping an "anonymous" copy of the old method around. The implementation I came up with can be found .

Regarding the technical aspect of the proposed change, it increases the size of a NODE by one pointer, as well as the size of a FRAME. It shouldn't add any significant amount of overhead because of this, and remain invisible on benchmarks.

Comments

Questions, also comparison with Ruby Behaviors (dblack, 2001-08-29 23:20:28)

Hi --

I'm not sure I have the hang of what the advantage would be of this over the aliasing approach. It seems to involve at least as many steps. I don't (yet) see why this way would be any easier, or make things easier to keep track of. Can you elaborate a little on that?

I don't think method_var covers exactly the same "problem space" as Ruby Behaviors, but there are certainly some points of contact, so perhaps some comparative remarks would be in order. With Behaviors, having to alias and unalias isn't an issue because the aliasing and unaliasing are done automatically. You design a Behavior, (such as one which redefines ), and the underlying engine takes care of swapping it in and out. Then in your top level code you invoke the behavior in a block.

For what it's worth, here's what you'd have to do to get a Ruby Behavior that did your string method change. ("ship" is the default alias for the spaceship operator :-)

module Behaviors
 
  class Behavior
    class BriansThing ",
               (other)
                      c1, c2 = [self, other].collect {|s|
                        s.gsub(/[^[:alnum:]]*/, '').upcase
                      }
                      c1.ship(c2)
                    end
               EOS
        )
      end
    end
  end
end
 
 
# Test
 
include Behaviors
b = Behavior.new(:BriansThing)
 
s = "Abc...?"
t = "A B C!"
 
p s  t              # => 1
b.adopt { p s  t }  # => 0
p s  t              # => 1

Anyway -- I'm not sure how to the point this is, but note that you get that same effect of a temporary change (but not without possible dangers... but that's another story), without having to explicitly save and restore, since the underlying Behavior engine does that for you.

Re: Questions, also comparison with Ruby Behaviors (dblack, 2001-08-30 07:08:45)

Whoops, I forgot to escape the spaceship operators :-)

That should be:

    p s  t               # => 1
    b.adopt { p s  t  }  # => 0
    p s  t               # => 1

Solution not general enough (matju, 2001-08-30 12:56:49)

Solution only allows to bind one var to the method.

Can be fixed if everyone agrees not to set method_var directly, and that method_var is a pointer to the Method object representing that Method. Then you can add accessors to the Method, as you can do with Class objects.

It would work better if Methods were made more like "real objects"; that is, if there a 1-to-1 identity correspondence so that when you query for a specific method, you get always the *same* object. It would remove most of the nastiness associable with the solution I propose.

Reason for rejection

It's not simple as you described. I'm not sure how it can be used.

RCR 32: non-bang []=, maybe a.set_at(2,new_value) (tobi, 2001-09-01 12:34:02)

Status: Rejected

I'm trying to modify specific items in a nested array, without modifying the original.

I tried some variations, but most things that come to mind failed. There are ways to achive what I want (I found some hack that works for me), but I believe it would be good for the ease of use to introduce the following:

[]= is bang;
so
a version of []= that doesn't modify the receiver
would be cool.

1.1.
Array#set_at position, new_value
 an_array.set_at(2, new_value)
 
and maybe
1.2.
 Array#set_at!

or
1.3.
 an_array.at(2) = new_value
 

# for example

"add a newline to the second and last item in the array"

an_array.each_index do |x|
 new_val = an_array.at(x) + "n"
 if (x == 2) or (x == (an_array.size - 1))
  an_array.set_at(x,new_val)
 end
end

or even:

an_array.each_index do |index,item|

or

an_array.collect |index,item|
  if (index == 2) or (index == (an_array.size - 1))
    item + "n"
  else
    item
  end
end

Comments

further explanation (tobi, 2001-09-01 17:15:44)

These for example are describing examples of collect and collect!
in the Pickaxe Library Reference:

collect

a = ["a", "b", "c"]
a.collect {|x| x + "!" } -> ["a!", "b!", "c!"]
a -> ["a", "b", "c"]

collect!

a = ["a", "b", "c"]
a.collect! {|x| x + "!" } -> ["a!", "b!", "c!"]
a -> ["a!", "b!", "c!"]

The following is what I want to have as an additional, new method
for Arrays (for ease of use, convenience, and nicer code):

set_at

a = ["a", "b", "c"]
a.set_at 0,"x" -> ["x", "b", "c"]
a -> ["a", "b", "c"]

set_at!

a = ["a", "b", "c"]
a.set_at! 0,"x" -> ["x", "b", "c"]
a -> ["x", "b", "c"]

Tobi

'set' doesn't sound right (dblack, 2001-09-01 21:19:38)

Hi --

As per my ruby-talk postings, I'm not sold on this. But in any case, I'd think a different name would be better. 'set', in any form, suggests that the receiver is going to set something.

Maybe: a = [1,2,3] a.with_at(0,100) => [100,2,3]

Re: 'set' doesn't sound right (dblack, 2001-09-01 21:21:36)

<sigh> Formatting error.

The code should not be all run together:

a = [1,2,3]
a.with_at(0,100)  # => [100,2,3]

David

Re: 'set' doesn't sound right (tobi, 2001-09-02 03:04:06)

Great! Thanks for your suggestion. My whish is to have that functionality for arrays;
the name of the method is open for discussion.
with_at position,new_value
with_at! position, new_value
or: (all with bang and non-bang versions):
fill_at, place_at, value_at, at()=, at()!=, replace, set_at, ...

Re: 'set' doesn't sound right (tobi, 2001-09-02 04:32:28)

sorry, one in the listed doesn't make sense:
at()!=
maybe:
at()=
and
at()=!

Re: non-bang []=, maybe a.set_at(2,new_value) (michael, 2001-09-02 05:57:23)

> an_array.each_index do |x|
>   new_val = an_array.at(x) + "n"
>   if (x == 2) or (x == (an_array.size - 1))
>     an_array.set_at(x,new_val)
>   end
> end

What's wrong with the following?

an_array.dup.each_index do |x|
  an_array[x] += "n" if (x == 2) or (x == an_array.size - 1)
end

Sorry, but I think that set_at or with_at would not be used that often to add it to the libs.

  a.with_at(pos, val)

would equal to

  
  (b = a.dup)[pos] = val; b

correct?

What I'd like to have is a collect_index or something alike. For me this would be much more useful, because it would let me ommit a counter variable.

Regards, Michael

Re: non-bang []=, maybe a.set_at(2,new_value) (dblack, 2001-09-02 07:01:02)

> What's wrong with the following?
 
> an_array.dup.each_index do |x|
>  an_array[x] += "n" if (x == 2) or (x == an_array.size - 1)
> end

Only that the dup doesn't do anything :-) But Tobi's example seems to throw away the results of set_at, so it ends up as a no-op. Which doesn't in itself mean too much, but I still think that this isn't needed in the core language.

> a.with_at(pos, val)
>
> would equal to
>
>
> (b = a.dup)[pos] = val; b
>
> correct?

Yes.

> What I'd like to have is a collect_index or something
> alike. For me this would be much
> more useful, because it would let me ommit a counter
> variable.

I'd like that too -- so much so that I wrote it a few months ago :-)

  module Enumerable
    def map_with_indices
      res = []
      each_with_index do |e,i|
        res.push yield e,i
      end
      res
    end
  end

Re: non-bang []=, maybe a.set_at(2,new_value) (tobi, 2001-09-02 08:33:34)

map_with_indices
would be a great addition to Ruby.
(just hours ago I thought that collect_with_index would be cool;
but I like your choice for the name much more)
It should be available for arrays etc. and perhaps both bang and non-bang versions would make sense.
Maybe you want to consider posting a new RCR for that?

Re: non-bang []=, maybe a.set_at(2,new_value) (tobi, 2001-09-02 08:51:13)

... I still want a non-bang version of []=,
which could look something like
set_at(new_value, position)
or
with_at(new_value, position)
(= this RCR is not obsolete;
map_with_indices
is a seperate request)

What about Ruby Behaviors? (MrCode, 2001-09-02 12:38:27)

David, I'm surprised you didn't bring this up, but couldn't this "set_at" and "map_with_indices" functionality be coded as Ruby Behaviors and just included when needed?

My opinion is that core API changes should be experimented with as Ruby Behaviors and once we have some way to quantify how much certain behaviors were used, then we could consider adding them to core. Once the "Behaviors Distribution System" is in place (which might just be based on my "RubyGems Distribution System"), it will be easy to see how much certain Behaviors have been shared and maybe even how much they are used.

In fact, if Ruby Behaviors were packaged as RubyGems, I could add some functionality that keeps track of how much each Behavior Gem is used on a machine, then upload that to some running count of all the uses in all installations, and then we would have an automatic "RCR Voting System." The more a Behavior is used "in the real world" the better chance it will be added to the core API. In fact, this same logic could be applied to libraries in determining if they should be added to the standard library and distributed with Ruby.

What do you guys think?

--Ryan Leavengood

Re: What about Ruby Behaviors? (dblack, 2001-09-02 15:39:44)

Hi Ryan --

> What do you guys think?

I don't think Ruby core changes should be vote-based -- nor do I think they are likely to be :-) I tend to see it more as: people might use and exchange ideas, and based on their experience (qualitative and quantitative) decide whether an RCR is a good idea. Behaviors definitely have the potential to play a role in this, and to divert some of the RCR traffic.

I happen to think that map_with_indices actually should be a method of Enumerable, based on my own wanting to have it and talking with other people about it. Meanwhile I've got my own Behavior-friendly version :-)

Reason for rejection

The idea itself is interesting, but we need better appearance.

RCR 33: New method map_with_indices (tobi, 2001-09-03 03:53:16)

Status: Rejected

Several have expressed their desire for a new method with versions

map_with_indices
map_with_indices!

This could be added to Enumerable, and made available for arrays (etc.) I believe it would be a very handy method, that adds to Ruby's readability, conciseness and elegance. For example, it would obsolete the need for counters in certain operations.

Comments

Re: New method map_with_indices (dblack, 2001-09-03 08:55:43)

map_with_indices would mean that instead of this:

  res = []
  ary.each_with_index {|e,i| res.push "#{i}: #{e}"}

you could do:

  res = ary.map_with_index{...}

Dinky example, but I think it's a very useful method.

Also it would add symmetry to Enumerable. Though actually to be fully symmetrical (or whatever), we'd also need map_indices, which would be the map equivalent of each_index. (instead of (0...ary.size).map {|n|...})

Example (tobi, 2001-09-03 15:53:27)

Here's a running example, using an implementation of map_with_indices by David Black:

####################
# by David Black:
module Enumerable
  def map_with_indices!
    res = []
    each_with_index do |e,i|
      res.push yield e,i
    end
    replace res
  end
  def map_with_indices
    dup.map_with_indices!
  end
end
####################

a1 = ['zero','one','two','three']
a2 = ['zero','one','two','three','four','five']

# add a newline to the second and last element of any array

def my_format a
  new_a = a.map_with_indices do |element,index|
    if (index == 1) or (index == (a.size - 1))
      element + "
"
    else
      element
    end
  end
end

a1_formatted = my_format a1
a2_formatted = my_format a2

p a1_formatted # -> ["zero", "one
", "two", "three
"]
p a2_formatted # -> ["zero", "one
", "two", "three", "four", "five
"] 

Tobi

Re: Example (tobi, 2001-09-03 15:59:54)

The newlines in the example don't show as backslash-n, but as newlines. I double-escaped them, which worked for previewing, but not for posting. So here it's in txt:

http://www.pinkjuice.com/txt/map_with_indices.rb

applicability to hashes? (anonymous, 2001-09-05 06:13:00)

hash.map_with_indices { |val, key|
# do your stuff
}

would be a swap of parameters w.r.t. the existing each_pair. Confusing.

Reason for rejection

reject this time. I think this should be solved by somthing like Enumerator class.

RCR 34: Selective Export (jvoegele, 2001-09-10 16:10:04)

Status: Rejected

Eiffel provides the ability to export a feature to a specific class or classes. This means that the feature is visible only to the specifically named classes and their subclasses. I think this would be a useful feature for Ruby. Consider the case when an attribute needs to be set by some external component, but in general is read-only. C++ and Java provide similar mechanisms with friends and package access respectively. It could perhaps work something like this:

class Foo
    def bar
        puts "bar"
    end

    # Make bar visible only to MyClass
    export :bar, MyClass
end

Comments

Re: Selective Export (cout, 2001-09-11 07:53:38)

Sounds like an interesting idea.

1) Could you come up with some examples where this would be useful, but the problem could not be solved cleanly any other way?

2) Would it be possible to not export a feature to ANY other classes?

3) In the example you have here, "bar" is a public function, and is already visible to other classes. Would the export keyword make it invisible or change its access level? If this is the case, it sounds like "export" is doing a lot more than exporting a method.

4) Would a selectively exported method be accessible to derived classes of the current class?

Refinement (jvoegele, 2001-09-11 08:41:07)

To address your questions:

1) Consider an Object Oriented database. All modifications to database objects must occur within the context of a transaction. If an object is modified within a transaction, it must be marked as dirty so that the database can save it after the transaction is commited. Persistent objects could provide a mark_modified method exported to only the Transaction class, so the transaction could mark the object for storage but no other part of the system should do so. Or perhaps you are writing a more basic persistence system, and only the persistence system should be able to invoke save and load on objects.

More generally, any tightly coupled abstractions will need access to each other's methods that shouldn't really be exposed to users of them. Think "List" and "Cursor", for example.

2) Eiffel handles this by exporting to the class NONE. I think Ruby could handle it by either using the already existing "private" semantics, or perhaps exporting only to NilClass.

3) I made the presumption that export would work in a similar fashion to public, protected, and private. This means that the export would override the previously declared access, just as private would override the default public access. In fact, we could go further by giving export the exact behavior of private and protected: if you use export(ClassName) without a method as an argument, it applies to the rest of the methods up until the next public, protected, private, or export statement.

4) That's something that needs to be figured out. Eiffel allows it, but that's because Eiffel doesn't have true "private" access; everything is always available to subclasses. Obviously, this policy is the right one for Ruby. My initial thought, however, is that, yes, subclasses should have accesses to selectively exported features.

errata (jvoegele, 2001-09-12 09:47:16)

The third sentence should have read "Obviously, this policy ISN'T the right one for Ruby."

Re: Refinement (cout, 2001-09-13 10:52:55)

Could your database example be solved with the following code:

private
  def foo
    puts "foo!"
  end

public
  def bar
    return self.method(:foo)
  end
end

f = Foo.new
b = f.bar
b.call

Re: Refinement (anonymous, 2001-09-13 11:46:49)

Perhaps a simpler example is in order. This is entirely Eiffel, as my Ruby is pathetic at the moment:

class A
feature { ANY } -- Visible to all

   value_of(other: A): INTEGER is
      -- What is other's value?
      do
         Result := other.value
      end -- value_of

feature { A } -- Visible only to A

   value: INTEGER

end -- class A

This is highly trivial, but it should give a fair idea of what can be achieved with such a mechanism. Here we basically give access to 'value' only to other objects of type 'A'. This cannot be accomplished using the protected/private/public approach.

Re: Refinement (jvoegele, 2001-09-13 12:31:41)

Unless I misunderstand, I don't think this resolves the issue. The bar method effectively makes the foo method public again. Anyone can invoke bar, correct?

Another example from the OO Database perspective: every object has an Object Identifier (OID) that is generated by the database. However, not all objects are created by the database. They can be created using a standard "new", and later made persistent. At that point in time, the database could call an "oid=" method to set the OID. No other part of the system, however, should be able to set the OID of an object.

Also, many of the GoF patterns could be improved by using selective export. The "visit" method should be available only to Visitor and subclasses of Visitor, for example.

Consider also Java's nested classes. Nested classes have access to each other's private parts. The same scheme could be accomplished by exporting necessary features to what would otherwise need to be a nested class.

Just a few more ideas about where such a scheme would be useful.

Perhaps I was not clear... (cout, 2001-09-13 13:25:30)

I did not mean to imply that any object should be allowed to invoke bar() and get a reference to the private method foo. In a more realistic situation, bar() could check who the caller is (perhaps by requiring the caller to pass its binding as a parameter, then evaluating "self.type", or perhaps by passing in a key that only the calling class could possibly know). The reason for doing this in bar() instead of in foo() is because it is probably not a cheap operation, and so doing it only once is desirable.

The obvious disadvantage with this method is that it is not as clean as a real "export" feature.

Also, how do you propose to do the type checking? Since classes in Ruby are very dynamic, what is to stop someone from doing this:

class Database # repoen the Database class
  def foo(databse_object)
    database_object.exported_method()
  end
end

There are a lot of places in Ruby where access control could certainly be done better, and other places where it is just plain broken. (I think this is the former case). So here's a question to whomever is reading: what can/should be done about the more general problem of fixing access control in Ruby?

Re: Perhaps I was not clear... (jvoegele, 2001-09-13 14:43:02)

OK, this makes more sense to me now. In other words, your example wasn't implying blindly passing a reference to a method to any caller, but instead first checking to see who that caller is. I can see that that's a reasonable approach. Perhaps export could even be implemented to do just that?

As for reopening classes to gain access to restricted methods, well I think the point is to try to prevent accidental misuse, rather than intentional abuse. I've used similar mechanisms to gain access to the private instance variables of other objects by reopening class Object and writing a method that does exactly that. I had my reasons and was aware that this was not how the other classes were intended to be used, but I really did appreciate the flexibility of Ruby to allow me to do something like that. So I'm not too concerned about the fact that there are ways around access control, only that it allows us to reflect our intentions and prevent accidental misuse.

Possible with RCR #15? (rtarpine, 2001-09-15 08:21:10)

If RCR #15 is accepted, couldn't one check call_stack to get the caller object, check its class, and see if it's one of the "friend" classes?

'export :bar, MyClass' could alias bar and define a new bar method that would check the caller class and raise an exception if it is not permitted or call the aliased method if it is. This could be used to allow only the specified class (callerobj.class == MyClass), or all subclasses (callerobj.kind_of? MyClass).

Multiple classes could be stored in an array (raise "Not a friend class" unless [MyClass,MyClass2].find { |c| callerobj.class == c }).

Alternatives? (jvoegele, 2001-10-10 09:03:49)

OK, so it doesn't appear that there's a lot of support for this idea. So let me ask you, what would be your solution to the situation where a subset of methods in a class/object are used only by some subset of the system at large?

Say a factory class needs access to "setter" methods, but in general the objects created by the factory should be seen as immutable. Do we just use a comment that says "DO NOT USE!", or can you think of some other mechanism for reducing the clutter of the public interface?

I don't like it (matta, 2002-01-16 23:41:53)

I don't like this for many reasons.

Ruby is at its heart a simple and clean scripting language with simple rules for public, protected and private. This adds more complexity. Ruby does not need every feature of other OO languages.
This ties the class closely with another class by name, reducing its reusability. In C++, I've used friend classes in this way only to regreat it later when the system changed and I needed access to those "private friend" methods elsewhere (i.e. for test code, logging code, new features).
If "Foo" and "MyClass" (from your example) are so intimately related, other mechanisms such as instance_eval or send can be used to get at private things in Foo.

Reason for rejection

It doesn't fit well with Ruby's simplest scoping. Intersting idea though.

RCR 35: String/Array equivalence (larsch, 2001-09-16 07:21:11)

Status: Rejected

In some languages, strings are simply arrays of characters. I don't think this would be appropriate for Ruby, but it would be nice if they were somewhat equivalent. This is how constructor equvialence would look like: Array.new() #=> [] String.new() #=> "" (*) Array.new(4, 'A') #=> [ "A", "A", "A", "A" ] String.new(4, ?A) #=> "AAAA" (*) String.new("foo") #=> "foo" Array.new([1,2,3]) #=> [1,2,3] (*) Lines marked by (*) are proposed changes to Ruby.

Comments

Re: String/Array equivalence (larsch, 2001-09-17 13:52:49)

Perhaps I should clarify the poor formatting:

Array.new() #=> [] String.new() #=> "" (*) Array.new(4, 'A') #=> [ "A", "A", "A", "A" ] String.new(4, ?A) #=> "AAAA" (*) String.new("foo") #=> "foo" Array.new([1,2,3]) #=> [1,2,3] (*)
(*) are suggested changes to Ruby.

Re: String/Array equivalence (matz, 2001-09-17 21:39:37)

The latter 2 in above 3 patterns are contradicting. And I think this difference caused because nature of Arrays and Strings differ.

The first one (String.new => "") should be added.

Re: String/Array equivalence (larsch, 2001-09-21 09:15:48)

How are they contradicting? The "overloading" should be possible because the number of arguments differ. Also, if you see a String as equivalent to an Array restricted to Fixnum elements in the range (0..255), String.new(4, ?A) should be equivalent to Array.new(4, "A").

Re: String/Array equivalence (matz, 2001-09-25 02:05:50)

>Also if you see a String as equivalent
>to an Array restricted to Fixnum...
But they are not. Strings and Arrays are both sequence that share common beheviors, but Strings are not Arrays, and vise versa. Despites the fact that I found

 Array.new([1,2]) => [1,2]

useful.

Reason for rejection

Strings and arrays are similar but different objects, as I believe.

RCR 36: basic constructors (matju, 2001-09-17 10:49:55)

Status: Rejected

Symbol.new should replace String#intern; the latter would only be kept for backwards compatibility.

Class.new and Module.new, when called with a block, should also call #module_eval with that block.

change Regexp#to_s so that it returns a string that gives back the first argument to Regexp.new (or equivalent).

add Regexp#options and Regexp#lang, which gives back Regexp.new's 2nd and 3rd arguments.

Comments

Could you re-subscribe this RCR? (matz, 2001-09-18 23:50:51)

As four separate RCRs, please.

Reason for rejection

This should be four separate RCRs.

RCR 39: Symbol.new (matju, 2001-10-08 01:48:59)

Status: Rejected

Symbol.new should replace String#intern; the latter would only be kept for backwards compatibility.

Comments

Re: Symbol.new (matz, 2001-10-09 01:45:45)

But it doesn't allocate new symbol. We don't have Fixnum.new etc. How should we have Symbol.new?

How about Symbol.intern(str) instead?

Re: Symbol.new (matju, 2001-10-10 14:30:15)

a class that keeps track of all its instances may return an existing instance if the design is such that a completely equivalent substitute for the requested construction can be found in the set of existing instances.

In that light, Symbol.new makes sense.

off-topic:

Fixnum.new does not exist because there is nothing anyone has found to construct from. For Bignum it could make sense if it'd take a string (array of bytes) in base 256 (conversion from base 2**32 to 10 and back is slow); but anyway, Fixnum and Bignum are implementation details so it should be Integer.new instead (which would return either a Fixnum or a Bignum). As an alternate idea, Integer.new could be like Integer() or String#to_i, but it wouldn't be very useful.

Reason for rejection

I still feel "new" implies allocation. Maybe Symbol.intern() might be good choice of the name.

RCR 44: Chaining relational operators. (HughSasse, 2001-10-08 12:15:07)

Status: Rejected

Some time ago I suggested that relational operators might return their right hand sides as in the Icon programming language, so one could write if 3 http://www.eng.cse.dmu.ac.uk/~hgs/ruby/comparisons.html> for what I wrote at the time. I have just updated it because of Perl6's includes acceptance of . I think it may therefore ba appropriate to consider this again for Ruby, given the support support for Perlists in the language philosophy. The Perl RFC uses a different suggestion from mine to get the same effect. I don't mind which is adopted. I did not see this on the list of RCRs despite having raised it on the list in the past, so I hope it's OK to add this here.

Comments

oops! (HughSasse, 2001-10-08 12:18:19)

That if statement got botched. I only checked it twice! Anyway, you can see from the URLs how it should have looked.
Sorry!

Hmm. (cout, 2001-10-08 16:10:29)

Given:

  1 &lt; x &lt; 5

If 1<x is false, then what does it mean to have:

  false &lt; 5

If this is guarantee to be false, then would:

  false&gt;= 5

always be true? In that case, what happens with:

  1&gt;= x&gt;= 5

if 1>=x is false?

I think making this change introduces a lot of problems, and the "C way" of writing such expressions is not too terribly difficult to read.

BTW, Rubygarden's translation of &lt; and &gt; into < and > makes it difficult to type messages like this.

Re: Chaining relational operators. (matz, 2001-10-09 01:31:01)

Long time ago, Ruby used to allow comparison operator chaining, just like you proposed. But what really needed is shortcut evaluation, e.g.

1 1 <x and x <5

so if 1 <x is true, x <5 should never be evaluated. This must be implemented by modifying syntax. And I'm not sure (and little bit against) for complicating Ruby's syntax more.

Re: Hmm. (HughSasse, 2001-10-09 03:49:02)

It just depends how you define
boolean relop numeric
or
numeric relop boolean
It seems to me that they could both
give false.

Re: Hmm (rjp, 2001-10-09 05:22:32)

You wouldn't get

  false 
because it would short-circuit at the first failure, much like

  (1  already does.

Re: Hmm (rjp, 2001-10-09 05:24:29)

That should be

false <5

and

(1 <x) and (x <5)

(I blame Mondays)

Re: Chaining relational operators. (cout, 2001-10-09 09:02:54)

This already works (except that I think you meant that if 1 false then x irb(main):002:0> def foo; puts "foo!"; return 5; end nil irb(main):003:0> 1 def foo; puts "foo!"; return 0; end nil irb(main):006:0> 1

Re: Chaining Relational Operators (bobalex, 2002-01-22 09:12:13)

Returning the right-hand value on success is a wonderful feature of Icon, but it works only because Icon conditionals "fail" rather than returning a false value. In Icon, that eliminates the ambiguity between the conditional reporting failure and a right-hand term of false. You also get the side benefit of it short-circuiting the operation as soon as failure is eminent.

Unfortunately, that concept really doesn't work well in Ruby, mostly because of the inevitable ambiguity.

Reason for rejection

It is rejected because it is too hard to define object-oriented semantics of chained relational operators.

RCR 45: Hash#keys should be a base method for other methods (rjp, 2001-10-10 06:25:01)

Status: Rejected

At the moment, if you subclass Hash and reimplement Hash#keys, this doesn't affect any of the other methods like Hash#each_key, Hash#each_value, Hash#each_pair, Hash#values, etc. It would useful if they built on Hash#keys when subclassing the Hash class since that would mean you only needed to supply an implementation of Hash#keys, not all of them. This would also be closer to Perl's tie since that only requires FIRSTKEY and NEXTKEY to provide all the functionality.

Comments

And furthermore... (anonymous, 2001-10-10 11:39:19)

I'd like to see this approach taken generally for all methods of Hash and Array. It would be nice if both Hash and Array's methods were implemented in terms of a small, and well documented subset of necessary methods. I'd like to be able to create my own Array-like and Hash-like classes without reinventing the wheel. As it is though, I have no idea what methods need to be reimplemented, and which can safely be trusted to "do the right thing".

I asked about this on the mailing list once, and was referred to matju's Hollow* classes (in the MetaRuby package). Unfortunately, they enforce some constraints on their implementation that are inconsistant with the built-in Array and Hash classes.

Re: And furthermore... (HughSasse, 2001-10-10 12:05:09)

If Hash used a module to do this, then other things could as well: having keys() defined would give you similar leverage to have each() defined. Modules Enumerable and Keyable, perhaps?.

use MetaRuby (matju, 2001-10-10 14:44:24)

i asked for this a year ago. Now there's a library called MetaRuby (current version 0.7) which allows you to do that.

require "Hollow/Hash"

from there, you create a class that implements #length, #has_key?, #get, #put, #remove, and #each_key, and then you include HollowHash. This is equivalent to Perl's tie(). (note that #keys is implement with #each_key)

If you want to start with a complete implementation and override things, you use the AutoHash class. This one is used to implement undo queues on Hashes. (see the provided samples)

The same things also exist for Array and String. With some help I'd do also IO/File.

Re: MetaRuby (rjp, 2001-10-10 19:15:32)

Sounds good, but it's too fiddly. All I wanted to do was override the #keys method to return keys in a particular order, not to reimplement all the other methods as well. The AutoHash class sounds useful, but it's not built-in and it's a kludge around the problem that Hash has too many methods that aren't based on #keys (or #each_key which makes more sense now I think about it) and should be.

Something like this should be a fundamental in a language as object-oriented as Ruby.

Re: Hash#keys should be a base method for other methods (cout, 2001-10-11 09:03:02)

I think it makes more sense to reimplement Hash#each_key and have Hash#keys call Hash#each_key. The reason for this is that Hash#keys returns an array with ALL the keys; for a large Hash, this could require a lot of memory.

An even better alternative is to have Hash#each_key implemented in terms of Hash#each_pair. This would allow one to reimplement just one method to get the effect that you want.

Re: #each_key instead of #keys (rjp, 2001-10-11 10:02:18)

Definitely makes more sense this way. I'm not sure about using #each_pair though -- what if the value is computationally intensive?

Re: #each_key instead of #keys (cout, 2001-10-11 10:19:00)

In that case, you could override each_pair, each_key, and each value. Everything would work fine overriding just each_pair, but you could optimize by overriding methods that depend on each_pair.

Reason for rejection

Not in the current implementation. Method invocation is too slow. But in the future, it's good to see this happens.

RCR 46: Socket#gethostbyname should use gethostbyname() (rjp, 2001-10-10 06:37:37)

Status: Rejected

As it stands now, Socket#gethostbyname calls the system getaddrinfo() call instead of the gethostbyname() call. The underlying code should call gethostbyname() to be consistent since getaddinfo() returns the information in a different format which needs converting (via gethostbyaddr() which breaks some lookups unnecessarily) and may not return the "right" information.

Comments

Re: Socket#gethostbyname should use gethostbyname() (anonymous, 2001-10-10 07:42:48)

One problem with using gethostbyname() and gethostbyaddr() is that they will block. Since Ruby does not use real threads, this means that the entire Ruby application is hanging while a lookup is being made. Someone pointed out on #ruby-lang that his timeouts weren't working when a lookup failed; I think this is why. The name resolution in Ruby really needs to be replaced with some specialized functions that know about rb_thread_select(), imo.

Reason for rejection

gethostbyname(3) does not work right with IPv6. I refactored Socket#gethostbyname to reduce unnecessary lookup on 1.7.

RCR 50: Simplified use of map by adding optional parameter (feldt, 2001-10-26 06:12:27)

Status: Rejected

Change map to take an optional parameter which is a Symbol which is sent to each element (as a message). Example:

  [1,2].map(:succ)  #=> [2,3]

I know that you can do it with

  [1,2].map {|e| e.succ}

but its more than a "shortcut". IMHO its more in line with how map is used in functional programming languages. I we want to be really fancy we could also allow methods as parameters. But maybe that's taking it too far. I'm not sure what should happen if one both gives a parameter and a block. Either skip the parameter or apply it first? Probably the latter since you would otherwise not specify the parameter. So here's the new semantics I'm asking for (I'm not sure about the extra args but added them anyway):

class Array
  def rcr50_map(symbol = nil, *args, &block)
    if symbol.kind_of?(Symbol)
      a = self.map {|e| e.send(symbol, *args)}
    else
      a = self
    end
    a.map(&block)
  end
end

Oh, it should really be Enumerable#map but this is just an example!?

Comments

This is a little too non-orthogonal for me (dblack, 2001-10-26 07:35:05)

Robert, I'm not sure why the point that your suggestion is more in line with
functional languages is really relevant. Down that road lies Ruby becoming a real pastiche of
different things. Well, maybe it is a little already :-) But to me, this suggestion
really doesn't add anything and, to the extent that it's syntactic sugar, it actually has a
rather awkward look.

what should happen if... (anonymous, 2001-10-26 07:52:59)

...I pass both a block and a symbol?

Re: Simplified use of map by adding optional parameter (michael, 2001-10-26 08:57:20)

Just a suggestion for an alternative method name:

[1,2].map_with(:succ) #=> [2,3]

How about a default receiver (buter, 2001-10-26 09:15:10)

Would a default receiver be
something? A thing analogous
to '$_' which would receive
the symbol and the args?

Renald

Ok, but then don't call it map! (feldt, 2001-10-26 15:22:54)

Fair enough, but map is VERY common in functional languages and carry a special baggage there so for me (and at least some other now-Rubyists) its a little disturbing not begin able to do this.

For me its not PoLS without this new feature. Thats why I had to propose this. Not as syntactic sugar but as "PoLSification".

Look at the code again (feldt, 2001-10-26 15:24:32)

First the param and then the block since its not sensible to give the param if you have a block.

I think you misunderstand (anonymous, 2001-10-26 16:30:45)

My question is not "what does your code do?"; my question is "is what your code does correct?".

re: calling it map (avi, 2001-10-26 22:13:28)

That's why I always use the collect/select/detect versions of that protocol - the ruby behavior matches the smalltalk behavior exactly, whereas it doesn't match my (functional) notion of what map/filter/etc should be...

I don't like this (anonymous, 2001-10-27 06:02:34)

The thing I don't like about this is that every time you write a method that takes a block you would have to add support for this behavior (for orthogonality). It seems like a lot of repetitive work for a small piece of syntactic sugar.

Niklas

Ok, sorry. (feldt, 2001-10-29 04:13:53)

So I did.

Good point (feldt, 2001-10-29 04:14:39)

Thats a good point.

Syntax change proposal (pit, 2001-10-31 14:08:35)

I like Roberts idea, but would propose a slightly different syntax: instead of

  [1,2].map(:succ)  #=> [2,3]

I'd write

  [1,2].map(&:succ)  #=> [2,3]

Of course this would require a parser change and I doubt it will be done, but it would have two advantages:

You can't use both the & parameter and a block, so no ambiguity here.
The method writer doesn't have to be aware of the use of this syntax: the method is called with an ordinary block.

Passing more than one parameter could be done with

  [1,2].map(&:+, 4)  #=> [5,6]

No strange syntax changes please (anonymous, 2001-11-30 12:14:35)

Come on, you can't make strange
changes like that.
The original idea works, I would say.

Re: Ok, but then don't call it map! (anonymous, 2002-01-28 19:17:31)

What about apply?

Reason for rejection

It's too functional so that I'm afraid it would not work well with OO nature of Ruby. Maybe it's matter of notation.

RCR 51: Add upcased?, capitalized? and downcased? to String (phil_tomson, 2001-10-26 21:25:36)

Status: Rejected

I needed to figure out if a string was all uppercase or not, so I wrote a trivial method to do it called upcased? and put it in the String class. Since we have upcase, capitalize, and downcase methods for String, it might be nice to also have upcased? capitalized? and downcased? in the standard library as well.

Comments

the code (phil_tomson, 2001-10-27 01:18:40)

Like I said it's trivial, but here it is for reference:

class String
   def upcased?
      if self.upcase == self
         return true
      else
         return false
      end
   end
   
   def downcased?
      if self.downcase == self
         return true
      else
         return false
      end
   end

   def capitalized?
      if self.capitalize == self
         return true
      else
         return false
      end
   end
end

shortened code (anonymous, 2001-10-27 07:35:13)

Rewrite this to:



          class String

          def upcased?

          self.upcase == self

          end

          

          def downcased?

          self.downcase == self

          end

          

          def capitalized?

          self.capitalize == self

          end

          end

I don't think this is enough code to be worth making a library function, when writing it out directly is understandable enough anyway.

thanks (anonymous, 2001-10-27 12:26:51)

Yes, you shortened it considerably!

Even though it's a trivial amount of code, it would be nice to have it in the standard library because it makes things more orthogonal: New Ruby programmers will see the 'capitalize' function and wonder if there is a 'capitalized?' function as well (I know I did).

Yes, but we must consider feature bloat (feldt, 2001-12-06 03:44:32)

IMHO, you're basically right but we must always consider the additional cost of adding to the builtin Ruby classes: more methods to learn, larger code base and larger libs etc.

I think it would be good to have "standard" additions to the base classes where the code for this and similar proposals are gathered. So that everyone could do

require 'additional/string'

or something like that and then get a "full" String class.

Reason for rejection

This should be done in additional library.

RCR 52: attr_initializer shortcut for def initialize(...) (gan, 2001-10-29 17:34:11)

Status: Rejected

Instead of the suggestion in RCR #3, how about the following syntax as a shortcut for the same "problem":


  attr_initializer :foo, :bar

This can be done by adding it as a method to Module, just like attr_accessor and similar methods.


class X
  attr_initializer :foo, :doo, :bar
  ...
end

basically expands to:


class X
  def initialize(foo, doo, bar)
     @foo, @doo, @bar = foo, doo, bar
  end
  ...
end

( More info in [] )

Comments

how about this? (jtra, 2001-10-29 18:07:49)

this is nicer:

class X
  def initialize(@foo, @doo, bla, @bar)
    @bla=bla.to_f
  end
end

This would be same as:

class X
  def initialize(foo, doo, bla, bar)
    @foo=foo; @doo=doo; @bar=bar
    @bla=bla.to_f
  end
end

and this has benefit of having standard initialize too.

Matz doesn't like this idea... (maki, 2001-10-29 21:32:53)

see [].

wrong link? (anonymous, 2001-10-29 22:04:42)

I don't see Matz's words there.

this doesn't buy me much (anonymous, 2001-10-29 22:11:26)

Most of my intialize methods are far more than just setting the values of instance variables. *If* a solution is proposed that allows shortcuts for instance variable initialization, then the solution needs to allow me to add my own code.

One way to do this with the proposed method is to have attr_initializer take a block, but this is ugly, IMO.

Re: wrong link? (maki, 2001-10-29 22:28:37)

Oops, sorry. Valid URL is ] . And you can see old discussion on ] -.

It's about the common case, not full flexibility (gan, 2001-10-30 05:35:12)

Thanks for your comments, both of you. About this:

class X
  def initialize(@foo, @doo, bla, @bar)
    @bla=bla.to_f
  end
end

I just don't think that looks good or is easily understandable. It uses two different styles/mechanisms to do the initialization.

However, some argue that you should be able to do (for any method, not only initialize): def anymethod(@foo) and then have the instance_variable @foo be assigned directly, simply because it is consistent with blocks as in { |@foo| ... } I can sympathize with that but I think it's a different issue. I also don't find the initializer written as above particularly clear and wouldn't like to write them that way. In that case I prefer writing out an intitialize like it is today. It just doesn't resonate with me, but of course people have different tastes.

To the second poster, I don't agree that a shortcut must be fully flexible. attr_initializer should surely not take a block, just like attr_accessor does not! If you need that, instead use the standard method.

Consider that also attr_accessor, attr_reader, attr_writer are shortcuts for a common case and therefore they are useful, but there are still times when you need to define your own accessor methods too, like if you want to put constraints on the values, do bounds checking, or to let the getter return a different type than you use internally. In all these cases you fall back to the standard method and the case is the same here. It is clear attr_initializer does not, for example, allow you to add default values. It's not meant to solve every case.

Let's make one thing clear, a shortcut like this, also attr_accessor, is not meant to replace but to extend. It allows you to define the function manually if you want or when you need to.

Consider most other RCRs that suggest to add a fairly specialized method to some class just for completeness, and people often seem to think that's a good idea. But additions should be made only for commonly used things. For me, the above initialize is very common for simpler container-type classes. For others I need to use a full def initialize(). I think RCRs should be rejected if they are not common enough, to avoid growing the language too much, and you are free to do so if you wish.

But you find attr_accessor useful, don't you? Then attr_initializer could be useful too. But not for everyone perhaps. It's just a suggestion and you need to decide if it is worthwhile for you or not. Whether to allow def meth(@var) or not is another question. In fact, I would still find attr_initializer useful regardless. To all readers - if you are reading RCRs, please vote, even if you specify that you don't really care.

attr_initialize (hutchike, 2001-10-31 08:12:30)

I agree - it's a "noisy" feature, and most of my initialize methods do more than copy args into members.

Writable? Readable? (anonymous, 2001-11-14 11:03:14)

My biggest problem with attr_initializer: should those instance variables be read-only, read-write, or private? The "initializer" attribute of an instance variable is orthoginal from the "readable/writeable" attribute.

Is not about that (anonymous, 2001-11-30 12:08:47)

Yes. It probably wasn't supposed to replace attr_accessor. You still need to write attr_reader/writer/accessor to get accessors.

Reason for rejection

It doesn't appeal me much. You can define it by yourself if you really want.

RCR 53: second argument to Dir.glob, specifying file test (dblack, 2001-11-25 19:37:30)

Status: Rejected

This came up on #ruby-lang recently and I thought it might be RCR-worthy.

The idea is: allow a second argument to Dir.glob (aka Dir.[]), that argument being used as a filter, passing everything returned by the glob through Kernel#test.

Example: Dir.glob("*.rb", "f") (returns all *.rb that match test("f", name)) Implementation:

class &lt;&lt; Dir
  alias :oldglob :glob
  def glob(g,t=nil)
    d = oldglob(g)
    if t
      d.find_all {|e| test(t,e)}
    else
      d
    end
  end
end

(P.S. There seem to be extra blank lines being put in here but I'm not using Opera this time :-)

Comments

Sv: second argument to Dir.glob, specifying file test (anonymous, 2001-11-30 12:05:33)

Sorry but I feel it's
unnecessary.

You can't support every possible
combination of functions so RCRs that
don't simplify much should be rejected. This situation isn't common
enough to warrant yet another variant.
It will be more difficult for someone reading the code to know about every possible extra feature of functions.
Simple orthogonal functions are better.

What's wrong with chaining
Dir.glob("*.rb").filter {|f| isOK(f)}?
or whatever the syntax is... ?

You're probably right (dblack, 2001-12-01 16:11:04)

Actually I tend to agree with this -- that we don't want language level support for everything. The extra arg to Dir.glob seemed sort of innocuous, and sort of logical, and whatever. But I completely understand your point.

Reason for rejection

This task should be done by combination with glob and Enumerable#select.

RCR 54: Allow String indexing using Regexps to use groups (josb, 2001-12-04 19:04:33)

Status: Rejected

Does the following extension make sense? If yes, how hard would it be to implement?

s = "a foo and a bar"
re = /a (f.*?o) and a (b.*?r) to/
foo, bar = s[re]
puts foo,bar # -> foo bar

Comments

Ruby can do this... (anonymous, 2001-12-04 20:55:09)

... but using Regexp#match or String#scan instead of String#[].

As a side note, I think your regex is a bit messed up (since f.*?o matches "fo", then none of the rest of the regex matches; I'm not sure where the "to" part is supposed to fit in).

Re: Allow String indexing using Regexps to use groups (jfh, 2001-12-04 23:07:58)

This seems similar to the String#match I proposed on ruby talk a couple of weeks ago:

a, b = foo.match(/(bar).*(baz)/)

No one showed much interest. I like either of s[re] or s.match(re) better than:

dummy, a, b = /(bar).*(baz)/.match(foo).to_a

or

a, b = [ $1, $2 ] if foo =~ /(bar).*(baz)/

Actually, I think I may like yours better, fewer characters :->

----------------------------------------------------------------------
| Jim Hranicky, Senior SysAdmin                   UF/CISE Department |
| E314D CSE Building                            Phone (352) 392-1499 |
| jfh@cise.ufl.edu                      http://www.cise.ufl.edu/~jfh |
----------------------------------------------------------------------

Re: Ruby can do this... (josb, 2001-12-04 23:49:25)

Yeah, the regex is incorrect, sorry (maybe foo+ is better), but I hope you get the general idea: to show the utility of being able to anchor the pattern used to extract or replace (when used as an lvalue) part(s) of the String object.

Re: Allow String indexing using Regexps to use groups (josb, 2001-12-04 23:55:48)

I also like a, b = foo.match(/(bar).*(baz)/)better than the alternatives (foo.scan(/(bar).*(baz)/).flatten[0] is another one but also less concise). The proposed syntax really is a generalization of the existing mechanism. Thanks for your support :-)

this is rather perl-like :-) (DavidBlack, 2001-12-05 07:21:25)

I disagree that

a, b = foo.match(/(bar).*(baz)/)

"is a generalization of the existing mechanism", because the existing mechanism is that match returns a MatchData object.

I don't think it's so bad to do:

a,b = /(hel).*(ere)/.match("hello there")[1..-1]

(note: no "to_a")

Re: this is rather perl-like :-) (josb, 2001-12-05 13:20:55)

I was talking about my original proposal, not the foo.match() example. It's already possible to say foo = s[/foo+/], so why not extend this to foo = s[/(foo+)/], then on to foo,bar = s[/(foo+) and (baa+r)/]?

Here's why not (cout, 2001-12-05 15:22:04)

It changes functionality. Presently, this is what happens:

irb(main):001:0&gt; s = "foo and bar"
"foo and bar"
irb(main):002:0&gt; s[/(foo).*(bar)/]
"foo and bar"

If I understand correctly, then this is what you suggest:

irb(main):001:0&gt; s = "foo and bar"
"foo and bar"
irb(main):002:0&gt; s[/(foo).*(bar)/]
[ "foo", "bar" ]

Why would the String#[] return an string in almost all cases, but an array of strings in one oddball case? (Yes, I know that aString[aFixnum] -> aFixnum is already an exception).

BTW, could someone please fix the HTML filter to not change &gt; to > when I click "preview"?

Re: this is rather perl-like :-) (jfh, 2001-12-05 15:38:19)

I still feel Regexp#match is less elegant
than either of the new methods proposed
here. Since I'm drawn to Ruby because it
is so elegant in the first place, it's
somewhat of a big deal to me.

After all, POLS and HOOP are the main
philosophies driving Ruby, if I'm not
mistaken, and having String#match feels
to me to be less suprising and more human
oriented than not having it, especially
when there's a String#gsub.

It seems I'm in the minority, though.

Re: Here's why not (josb, 2001-12-05 18:36:05)

Because it is concise and convenient. So the return type of String#[] depends on its argument; is that a big deal?

Re: this is rather perl-like :-) (josb, 2001-12-05 18:38:58)

I agree, for the reasons you mentioned we should at least have String#match. You may be in the minority but you are not alone feeling this way.

Concise is relative :-) (dblack, 2001-12-06 06:50:38)

Doing what you're proposing would make one thing concise, but it would also mean that other things would become less concise.

For example:

str = "I say: Hello there to you"
re = /(Hel).*?(ere)/
str[re]  # =>  "Hello there"

would have to be rewritten in a less concise manner.

Every method that used String#[] with, say, a regex passed in as an argument, would now have to examine the regex, and/or branch on the type of the return value of [], etc....

I agree that having to examine MatchData objects can be a bit cumbersome. But in a sense you're suggesting that a similar examination process be conducted on the return value of String#[], instead of on a MatchData object -- plus introducing an inconsistency in the method's behavior.

Re: Concise is relative :-) (josb, 2001-12-06 13:07:48)

As much as I hate to admit it, you make a good point here :-/

Re: Concise is relative :-) (jfh, 2001-12-07 14:13:24)

So, should I ( or someone ) put up an RCR
for String#match ?

Re: Concise is relative :-) (josb, 2001-12-10 23:14:15)

Yes please.

Reason for rejection

use Regexp#match instead.

RCR 56: Integer division again. (slumos, 2001-12-31 17:38:26)

Status: Rejected

I've done some searching on ruby-talk.org, but I couldn't find any argument that satisfied me, so I thought I'd post here rather than have it gone over on ruby-talk yet again. The one thing that all arguments on ruby-talk seemed to have in common is that they all talk about "3 / 2". But real code NEVER does that, real code does "x / y". I claim that in the second case, returning 0 when when x = 3 and y = 2: 1) violates POLS, 2) is almost never what a randomly sampled programmer wants to happen (maybe the same thing as (1)), and 3) makes code unnecessarily ugly. The proof of (1) is just to point out what other languages do. Most of the Ruby code I write is to do experiemental calculations, and it has been the case too many times that my lengthy calculation ends up returning 0 because I forgot to put a .to_f somewhere. I'm hoping that the proof of (2) will come from how the vote on this RCR goes. Even if I'm wrong and a randomly sampled programmer is more likely to want integer division when they write "x/y", then a special operator for integer division (or even floating point, I could get used to that) would be a better solution than the current, because: (3) "x / y.to_f" is just plain ugly. One of the things I like about Ruby is that I can talk Ruby to non-Ruby programmers. A non-Ruby programmer does not have to think about what "x / y" means if it means the right thing, but with "x / y.to_f", they have to stop and think about why .to_f is necessary.

Comments

some issues (cout, 2002-01-01 02:36:45)

1) Yes, 3/2 == 0 violates POLS. The answer should be 1.

2) In C if I write:



          &nbsp;&nbsp;&nbsp;&nbsp;float x = 3/2;

          &nbsp;&nbsp;&nbsp;&nbsp;printf("%fn", x);

then I get 1.000000 printed to the screen, because / is the division operator, and when applied to two integers, integer division is what is done. To do anything else in Ruby would violate my POLS. But as stated before, POLS in Ruby only applies to Matz.

3) If you really want floating-point division, then use floats instead of integers, i.e.:



          &nbsp;&nbsp;&nbsp;&nbsp;puts 3.0/2.0

4) A special operator for integer division needlessly complicates the language. What happens when I try to apply integer division to two non-integers? Should we also add special operators for complex division and for vector division?

4) I suspect that for the work you are doing, Ruby is probably not the right solution for the job. A language better suited to floating-point calculations will probably perform much better (i.e. be faster and more accurate).

BTW, having this entry field convert &nbsp; to a space every time I press "preview" is a nuisance. The field should be left alone in all cases; filtering should only be applied to the output.

Blatant advertisement (taw, 2002-01-01 18:33:39)

If you do any serious integer computation with Ruby, you should consider using Ruby-GMP. It gives correct result (and is much faster):

require 'gmp'
include GMP
x=Z(3)
y=Z(2)
print x/y,"n" # 3/2, that is - rational

URL:
http://freebsd.orzeszkowej.ble.pl/~taw/ruby_gmp.alpha5.tar.bz2

Ruby-GMP this is highly experimental, but should be usable.

Re: some issues (slumos, 2002-01-02 03:33:35)

>1) Yes, 3/2 == 0 violates POLS. The answer should be 1.

Ack. I meant to say 2/3 = 0. I am working with probabilities currently, so it is impossible to get a result that could be considered useful with integer division and I was trying to use a similar case.

>2) In C if I write:

So what? C is a low level language. Would you argue that:

a = Array.new(3)
a[3] = 1

should have behavior based on C?

>puts 3.0/2.0

I would never write something so worthless. If I want to print "1.5", I'll use puts "1.5". This type of argument is exactly what I stated I wanted to get away from. It clouds the issue. What you are really saying is that I should use "puts x/y.to_f", which I claim is ugly and confusing. I would also never write anything like your C example, what real code does is more like:

float x;
float y;
/* calculate, read, etc x and y */
printf("%f", x/y);

which is why it isn't an issue in C. In Ruby, it's

# calculate, read, etc x and y
puts x/y

but if x and y happened to not become Float in the preceding code, you get a bogus answer.

>4) A special operator for integer division needlessly complicates the language.

It simplifies programs for the majority case at the expense of having to know another operator in the rare case.

>Should we also add special operators for complex division and for vector division?

Should we remove binary - since we can just combine binary + and unary - instead? Maybe += needlessly complicates the language as well.

>4) I suspect that for the work you are doing, Ruby is probably not the right solution for the job. A language better suited to floating-point calculations will probably perform much better (i.e. be faster and more accurate).

Is that because you assume that I want to type 2/3 at my program? I'm extracting certan features from XML files and calculating the probability of finding certain features in future files, so I have a lot of

prob = feature_count / total.to_f

Speaking of counter-intuitive, if you believe that 3/2 means integer division, why should 3/2.0 be floating-point division? The dividend is an integer. Shouldn't we have 3/2.0 = 1 and 3.0/2 = 1.5?

SUMMARY: Real programs (almost) never compute constant values, so it does not make sense to talk about the result of "2/3". Ruby is different from C because in C when I see "x/y", I can look at the declaration of x and y and know every time what kind of division it is. Since Ruby doesn't have declarations, the type of division in "x/y" depends on things that happen at runtime and so we need some other way to guarantee floating-point division. Currently, the common case of floating point division has a burden of explicit conversion. I think that burden should be shifted to the uncommon case instead. Adding another operator is not even necessary, I have no problem with having a single division operator that carries out the standard mathematical division operation, and writing "(3/2).to_i" when I want integer division.

Re: some issues (cout, 2002-01-02 11:07:28)

So what? C is a low level language...

C being low-level and Ruby being high-level isn't the issue. Ruby is dynamically typed, and it's perfectly legal for you to pass integers into a function that is expecting floats. Adding an integer division operator does not solve this problem.

Incidentally, Ruby is written in C, and Ruby extensions are written in C. That's not an insignificant argument for maintaining some coherence between the two languages.

...but if x and y happened to not become Float in the preceding code, you get a bogus answer.

This is definitely a problem with dynamically-typed languages, and extends far beyond just division. If I have a function that expects a String and I pass it an Array, then what should that function do? Note that C++ has a similar problem with templates:

  template&lt;typename T, typename U&gt;
  float foo(T t, U u) {
    return t / u;
  }
  ...
  std::cout &lt;&lt; foo(3, 2) &lt;&lt; std::endl;

Should we remove binary - since we can just combine binary + and unary - instead? Maybe += needlessly complicates the language as well.

The operators you mentioned do not carry type information with them. An integer division operator and a floating-point division operator both carry type information. That is why they seem to not match Ruby philosophy.

>4) I suspect that for the work you are doing, Ruby is probably not the right solution for the job. A language better suited to floating-point calculations will probably perform much better (i.e. be faster and more accurate).

Is that because you assume that I want to type 2/3 at my program? I'm extracting certan features from XML files and calculating the probability of finding certain features in future files...

No, I'm suggesting that you use a different language, because:

Ruby is an interpreted language, and is slower than an equivalent solution in a compiled language. A Ruby solution for a program that must process large amounts of data will probably have many pieces implemented as a C extension in order to be efficient.
Ruby does not have many of the necessary functions for ensuring that you do not have roundoff error, etc. C, Java, and Fortran all either have such functions in the standard library, or available as a non-portable extension.
Ruby does not have many libraries for performing statistical and numerical analysis. Thus, to do something even as simple as find where a function intersects the x axis (the zeros), you will have to reinvent the wheel. Fortran has netlib.org. C also has some stuff at netlib, but also has good bindings to octage, maple, mathematica, and more. There's probably even something equivalent for Java.

...I have a lot of

prob = feature_count / total.to_f

So why not convert your data to floats when you read them in from your xml file? That seems to me to be the simplest solution.

If you really want Fixnum#/ to do floating-point division in spite of all this, then one of the beautiful things about Ruby is that you CAN make this happen:

  class Fixnum
    def /(other)
      p self
      p other
    end
  end

  3 / 2 # will print 3, then 2, then return nil

In order to keep this from interfering with Ruby code that you didn't write, you may want to consider using something like Ruby Behaviors to limit the scope of the change.

Re: some issues (slumos, 2002-01-02 21:16:38)

C being low-level and Ruby being high-level isn't the issue.

I think it is. A high-level language ought to allow me to think about numbers instead of int, long, float, double. Saying that it shouldn't do that is in my opinion equivalent to saying it shouldn't have lists. Out in the world, when I divide two of those things called `numbers', I sometimes get a result with a decimal point in it. I don't care whether the `numbers' already had decimal points in them or not.

So why not convert your data to floats when you read them in from your xml file?

A perfectly reasonable question, except that I'm not reading values from the XML, I'm counting the occurance of certain values. Although as I said, this particular program is irrelavent (for one thing it is already written and running), it long ago occurred to me that I could just initialize values to float, like:

   feature_count = 0.0
   [...]
   feature_count += 1 if feature(doc)
   [...]
   prob = feature_count / total

but think for a second what that conveys to someone reading the code. Why am I doing that? Since the value is a count, perhaps they should change it to 0 to improve efficiency. Or maybe I should mark my code with comments like:

   # Don't change this or else Ruby will
   # do integer division later on.
   x = 0.0

As ugly as I think using .to_f is, at least it puts the explicit conversion with the division instead of some random amount of code before it.

In a final attempt to be clear, I don't have any major problem with telling Ruby what I mean using .to_f, I'm concerned with telling humans what I mean. Which is why I want to use Ruby in the first place. And when I say ``want to use'', I mean in general. The specific program in question has long-since been written with copious and ugly .to_f's.

Ruby is an interpreted language, and is slower than an equivalent solution in a compiled language.

Ruby isn't just a slower language, from the language shootout it appears to be just about the slowest language. If that was my primary concern, I wouldn't be using it.

A Ruby solution for a program that must process large amounts of data will probably have many pieces implemented as a C extension in order to be efficient.

Lucky me, an interface to a fast XML parser is already there. You might want to look up REXMLBuilder for your own reference.

If you really want Fixnum#/ to do floating-point division in spite of all this, then one of the beautiful things about Ruby is that you CAN make this happen:

I don't claim to be a Ruby expert, but I have been using it for some time now, have prototyped a simple fulltext search engine and Bayesian classifier in it (calculating probabilities, get it?) and I think you can assume I'm familiar with the simplest concepts of the language which you chose to demonstrate. I don't do what you suggest because that is even worse for people reading the code.

Meanwhile, nobody has mentioned why it is so important to have integer division as the default. The closest you've come is to say that it's not an insignificant argument to say Ruby should be like C. Yet you seem to be so opposed to it that you'd rather I use a different language than advocate that Ruby compute a floating point value when dividing integers. So what exactly is the problem with:

   3 / 2         #=> 1.5
   (3 / 2).to_i  #=> 1

? Do you have a language-design argument for why this would be undesireable? Does it go something like ``but, well, the arguments are integers''? Should I then count for you the number of built-in methods that return a type different from their arguments?

It's even easy to implement without much overhead, since Ruby already calculates the remainder when it sees 3/2 only to throw it away later. If the remainer is 0, return, otherwise redo as a floating point divide.

My argument is that this way conveys more information and is more clear to humans reading the code than any method you care to name for guaranteeing floating point division using the current semantics. If you do manage to name a method that is equally informative and clear, I'll gladly use it.

You also haven't answered my question about 2 / 3.0. Is your only argument for why this should be a floating point division that C does it that way? It sure looks equally like an integer division to me.

Its a correctness issue. (JimWeirich, 2002-01-02 21:51:25)

I agree with this RCR. I see no advantage in making division dependent upon the dynamic type of the arguments. 99% of the time, when a programmer writes x/y, they know which kind of division they want. I can't think of any algorithm that would require dynamic selection of the division operator. If fact, if you can't guarantee the type of x and y, then every time you divide, you must write either



  x.to_i / y.to_i  # to get truncating division

or

  x.to_f / y       # to get  non-truncating division

I suspect there is a lot of code out there that inappropriately assumes it always gets integers when doing division. That seems dangerous. That's why I see this as a correctness issue.

If changing the division operation would break too much legacy code (a very valid objection), then consider introducing two new division operators that always use the same kind of division.

Fatal Error (cout, 2002-01-03 07:40:32)

Fatal error: Can't redeclare already declared function in messages.en.php on line 10

Why do I see this when I click on "Read the rest of this comment?"

Which operators? (cout, 2002-01-03 07:50:38)

Which symbol (or combination of symbols) would you propose Ruby use to indicate integer and floating-point division?

Because.... (anonymous, 2002-01-03 08:03:27)

PHP wanted to give you just one more reason to like Ruby...

Hopefully it's fixed.

Dave

Its a correctness issue. (slumos, 2002-01-03 16:59:49)

If fact, if you can't guarantee the type of x and y, then every time you divide, you must write either [...]

Exactly. And I like the terms truncating and non-truncating division that you use because it shows that adding new operators does not mean that they somehow carry type information, it should be perfectly reasonable to use truncating division with floats.

Agreed (Bill, 2002-01-04 23:45:36)

Well put. There's certainly a reason why Python is switching to two different division operators (after much contention, to be sure). Really, the only reason in favor of keeping it the old (C-style) way is that it's, well, the old way of doing it. But having the mathematical function invoked depend on the types in a dynamically typed language is just a recipe for lots of annoying little bugs and .to_f 's.

My plan (matz, 2002-01-07 01:05:02)

My plan to solve this problam is that adding a new method for more accurate division, that returns most accurate division (Float by default, Rational if you require 'rational').

I have to get a proper name for the method (quotient maybe?). And this may not solve the original problem (I'm not good at even basic math). So I'd like to hear your opinion.

Some questions (cout, 2002-01-07 08:21:17)

1. Where will this method be defined?
2. How will it handle new numeric types? (suppose I add a BigFloat type)
3. How will you determine what kind of division is required?
4. If it is determined by a parameter, then would it be reasonable to divide this one giant method into many smaller methods?

It depends (slumos, 2002-01-07 17:16:09)

I think the main question to answer is ``What is the common case?''. If you add a method called quotient, how likely is it that there will be a module to alias / to quotient, which almost all programs will end up requiring? If it is very likely, then the name for quotient should be /, and instead of quotient there should be truncated_quotient and a module that aliases / to that.

That is assuming that breaking existing programs until they add require 'integer' or something is acceptable.

As long as there is some reasonably fast method that is a guaranteed non-truncating division, I'll be much happier, although all my programs will end up aliasing / to whatever it is.

Re: Some questions (matz, 2002-01-07 21:00:03)

1. Where will this method be defined?

Each numeric class, like other numerical operators.

2. How will it handle new numeric types? (suppose I add a BigFloat type)

By coerce system, like other numerical operators.

3. How will you determine what kind of division is required?

I'm not sure what you mean. If you want precise division, use the new method, otherwise use '/'.

4. If it is determined by a parameter, then would it be reasonable to divide this one giant method into many smaller methods?

We chose coerce system for numeric type mix. It's done before this division discussion.

Re: It depends (matz, 2002-01-07 21:18:50)

I think the main question to answer is ``What is the common case?''.

It depends person to person, program to program. I myself have almost never used int/int -> float division. I guess this kind of problem should be solved by selector namespace or similar technology.

Re: Some questions (cout, 2002-01-07 21:33:11)

I'm not sure what you mean. If you want precise division, use the new method, otherwise use '/'.

Your original statement was that you would add "a new method for more accurate division, that returns most accurate division (Float by default, Rational if you require 'rational')." My question is how you would determine what the most accurate division is (particularly for and between user-define numeric types).

BTW (slightly off-topic), is there anything happening in Ruby with regard to ? Would this be useful here?

Re: Some questions (matz, 2002-01-07 23:24:37)

Your original statement was that you would add "a new method for more accurate division, that returns most accurate division (Float by default, Rational if you require 'rational')." My question is how you would determine what the most accurate division is (particularly for and between user-define numeric types).

In the near future implementation, rational.rb overwrites Integer#quotient, I think. In the far future implementation, it might use selector namespace; but don't ask me what it is. It's still a vague idea.

Re: Integer division again (anonymous, 2002-01-08 21:10:46)

I basically agree (and Python has recently changed to this behavior), but all you need to do is require "mathn" to get Ruby to act as a mathematician would expect (i.e., 2/3 is the rational number two-thirds).

Re: It's a correctness issue (anonymous, 2002-01-09 13:45:21)

If you require "mathn" then

7/3

gives the rational number 7/3, while

7 .div 3

gives 2 (C-like integer division), and

7 .divmod 3

gives [2, 1] (integer division with remainder).

Rather than needing an RCR, I think this issue could be addressed by popularizing the mathn standard library (which is not mentioned in the pickaxe book or the bezoar-goat book).

Regards, oinkoink (I haven't yet figured out how to put my name in the header!)

Re: some issues (anonymous, 2002-01-10 11:53:30)

Ruby's Numeric class hierarchy is ''just'' a supped up OO-rapper - very well done for the most part - around the ordinary C-numeric type system. From this point of view it makes plenty of sense to expect ''normal behavior'' - i.e.

2/3 == 0.666..

and I don't feel like changing to a different language if I only need to make a couple of quick calculations. Take for example the current implementation of the Matrix#inv method, as implemented in the standard library 'matrix'. Right now, because of this IMHO weird semantics of :/, you get

require 'matrix'

p Matrix[[2.0, 3], [3, 4]].inv #=> Matrix[[-4.0, 3.0], [3.0, -2.0]]
p Matrix[[2, 3], [3, 4]].inv   #=> Matrix[[1, -1], [-1, 1]]

Not that is a big issue, but this type of "dynamic behavior" gives probably every mathematician a big grin.

There other mathematical oddities in Ruby standard libraries, for example the insistence of providing a (naturally flawed) implementation of :

-- Chr. Rippel

Re: some issues (anonymous, 2002-01-10 12:05:17)

Hm this board does not seem to like my comments;-) - i.e the rest of this posting should read as ..

There are other mathematical oddities in Ruby's standard libraries, for example the insistence of providing a (naturally flawed) implementation of #

-- Chr. Rippel

Re: some issues (anonymous, 2002-01-10 12:11:03)

... #> and #div and friends for Complex objects. Generally speaking the whole coerce framework of the Numeric class hierarchy, which is basically a Band-Aid for the fact
that Ruby does not have (symmetric) multiple dispatched methods, naturally sticks out like a sore thumb from the rest of Ruby single dispatch philosophy,
but not having binary operators, with multiple dispatch method like behavior, would make the language very impractical for any type of numerical work.

mathn (slumos, 2002-01-11 18:44:05)

From what I can tell, require 'mathn' satisfies me.

I'd still be happier if it was the default behavior, but I'll live.

It is definitely the case that mathn needs more publicity.

mathn (anonymous, 2002-01-28 19:54:33)

Thanks for the tip. I didn't know 'mathn' was there.

Re: some issues (patsplat, 2002-03-18 00:53:00)

I see why this would be annoying, but there are also good reasons for keeping both integer and floating point division around. (when working with atomic things like pixels, files, array entries, etc. )

If this is an application specific problem, you can have a require that would redefine division appropriately...

Reason for rejection

IN 1.8, you have Numeric#quo for better precision. Changing / behavior will cause serious compatibility problem.

RCR 57: Use 'String literals' as :Symbols. (anonymous, 2002-01-02 19:00:05)

Status: Rejected

The use of :symbols for quick lookup has resulted in (mostly external) library methods that don't uniformly accept both :symbols and "strings".

It is painful to try and keep track of which methods accept :symbols and which methods accept "strings".

Could fixed string 'literals' be the lexical equivalent of Symbols? Any string literal seen in the code goes into the same lookup table as Symbol's, so the two become equivalent.

a = 'hello' (now 'hello' exists in Symbol table)

This change helps avoid the question of "string or symbol?", which is really more of a compiler's crutch than a human problem.

Comments

But do you really want all of that in the symbol table? (anonymous, 2002-01-03 02:06:24)

Sure, it's fine for 'hello' to be equivilent to :hello (and be in the symbol table) but what if the string literal is:
'hello I'm in the symbol table'
That's not possibly a valid variable name because of the spaces.

It's not a real problem (anonymous, 2002-01-03 05:16:10)

It would just be a few additional entries in the Symbol table.

Perhaps... but (anonymous, 2002-01-03 15:15:48)

Perhaps. Maybe I'm being a little paranoid, but isn't there a much greater chance for name collisions? How do you handle spaces as in the example given above?
You can do:
:symbol_name
with symbols, but with a string literal like:
'symbol name' which contains a space do you now have two symbols in the table? (:symbol and :name)?

Perhaps you could examine the string literal to determine if it contains spaces or not. So,
'symbol_name' would become a symbol in he table, but
'symbol name' wouldn't. But I really don't like it...

That's still OK (anonymous, 2002-01-04 00:10:14)

With spaces, it would be the equivalent of :'symbol name', which is an impossible name for a variable to have.

Collusions don't matter, since Symbols only need the guarantee that they hash to the same value regardless of where they are.

If it weren't for this property, methods in different classes wouldn't be able to have the same name.

More information on perspective (anonymous, 2002-01-04 00:24:03)

It doesn't really matter how this is implemented. The question is whether the use of Symbol vs string is creating a problem here.

Does anyone else think that it's becoming difficult to keep track of which calls accept strings and which calls accept symbols?

I feel it is. While each library is self-consistent, it gets complicated when using several libraries.

Re: That's still OK (cout, 2002-01-04 07:21:02)

I don't know what :'symbol name' is (Ruby 1.6.5 certainly doesn't like it), but I can create a symbol with spaces using:

  'a b c'.intern()

Re: Use 'String literals' as :Symbols. (cout, 2002-01-04 07:30:09)

I think there might be some implementation trouble with your proposal. How would I now pass a string literal into a function expecting a string?

Perhaps a better solution would be to cache the hash value of all strings (and string literals could have their hash values calculated when the file is parsed). This would eliminate the need to use symbols for quick lookups in most cases. The only problem I can think of here is that it might break some C extensions that try to manipulate strings directly.

Re: Use 'String literals' as :Symbols. (anonymous, 2002-01-06 21:11:17)

1.6.6:
"abc".type #=> String
:abc.type #=> Symbol

RCR:
"abc".type #=> Symbol
:abc.type #=> Symbol
"abc#{var}xyz".type #=> String? Symbol?

Re: Use 'String literals' as :Symbols. (anonymous, 2002-01-09 12:34:45)

RCR:

"abc".type #=> String
'abc'.type #=> String
:abc.type #=> String

Symbol ceases to exist, as it was only ever a crutch to speed the language up.

Ruby converts Symbols to hash values at parse time. So things don't slow down, Ruby could convert String literals to hash values instead.

Hope this makes the original proposition a little clearer, but please bear in mind that I don't know Ruby internals.

The main point of this RCR was to highlight the growing problem of Symbols vs Strings in external libraries.

this loses some safety (cout, 2002-01-09 20:17:54)

I often use symbols as "constant strings," since a symbol cannot be modified. As it stands, this RCR would require me to freeze all my strings, which would be quite tedious. Perhaps this RCR could be modified such that :foo is a shortcut for creating a frozen string.

There are still many other problems, already discussed, but this is one that can be solved.

Convenience vs. functionality. (anonymous, 2002-01-14 23:52:54)

It's not that inconvenient to check for a string or a symbol, so why must we change the language to treat them identically?

Example: we want to set instance vars from a hash argument and let the user specify either a symbol or a string for the parameter name:

def do_something(hash)
  @foo = (hash[:foo] or hash['foo'] or default)
  @bar = (hash[:bar] or hash['bar'] or default)
end

If we treat symbols as string literals, we still have to do the above, but with one less or case.

I'd rather see changes that improve programming efficiency by a large marrgin rather than satisfy various semantical concerns.

- Leon Torres

simpler idea (anonymous, 2002-01-25 15:30:13)

class Symbol
def intern ; self ; end
end

Then just always call .intern on the string/symbol parameter you're expecting.

It seems like a reasonable bit of orthagonality that belongs in the standard Ruby kit, actually.

Reason for rejection

It's too big change for both semantic and implementation.

RCR 58: java/c++ type method overloading (onekilo, 2002-01-09 16:53:13)

Status: Rejected

Request to have method overloading like c++ and java. It would count the number of parameters passed to it and determine which method to call. cheers. muzza

Comments

Some ruby-talk links (cout, 2002-01-09 20:12:00)

I think dispatching on the type of arguments is a little more powerful than simply counting the number of parameters passed, but if you really want dispatching based on the number of arguments, please see ].

If you would rather have dispatching based on the types of the arguments, see ]. This second option also buys you type checking almost for free (the cost is that it will not work with new types unless they inherit from the appropriate class).

Generally, overloading in Ruby is unnecessary, because you can get similar effects in different ways. For example, Matz suggests an double dispatching as an alternative for overloading in ]. This might not work for all cases, but will work for many cases where overloading is used in C++.

Finally, Matz has already ruled on this subject, for the time being. See ].

Re: Some ruby-talk links (Rich_Kilmer, 2002-01-09 21:27:14)

Actually, where we arrived at in that thread on dispatching was in:

With this syntax, not other class is needed. -rich

Reason for rejection

This RCR contradicts with other Ruby features such as optional argument and dynamic typing.

RCR 60: Consistency in subscripted references (bobalex, 2002-01-22 10:32:14)

Status: Rejected

When "range-like" forms of array and string subscripts are used, and one or more of the endpoints is out-of-bounds, the result is nil if the left endpoint is out of bounds, but is the contianed portion of the string or array that does exist if the right endpoint is out of bounds. In my experience, this leads to having to code excessive parameter checking and result checking. I suggest always returning the portion of the string or array that does exist when either end of a range is out of bounds -- never nil. (Of course, nil should still be returned for single-element subscripted expressions.) For example: s = "abc" s[5..6] and s[5, 2] now return nil, but would return "". s[2..4] and s[2, 5] now return "c", which is good.

Comments

You can use to_s (Cosine, 2002-01-30 15:31:07)

When I run into this problem and I meant to have an empty String returned if I'm out of bounds, I just run the result through it's to_s method. That gaurentees that I have a String. I suspect that someone out there relies on the behavior of nil being returned if you are out of bounds, and since the such a simple solution exists I'm not so big on making that change.

However, I'm not against this idea either. I'm torn on the issue of which behavior goes with the principle of least surprise, since either one could be legitimately expected by reasonable people. And given that, I don't think it ought to be changed since there may be code out there that expects the current behavior.

Reason for rejection

It's good for bug finding.

RCR 61: Array#bagsubtract (HughSasse, 2002-02-11 07:47:18)

Status: Rejected

Array differences have surfaced again on the list ([], see also [] and subsequent messages). A method is needed (by some people, at least) which will find the difference between two arrays but NOT perform uniq!() on the result. That is, a difference operator which treats arrays as bags, rather than sets (which we already have) is needed. I wrote a some time ago, but there may be better implementations. Note, this is needed as an addition to existing subtraction methods, not to replace them. I would be happy if it were called bagminus or bag_minus, for brevity.

Comments

Maybe not (anonymous, 2002-02-11 19:00:55)

I want not mind if a lean Set and MultiSet Class were added as a
standard classes - adding yet another method to the already beefy
Array interface is not the right direction imho.

/Christoph

Re: Maybe not (HughSasse, 2002-02-12 04:52:54)

Array is pretty close to doing what is wanted here. Rather than creating a plethora of new types, adding methods would seem to be the ruby way to go. See for example: [] and []

Re: Maybe not (cout, 2002-02-12 14:25:23)

An Array is not a Set, and is very slow for performing Set operations. A Set implemented in terms of a Hash will perform much better.

If I want an unordered collections of objects, then I should create a Set. If I want an ordered list of objects, then I should create an Array. Rarely do I want to perform Set operations on an ordered list of objects.

Maps and Sets would make a nice addition to Ruby, IMO.

Re: Maybe not (HughSasse, 2002-02-13 11:34:03)

Granted, an Array is not a set, and a hash can be more effective as a set. But Array.-() already works as a Set operator. Changing this now would break existing code. The proposal is to add bag_subtract to Array. Arrays are more like bags than sets.

Sets, Multisets and Maps of high performance may be a valued addition to Ruby. I didn't intend to suggest they should never be added. Performance in some applications may be a strong enough case.

Re: Maybe not (anonymous, 2002-02-14 09:47:11)

The amusing point is that the current set interpretation of the Array class internally converts the arrays involved into Hash Sets and (re)converts the resulting Hash Set into an array (which in the absence of natural element order is an a priori ill defined operation). Following this logic you probably would implement #bagminus etc. by internally converting the arrays into Multi HashSets ... at which point you start to wander if the Ruby way of cramming unrelated behavior into one class interface is always a good idea.

/Christoph

Reason for rejection

It's easily done using iterators. I`ve never required to use Bags.

RCR 63: Coerce-ability of bitwise operators (haldane, 2002-02-11 12:09:59)

Status: Rejected

I notice that none of the bitwise operators have the ability to coerce - so in my class Foo (which has an attribute @value)...

  def initialize(n)
    @value=n
  end
  def coerce (number)
    [Foo.new(number), self]
  end
  def | (n)
    if n.is_a? Foo
      @value | n.value
    else
      @value | n
    end
  end

..does not work as intended. It looks to me like this ability could easily be added in numeric.c - how about it?

Comments

Use "to_int" (matz, 2002-02-13 01:47:27)

When you want to define "coerce" method to convert your object to integers, I guess there's better conversion scheme: to_int.

The object that defines "to_int" behave like integers.

matz.

to_int doesn't solve my problem (haldane, 2002-02-13 04:10:08)

Yes, to_int would be ok for my example, but not for what I am really doing - my real-life class does other stuff instead of just returning integers.

It seems inconsistent that all the other numeric operations call coerce except for the bitwise ones.

There is a work around: to redefine the bitwise operators in class Fixnum and handle it there - but it would be nicer if all the operators worked alike.

Re: to_int doesn't solve my problem (matz, 2002-02-13 22:50:54)

Could you be more concrete? Or perhaps we can discuss on the ruby-talk list.

Coerce does not have operator information, so that it might not solve your problem either.

Bitwise operators do not call coerce because they are integer operation. You can't define bitwise-or between integer and float.

matz.

Why redefining the semantic of bitwise operators? (anonymous, 2002-02-21 00:01:46)

It seems like a bad idea to "redefine" the normal semantic of the bitwise operator unless you have a in-house convention of using the bitwise operators to represent another operation.

This kind of operator overloading usually leads to unreadable and confusing code down the road.

Just curious, why do you want to overload the bitwise operator?

Reason for rejection

Unlike other mathematical operations, bitwise operations are integer operation. No coerce is needed.

RCR 64: data = File.open(filename).read should close the file after the read (joe, 2002-02-20 14:30:25)

Status: Rejected

when you do: data = File.open(filename).read or File.open(filename).write(data) The file should be closed right after that operation because there is no reference to the file handle and no way to close it, you must wait for the GC, so instead of the simple: data = File.open(filename).read we must do: data = "" File.open(filename) { |f| data = f.read } That adds a line and is too much work ;-)

Comments

It isn't quite that bad (Dave, 2002-02-20 14:49:13)

In 1.6, you can do

  data = File.open(name) {|f| f.read}

and in 1.7, there's

  data = File.read(name)

Does it close the file? (joe, 2002-02-20 15:41:48)

I like that: data = File.read(name)

Does 1.7 close the file right after the read?

Yes (Dave, 2002-02-20 22:16:59)

it does... :)

Reason for rejection

It should be done by GC or methods like File::read.

RCR 65: IO orthogonalization, improved reusability (pong, 2002-02-22 12:59:37)

Status: Rejected

The IO class is central for all input and output in Ruby. Unfortunately it is not easy, if at all possible, to subclass IO for new data sources/sinks that are not file descriptor based. For other sources/sinks developers have to reimplement the behaviour of IO. Correctly imitating the behaviour of IO can be difficult and violates the DRY principle. And, to add insult to injury IO manipulates $_ (thread local) and it is not possible, or at least practical, for a pure ruby module to do this. IO should be orthogonalized and factored into a set of modules/classes that provide all the convenience methods for input and output and a set of classes/modules with simple interfaces that provide raw input and raw output. A la istream/ostream/streambuf in C++, except with nice and elegant ruby interfaces. If it is not reasonable for performance reasons to do this, ruby should still provide a class/module that makes it trivial to implement IO-like access to data sources/sinks.

Comments

I second that nomination (gmiller, 2002-02-22 18:03:47)

I struggled with this a few months back while trying to write some code that read from a stream. Maybe I've gotten too used to C++ and Java. I have grown really accustomed to not caring where the data comes from, which also facilitates testing. I ran into the exact same conclusions as you with trying to subclass IO. At that point I then looked at many other codes and saw that there is a large number of occurrences of checking to see whether the object was a string or a file.

Yes. (matju, 2002-02-22 22:05:19)

This is something I've wanted to write for a long time. I've done the same for Array, Hash, String; the IO one is more difficult to do correctly because there are more details. I want something about as generic as the Java one, while keeping the usual case simple. Contact me by email if you are interested.

Excellent suggestion! (anonymous, 2002-02-23 01:07:23)

And the sooner the better. Now how do we get it implemented?

Agree: need for stream (ronjeffries, 2002-02-24 09:49:39)

I agree with this idea. I wanted to implement a memory stream object such as exists in Smalltalk. The difficulty of making it really work like an IO was enough to make me solve my problem another way.

Need concrete proposal (matz, 2002-02-24 22:03:12)

Both behavior and implementation of IO class are file descriptor dependent. Could you (or somebody else) come up with concrete Stream (or whatever) behavior? You don't have to implement (I will), just expected bahavior is sought.

matz.

Not really a proposal, but a place to start (pong, 2002-02-25 08:02:27)

Is there a reason to change the behaviour of IO? I haven't used ruby enough to know if its behaviour has significant short-comings that should be fixed.

The following is from rubyzip (slightly modified). It is not to be understood as a recommendation, rather something to get the ball rolling, discussion-wise.

Thomas

  # relies on: inputFinished?, produceInput and read
  module ConvenienceInputStream
    include Enumerable

    def readlines(aSepString = $/); end
    def gets(aSepString=$/); end
    def flush; end
    def readline(aSepString = $/); end
    def each_line(aSepString = $/); end
    alias_method :each, :each_line
  end
  

  #relies on

truncated post (pong, 2002-02-25 08:12:49)

rubygarden.org truncated my post, even though it looked just fine in the preview. For complete response, please view http://www.rubygarden.org/ruby?IORCR. Comments are encouraged :-)

Another starting point (JimWeirich, 2002-02-27 11:56:34)

I've used a StringOutput class to capture output normally intended for standard output or a file. It is not complete, but does most of what I need in a output stream. I've posted StringOutput (and OutputShell) to the RubyGarden wiki page started by Pong.

IO Abstractions (ianrae, 2002-03-01 12:03:27)

A good i/o subsystem was developed for the Oberon language. See

http://ooc.sourceforge.net/OOCref/OOCref_7.html#SEC22

It separates out various parts of I/O into a number of abstractions. Here is an excerpt:

There are several conceptual layers to the I/O process that are modeled by various abstractions in the OOC

library. Their relationships are shown here:

data locations - where data resides (raw data).

| (e.g., hard disk, memory block, keyboard port, RS232 links)

|

channels - connections to data locations in the form of byte streams.

| (e.g., files - on disk and in memory, pipes,

| TCP/IP connections)

|

basic riders - basic operations on bytes.

| (e.g., SetPos, ReadByte, ReadBytes, WriteByte, WriteBytes)

|

mappers - translations of high level data to and from a byte stream.

(e.g., binary reader/writer, text reader/writer)

A data location (or simply location) is a source of input data or destination of output data. It it the physical or

logical place where data exists; say a hard disk, or keyboard buffer.

A channel is a connection to a data location. A channel is envisioned as a contiguous sequence, or stream, of

bytes. Channels may be sequential as in the case of terminal I/O, a TCP stream, pipes, and so forth; or positionable

like Files and ProgramArgs.

Riders are associated with a channel and provide read and write access of a location; they operate directly on a

stream of bytes (i.e., a channel). Multiple readers and writers can exist for a single channel.

A mapper is a high-level rider; it operates on a particular format of data, like textual or binary

representation of elementary data types. Mappers rely on the primitive operations of basic riders to build more complex operations.

The benefit of differentiating these layers is allowing a way to distinguish between the simple access layer, that

doesn't know a thing about the byte stream being read or written, and the interpretation layer that transforms bytes into useful data.

Reason for rejection

IO is an IO is an IO. We need to extract Stream behavior first.

RCR 66: New methods: Array#merge and Array#squeeze{,!} (knu, 2002-03-13 14:28:15)

Status: Rejected

Array#merge is a destructive version of "|=". In other words, "a.merge(x, y, z)" is a low cost equivalent of "a.replace(a | [x, y, z])". After experiencing a lot of "a.push(x) unless a.include?(x)" phrases, I came to think we'd need it. Array#squeeze{,!}([obj,..]) is an analogy of String#squeeze{,!}([obj,..]). It squeezes sequences of the elements of an array. For those interested, I have to give a try.

Comments

Re: Array#squeeze (anonymous, 2002-03-15 12:31:08)

Do we need #squeeze when we have #uniq?

Re: Array#squeeze (knu, 2002-03-16 12:07:32)

[1,2,2,3,1].squeeze #=> [1,2,3,1]
[1,2,2,3,1].uniq #=> [1, 2, 3]

Actually it is Array#squeeze that works like the uniq(1) command. :)

Clutters the Array namespace (Cosine, 2002-03-18 18:12:23)

I don't like either suggestion, but each for a different reason.

Given how Arrays are implemented, in general (I don't know Ruby's specific implementation but I'm sure it can't be far off), I would expect that this Array#merge would have to traverse the whole Array to figure out if the new element is already present. This is not a task that an Array is designed to do. Also, while it may be a useful method to have for you, I just see it cluttering up the namespace with a method that encourages a poor use of an Array.

When I want to do what you do, I use an Array and a Hash in parallel. Check the Hash for existance, and then push onto the Array if there was no previous existance (as well as update the Hash). The performance difference once there are several thousand elements in the Array should be easily noticable. I haven't abstracted it to a separate class as I should, but I haven't decided the best way to do that, and I'm still relatively new to Ruby.

Also, having Array#merge might even provide a false sense of efficiency to people that don't know how Arrays are implemented.

On to Array#squeeze. I think this is a fine method to have and use in your private library, but I don't think it belongs in the standard Ruby distribution. I think its use is too specialized to allow the intrusion of another name in the Array namespace. I feel the same way about String#squeeze, too, but hoo-hum that's already there.

If you can point out some general uses for squeeze, then I can be turned around on this one. Just because I can't think of one doesn't mean I've closed my mind to the possibility that they exist. But remember, you have to convince me that a fair number of people would want to use this method to turn me around on this.

Re: Clutters the Array namespace (knu, 2002-03-23 04:52:25)

> Given how Arrays are implemented, in general (I don't know Ruby's
> specific implementation but I'm sure it can't be far off), I would
> expect that this Array#merge would have to traverse the whole Array
> to figure out if the new element is already present. This is not a
> task that an Array is designed to do. Also, while it may be a useful
> method to have for you, I just see it cluttering up the namespace
> with a method that encourages a poor use of an Array.

I strongly doubt that. Array already has so many methods that
traverse the array such as sort, uniq, compact, assoc, rassoc,
member?, &, |, etc. . I believe traversing is one of the main
functionalities of Array.

> When I want to do what you do, I use an Array and a Hash in
> parallel. Check the Hash for existance, and then push onto the Array
> if there was no previous existance (as well as update the Hash). The

I do too, but obviously it's far from OO to have two objects for
one data structure.

> On to Array#squeeze. I think this is a fine method to have and use
> in your private library, but I don't think it belongs in the
> standard Ruby distribution. I think its use is too specialized to
> allow the intrusion of another name in the Array namespace. I feel
> the same way about String#squeeze, too, but hoo-hum that's already
> there.

String#squeeze can easily be replaced with String#tr_s() but
Array#squeeze is not very easy to do in place.

Actually, what I was thinking is how to use Array to deal with a set,
so I thought I'd need squeeze() to do uniq() against a set more
effectively.

Now, I've finally decided to write a Set class so I won't need above
methods. I'll come up here again later with a Set class as
the best solution for my problems.

RCR withdrawn, thanks.

Reason for rejection

Use "set" library for merge. I don't see any usefullness of squeeze for Arrays yet.

RCR 68: "p" method should return its last argument, unconverted (bobalex, 2002-03-15 09:48:17)

Status: Rejected

(It currently returns nil.) This would allow it to be inserted into expressions during debugging to see the value of subexpressions: E.g. in:

   a = d + b / c

To quickly see what the subexpression (b / c) is producing, make the following simple edit:

   a = a + p(b / c)

or, to provide a "caption":

   a = d + p("***** b / c:", b / c)

Note that returning the value will cause it to print the value in an irb session (in addition to its printed output), BUT, p isn't often used at the top level in an irb session since just entering an expression prints its value.

Reason for rejection

I like the idea, but I think the name "p" is not sufficient.

RCR 70: A "println" method (bobalex, 2002-03-15 10:11:27)

Status: Rejected

Would work just like "print" but would tack on a newline after outputting all of its arguments. Coding:

println("a", "b")

would be equivalent to

print("a", "b", "n")

I favor this additional method because it better expresses the programmer's intent in the method name and, well, it just looks better!

Note that this is different from the existing "puts" method, which prints each of its arguments as separate lines.

I chose the name println because it is similar to names of functions with the same semantics in other languages (e.g. Java, Pascal).

Comments

A println method (gmosx, 2002-03-20 13:32:39)

I agree that a println method is missing from the standard library.

Append operator (devEiant, 2002-03-23 21:57:20)

You could, rather than add a whole new method just for a join and a newline, use the String Append operator instead of commas:

puts "a" nil

Append operator (retry) (devEiant, 2002-03-23 21:59:47)

[Sorry. Forgot to escape my angle-brackets.]

You could, rather than add a whole new method just for a join and a newline, use the String Append operator () instead of commas:


puts "a" nil

`Try this: (anonymous, 2002-12-13 00:37:15)`

def println(*args) puts args.join end The point is that it's too easy to express this idea within the existing language. If you need a completely one-off solution, you can do: puts [5,6,7,8].join to the same effect. (Array.join always produces a string.)

`Reason for rejection`

Why don't you put explicit newlines at the end of "print" argument. It's just three letter longer.

`RCR 71: String#count could also accept Regexp as argument. (bobalex, 2002-03-15 10:25:00)`

`Status: Rejected`

If a single Regexp argument is provided (instead of a strings), it would count non-overlapping matches of the Regexp. Otherwise it would work just as it does now. This change is compatible with the current String#count.

Specification:


class String
 alias old_count count
 def count(*args)
  if args.length == 1 && args.first.kind_of?(Regexp)
   count = 0
   scan(args.first) {count += 1}
   count
  else
   old_count(*args)
  end
 end
end

`Comments`

`Why complicate count? (devEiant, 2002-03-23 21:07:36)`

Scan works just fine by itself:


"foobarbazbingbong".scan(/[aeiou]/).length
    ==>6

`Reason for rejection`

It makes the meaning of "count" method ambiguous. Use scan for this purpose (I know it's ineffective. But you don't need regular expression count that much).

`RCR 72: Dir methods that produce entries without "." and ".." (bobalex, 2002-03-15 10:34:12)`

`Status: Rejected`

The majority of times when directory entries are iterated, the programmer is interested in only the "child" entries, not the "parent" and "self" entries ('..' and '.'). It would be good to have new methods that produce only children, removing the need to write additional code to filter out the non-child entries.

I suggest the following two new methods in class Dir:

Dir::children(path) produces an array of child entries (similar to Dir::entries but excluding the special entries).

Dir::each_child iterates over child entries (similar to Dir::foreach but excluding the special entries).

`Comments`

`New glob (sodell, 2002-04-19 15:31:08)`

Why not just make a new glob that takes a second, optional parameter to filter filenames returned during iteration? By default, it would return only non-hidden files and directories and would exclude '.' and '..'. The optional parameter could be specified, however, and could also filter out directories, or show only directories, exclude symlinks, dev files, etc., etc.

`Reason for rejection`

1.8 Dir::glob accepts fnmatch(3) option flags.

`RCR 73: The built-in Mutex object should be nestable (recursive) (bobalex, 2002-03-15 10:47:30)`

`Status: Rejected`

The Mutex class, as currently implemented, is non-recursive, meaning if it is locked twice in the same thread without an intervening unlock, the program will deadlock. It is often useful to be able to nest the locks in a thread so that a method that acquires the lock can call another method that also acquires the lock without deadlocking.

I propose that the built-in Mutex be implemented as recursive.

There is no use at all in having a non-recursive mutex. Non-recursive mutexes are sometimes available in thread libraries, but they are only for extremely performance-sensitive applications where saving a machine cycle is paramount. In a high level language such as Ruby, a non-recursive mutex is unnecessary and all-too-likely to cause deadlocks.

`Comments`

`Yes, please! (WayneConrad, 2002-03-22 17:44:13)`

I'm trying to explain to a friend who is learning Ruby why he can't have one public function that's locking his object call another public method that's also locking his object. It's embarassing to have to tell him that he has to use a less natural form of expression to work around Mutexes's non-reentrant behavior, especially after I got done telling him that Ruby tries to adapt to the human rather than the other way around :) I don't have the numbers in front of me, but I remember doing some tests with pthreads a few months back and finding out that reentrant mutexes cost hardly any more than non-reentrant ones. So pleeeeeease can we have Mutex be reentrant? Pretty please?

`Sync (devEiant, 2002-03-23 21:00:16)`

There's already a library included with Ruby that does what you want. It's called Sync. It not only allows recursive synchronization, but it supports 2-phase locking as well (shared vs. exclusive locks).

Simple (contrived) example:


  use 'sync';

  @foo = true;
  @mutex = Sync.new

  def falsify_foo

    # Shared lock -- multiple threads
    # can access as read-only
    @mutex.synchronize( Sync::SH ) {
      if @foo
        # Exclusive lock -- only one thread
        # can modify @foo at a time
        @mutex.synchronize( Sync::EX ) {
          @foo = false
        }
      end
    }
  end

Hope this helps.

`agree (anonymous, 2002-04-10 14:24:52)`

While it's easy to implement a recursive-mutex on top of the existing one, I believe it violates the Principle of Least Astonishment that Ruby's standard mutex is nonrecursive. The poster is right in that non-recursive should be the special case, rather than recursive.

`Definitely (sodell, 2002-04-19 15:24:42)`

Mutexes are by and large used recursively...non-recursive should not be the default.

`Reason for rejection`

Someday it will be. Use monitor until it happens.

`RCR 75: Dir.mkdirhier && Dir.rmdirhier (jfh, 2002-03-18 22:09:37)`

`Status: Rejected`

Fairly straightforward, Dir.mkdirhier and Dir.rmdirhier:


class Dir 
    def Dir.mkdirhier(dir)
        begin
            Dir.mkdir(dir)
        rescue Errno::ENOENT
            Dir.mkdirhier(File.dirname(dir))
            Dir.mkdir(dir)
        end
    end

    def Dir.rmdirhier(dir)
        #
        # Exit when dir == "." or dir == "/"
        #
        return if dir.match(/(?:(^.$|^/$))/)
        begin
            Dir.rmdir(dir)
        rescue SystemCallError
            return
        end
        Dir.rmdirhier(File.dirname(dir))
    end
end

`Comments`

`Re: Dir.mkdirhier (jfh, 2002-03-18 22:20:54)`

One could add this as well:


        rescue Errno::EEXIST
            return

`Remember ftools (root, 2002-03-19 06:13:17)`

FTools already contains a makedirs method.--Dave

Re: Remember ftools (jfh, 2002-03-19 07:30:14)

Hmmm...ok...

Are those the kinds of methods we want
to see make their way back up into the
File and Dir classes? Or is there a
reason to keep them out?

Re: Remember ftools (anonymous, 2002-03-19 10:53:30)

require 'ftools'

File.makedirs( "my/directory/here" )

Reason for rejection

Part of this RCR is already done by ftools, and I think the rest should be done in it too.

RCR 76: Modify mkmf.rb to silently support frameworks (CanyonRat, 2002-03-27 10:11:47)

Status: Rejected

The NeXTish systems, Darwin, Mac OS X, and Simply GNUStep Linux, bundle application services inside "frameworks", custom directory hierarchies that facilitate versioning and localization among other things. It is sufficient for mkmf that at the top of each framework is a link to the relevant library and Headers directory. To make gcc work with frameworks, one uses the -framework and -F command line arguments. Mkmf.rb currently never generates these arguments. The Principle of Least Surprise would seem to require that that an extconf.rb script written for more traditional Unix should simply work on systems that use frameworks. This can be accomplished in a fairly straightforward manner by having methods such as have_library, have_header and find_library try a framework oriented test if the current test fails. If the framework version of the test succeeds, the arguments passed to the portion of the script that actually generates the makefile can be adjusted accordingly and success returned.

Comments

Re: Modify mkmf.rb to silently support frameworks (anonymous, 2002-05-22 21:41:02)

What do you mean by framework oriented test?
At first, you need to clarify it.

Reason for rejection

I need more concrete RCR to understand what "framework" is, and how to support it.

RCR 78: Kernel::big_endian? and Kernel::little_endian? (jvoegele, 2002-04-09 08:55:07)

Status: Rejected

I posted this to ruby-talk and got no response so I'll try here. I'd like to be able to determine the endian-ness of the architecture Ruby is running on. This is important for many network applications. Currently, you must use some hack like [1].pack('l') which yields 0 or 1, depending on endian-ness. I propose instead that we add two methods to Kernel: big_endian? and little_endian? Could we add these as Kernel methods? Is it worth it? JasonVoegele

Comments

Sounds good to me (anonymous, 2002-04-16 08:43:31)

Anyone else have an opinion?

I agree (djberg96, 2002-08-17 21:52:14)

Yes, sounds good. :)

Question - reverse? (djberg96, 2002-08-19 13:29:26)

If we add this, should we also add a 'reverse()' method for Fixnum? Otherwise, for packed data, you may end up having to do this on big_endian machines:

packed.to_s.reverse!.to_i

How about Kernel::endian returing a defined constant? (aidenc, 2002-10-17 14:15:12)

On big endian archs, Kernel::endian would return Kernel::BIG_ENDIAN, on little Kernel::LITTLE_ENDIAN

allows for the possibility of non-standard endianness... I can't recall right now, but I've definately heard of architectures which used non-big, non-little endians...

Reason for rejection

This must be solved something like sysconfig module that answers system (or architecture) information.

RCR 79: Prevent access to internal variables through implicit return (anonymous, 2002-04-09 20:44:29)

Status: Rejected

Given the following code:

class Firewall
def initialize
@hardcode = "SUPASECREX"
end

def secret
@hardcode
end
end

It is possible to get to the internal variable "@hardcode" and change it by doing the following:

example = Firewall.new
example.secret.replace("0WNZED!!!")

or less obviously...

@password = example.secret

and over in another method:

@password.replace("DEFAULT")

neither line shows that it is changing the internal Firewall string.

A possible solution to this problem is to change the implicit return semantics to "dup" any instance or class variables. If the original behaviour is required, an explicit return can be used.

Local variables and function results don't need to be "dup"ed as they don't persist for long enough to matter.

Globals are accessible from both parts of the code so they're not an issue either.

Comments

Proposed behavior is too complex. (matz, 2002-04-09 22:54:26)

And I feel too hard to understand/explain.
How about using "freeze", or explicit "dup"?

matz.

Violates POLS (pong, 2002-04-11 13:08:37)

I think this kind of implicit behaviour would be confusing and in violation of the principle of least surprise. Couldn't dup/freezing be made declarative:

def someValue
  return @blah
end

returnImmutable :someValue

Ruby wouldn't have to change.

Reason for rejection

Object duplication is better be done explicitly.

RCR 80: Change implicit return to always return 'self' (anonymous, 2002-04-09 20:44:45)

Status: Rejected

I put this into a seperate RCR so you can vote 'no' to this while voting 'yes' to the other one. :)

I don't need some of my methods to return values, so I don't specify anything. The value that does get returned is whatever the last calculation was - eg, junk.

If someone else were to use my code, they may probe and discover return values that are useful to them even though I did not intend for them to be visible through that part of the API.

Rather than default to returning junk half the time, implicit return can be changed to return 'self'. This is a useful value that allows the calling code to chain methods together.

Explicit return would be used to override the default. This would need the "dup" protection for instance and class variables.

Comments

some issues to consider (cout, 2002-04-09 22:05:01)

1) This will certainly break existing code. Ideally, a migration path needs to be provided so that old code will have time to change.

2) In Ruby 1.6 (don't know about 1.7), it is slightly faster to omit return on the last line than it is to use an explicit return. This issue should be addressed before accepting this RCR.

3) Currently, the only way to return a value from a Proc or block is to use an implicit return. How would this RCR affect a) proc objects, b) blocks, and c) methods that have been converted to proc objects?

4) Chaining method calls can sometimes make code unclear. Would it be better to return nil or to return self (the former would exclude the possibility to chain method calls) when no return value is specified?

5) Might implicitly returning self cause any problems related to garbage collection? (I can't think of any apart from doing strange things with bindings and/or continuations, but this is an issue that should be considered nevertheless).

6) What was the original reason for Ruby's returning the last value anyway? Is this an important enough reason to keep this behavior?

Re: some issues to consider (matz, 2002-04-09 23:03:25)

6) What was the original reason for Ruby's returning the last value anyway? Is this an important enough reason to keep this behavior?

(a) it's inherited from Lisp (called implicit progn). And it is VERY useful and consistent with other compund statement (such as if, begin, etc.)

(b) as you stated in 3), it is the only way to return value from Procs.

matz.

Reason for rejection

Compatibility is the biggest reason to reject this RCR.

RCR 81: Comma separated arguments to "alias" (Nikodemus, 2002-04-12 06:34:22)

Status: Rejected

Normally arguments are comma separated in ruby, "alias" is an exception to this rule, however: alias :foo :bar I suggest that both the current behaviour and the following should be valid: alias :foo, :bar alias( :foo, :bar ) This would be more in line with POLS and I doupt it would break any existing code.

Comments

Better formatting, same content -- sorry! (Nikodemus, 2002-04-12 06:39:54)

Normally arguments are comma separated in ruby, "alias" is an exception to this rule, however:

alias :foo :bar

I suggest that both the current behaviour and the following should be valid:

alias :foo, :bar alias( :foo, :bar )

alias :foo, :bar alias( :foo, :bar )

alias is a keyword, not a method (cout, 2002-04-12 07:24:59)

Arguments to method calls are comma-separated. However, alias is a keyword (it falls in the same category as for, while, def, and class). I don't know what kind of implications allowing commas might have.

As an alternative, you might try using alias_method, which does allow commas. The difference between alias_method and alias are (I think): the order of arguments is backward, alias causes method_added to be called (while alias_method does not), and alias_method is only for methods (while alias works with global variables in addition to methods).

IMO, using alias and alias_method is a source of confusion to begin with, and both should be avoided whenever possible. The uglier they look, the better.

Reason for rejection

why not use alias_method method.

RCR 82: String#+ should automatically do param.to_s (rennex, 2002-04-13 20:19:38)

Status: Rejected

I'm doing my first bigger project in Ruby, and the error I keep making is forgetting to call to_s when constructing strings. For example: line = prefix + number where prefix is a string and number is not. Even Java automatically calls toString on objects that are added to strings, and it doesn't even support overriding operators! (at least the last time I checked :) It's easy to change String#+ yourself to automatically call to_s, but it seems so useful and issue-free to me that it should be default behavior.

Comments

It used to do auto conversion. (matz, 2002-04-15 03:09:15)

But I removed it because it often encouraged type-less programming. Ruby is a strongly but dynamically typed language, not a type-less language.

Type-less programming often hinders finding serious type bug.

matz.

Not issue-free (anonymous, 2002-04-15 06:37:34)

An issue with this is that it destroys associativity of addition, i.e.

"ho " + (1 + 2)  ==>  "ho 3"
("ho " + 1) + 2  ==>  "ho 12"

Personally I think it is best to be as explicit as possible with type conversions.

// Niklas

bad idea for ruby (jtra, 2002-04-15 06:48:10)

Such kind of sugar worth it in languages like java. But it is often used rather as strange conversion tool, i.e.

s=""+number;

Ruby's string interpolation is imho always better. i.e.:

line="some prefix #{number}"
or
line=prefix + number.to_s

In the second case it is also more readable since you know that number is not string (usually) without searching the previous assignment to that variable.

How is that an issue? (anonymous, 2002-04-15 07:06:05)

But everyone can see at first glance that the stuff in parenthesis is evaluated first, then the * and / operators from left to right, and then the + and - operators from left to right.

We don't need in everyday programming that a+(b+c) == (a+b)+c

Not his point I believe (anonymous, 2002-04-19 07:32:30)

Personally I find it a little bit awkward and non-intuitive. Consider this:

"ABC"+5 = "ABC5"
5+"ABC" = ??

I think it's better to be a little more strictly typed, than being surprised by things such as this.

Reason for rejection

type conversion should be explicit.

RCR 83: ARGV should be frozen (with a security level of 1 or greater?) (seanoc, 2002-04-14 14:45:11)

Status: Rejected

It would be of great interest to some in the security community if ARGV was frozen at a security level of one or greater. IMHO, however, given that ARGV is a magic constant, it should be frozen from the start as a readonly variable regardless of the security level.

Comments

I'm not convinced (JimWeirich, 2002-04-14 21:39:26)

1) Its trivial to do yourself. (e.g. ARGV.freeze)

2) Its trivial to defeat in many cases. For example, I ususually setup my top level function to accept ARGV as a method parameter, rather than hard code a reference to ARGV. If ARGV was frozen, I could pass in ARGV.dup as the main program parameter.

3) It breaks code that depends on the ARGV.shift idiom. It also breaks subsystems that filter ARGV, removing parameters that the subsystem recognizes and leaving "unhandled" arguments for the rest of the system.

4) It provides dubious security value. What is the additional security risk of a mutable ARGV?

Re: ARGV should be frozen (matz, 2002-04-14 23:27:38)

ARGV (among othre values) IS frozen with a security level of 4 or greater. Do you have any particular reason to make it level 1?

matz.

Reason for rejection

freezing ARGV at level1 will cause many problems. Modifying values is NOT allowed at level 4 for security reason.

RCR 84: Interpreter hint: iterator variable reuse (seanoc, 2002-04-16 15:39:02)

Status: Rejected

This would yield a rather sizable performance boost for iterators. Imagine the following pseudo code: ### Begin foo = [] # This is populating a large array # Note that x is a new object for each # iteration (1...1000).each{|x| foo.push(x) } # Note the reuse: interpreter hint foo.each do |reuse:x| puts x end ### End In circumstances where the developer _knows_ that the contents of a loop are only valid within the context of the loop, such as the print statements above, it would be insanely useful to be able to tell the interpreter to not create a new object for each iteration, but instead to reassign a new value to x for each iteration. This would substantially reduce the strain on the GC because there would only be one instance of x, not foo.length() instances. The default behavior, as shown when populating the array, would be to still create a new object for each iteration. Another huge win for this would be DBI and iterating over rows from a database.

Comments

Sounds like a good idea... (anonymous, 2002-04-16 16:24:04)

A few things to consider:

Are there other areas where we put
tags on variables like this? If accepted, this would open
the door for others, of course.

How will the parser be effected?

I do like the concept, though.

Internal problems... (seanoc, 2002-04-16 21:23:37)

Well, I think I picked a notoriously bad example: array just increments through the array returning the symbol for each of the elements in the array. I was trying to some how address a DBI issue:

dbh.select_all('SELECT * FROM foo') do |row|
puts row[0]
puts row[1]
end

Each iteration through recreates a new row object instead of reassigning to the existing row object. Another source of inspiration are the benchmarks that were done on page 552 of the Ruby Developer's guide wherein many of the performance problems of constantly creating objects, came to light (for example: Array#each_with_index). In most cases, there's nothing that can be done about this because the objects need to be created, however in many cases, if the programmer is savvy and knows that the underlying object will be _reused_, then I think passing an interpreter hint to the iterator would be a HUGE asset to Ruby.

Tired... (seanoc, 2002-04-16 21:25:28)

Array#each doesn't return the symbol, it returns a pointer the underlying element/object. I need some sleep. <:~)

No copy is done by iterators (matz, 2002-04-17 02:31:26)

As you've probably noticed by now, Ruby's variables are just references to the objects, so that no object copy is done by iteration. There's no use for "reuse" declaration.

matz.

Modifying underlying object without recreating.... (seanoc, 2002-04-19 04:32:52)

A better example:

Enumerable#each_with_index

That returns a new two element array for each iteration. Would it not be possible to reuse the 1st instance of the array and then have the contents be references to the actual values? This is a much better example than iterating over an array. <:~)

Reason for rejection

There's no use of "reuse" in Ruby's iteration.

RCR 85: key value mapping for sprintf/% (anonymous, 2002-04-17 01:47:20)

Status: Rejected

I was on irc at irc.openprojects.net in #ruby-lang
last week when someone came in and asked if Ruby
had something like python's string formatting
templates. While we were able to refer him to
various templating/interpolating solutions, they
didn't solve his problem as directly as he expected.

After some discussion, we realized what he was asking.
Python allows you to pass a mapping (e.g. Hash) as
values to a format string.
(cf. http://www.python.org/doc/current/lib/typesseq-strings.html)

In Ruby, we have format strings, but they only accept a fixed
number of arguments, and are substituted in the order they are provided.



"This is first: %s, and this is 2nd: %s" % ['one','two']

But python allows something like this for the format string:



ftext = "%(name)s is %(gender)s. %(name)s is %(age)2d years old"

In Ruby, it might work like this:



hash = {'gender'=>"male", 'name'=>"Fred", 'age'=>23}

ftext % hash  # "Fred is male. Fred is 23 years old"

I wrote a substitute method for String#% which treats a Hash argument
as described above. It seems to be working fine, and I even posted
it to the RAA ("TextFormatTemplate").

However, it seems this is something that could (should?) be easily added
to Ruby itself. It would help those coming from Python, add a useful
feature, and be totally backwards-compatible.

http://www.ruby-lang.org/en/raa-list.rhtml?name=TextFormatTemplate

Guy N. Hurst

Comments

in ruby (slumos, 2002-04-23 18:38:26)

isn't:

ftext = "%(name)s is %(gender)s. %(name)s is %(age)2d years old"

basically the same as:

ftext = "@{"%s" % name} is @{"%s" % gender}.  @{"%s" % name} is @{"%2d" % age} years old"

Re: in ruby (anonymous, 2002-05-10 18:06:27)

did my hashes get converted to ats somehow, or did I just not know what I was typing?

John Proctor (anonymous, 2002-06-18 12:49:09)

This is a very powerful tool and is used heavily in Python. I use Guy's TextFormatTemplate and would really like to see it added to Ruby itself.

Re: in ruby (anonymous, 2002-06-18 21:06:46)

I don't think so. Using %(name)s does not require name to be defined when the template string is declared. It is a marker to be expanded later. #{name} or any formatted version of that is expanded at the time the string is declared.

The difference in use is that with the former, you can declare strings, say SQL statements, that have missing values at the top of the program or in another module then execute them over and over by just applying the hash with the names enclosed in %(var)s

This is widely used in Python and is a very nice feature.

Reason for rejection

No need where we already have string interpolation.

RCR 86: Constructors (i.e. initialize) (transami, 2002-04-17 11:16:22)

Status: Rejected

i create highly dynamic scripts, such that i am often creating objects based on information pulled from files. hence i am not using literals. but the notation of the literals is so convienent, which is exactly why we have them! so why can't i create my objects using a constructor AND a literal form contained in a string? for example: aRange = 1..3 is the same as: aRange = Range.new(1, 3, false) but i want to type: aRange = Range.new("1..3") seems to me, by following the idea of least surprise, all constructors should be able to take the literal form as a string. (as an aside i have a pet-pieve about the use of 'def initialize'. why isn't it just 'def new' since that is the call that is made? not very intuitive.)

Comments

Re: Constructors (i.e. initialize) (pong, 2002-04-17 11:27:35)

Why don't you just use aRange = eval("1..3")

least surprise (transami, 2002-04-17 11:33:44)

i did use eval, though from what i've read it is exceedingly slow. my point is merely a suggestion concerning "least suprise".

New vs Initialize (JimWeirich, 2002-04-17 12:38:49)

new is the message sent to the class object to create a new instance. As soon as a new object is created, the class will send an initialize message to the new instance along with any parameters that were given to new.

So new and initialize are actually separate methods in different objects. It is possible to rename the instance method to new, but that might be just as confusing. And initialize is probably a better description of its role in the creation process.

Some More Thought on This (JimWeirich, 2002-04-17 19:52:00)

I see two basic problems with this idea:

Overload a function with different types in Ruby implies the function does a run-time type test on the arguements and programmatically decides on the proper behavior. This is something I would hate to see imposed on every initialize function I ever write.
What about classes that normally take a single string argument to new? How will you differentiate between the normal and literal behavior.

However, I might suggest the following code ...

class Class
  def literal(str)
    result = eval(str)
    fail "Improper literal string for #{name}: #{str}" if ! result.kind_of?(self)
    result
  end
end

Usage:

    r = Range.literal("1..3")
    i = Integer.literal("12")
    n = Numeric.literal("42")
    n2= Numeric.literal("3.1416")
    hex = Integer.literal("0xa5")
    silly = Range.literal("Range.new(1,3,true)")
    exception = Integer.literal("Hello")
    danger = Integer.literal('system "rm -r /"')

I wouldn't suggest this for general consumption because of the open use of eval, but it would be OK for careful, limited use.

Regarding the speed of eval, if you are doing IO on a file, I doubt the speed of eval is going to effect you significantly.

Cheers.

Re: least surprise (matz, 2002-04-18 03:04:03)

Your idea surprises me a lot ;-)

Anyway, your proposal (initializing an object from a string representation) requires parsing, the most slow part of the "eval". So your expectation (fast initialize from string) is somewhat contradicting.

matz.

Safe and Convenient (transami, 2002-04-18 16:58:06)

thanks matz. so speed isn't a significant factor.

JimWeirich's thought on a def literal seems useful, but he suggests that such a method can be dangerous due to the general use of eval. thus it still seems to me that a safe and convenient way to do this would be nice, fast or no. i am stuck with using eval as it is.

Re: Safe and Convenient (matz, 2002-04-18 23:09:32)

Objects with literals, such as integers, floats, and regexps can be converted from Strings, for example Float("3.14").

But since Ranges are formed from two expressions, conversion from string fundamentally requires expression evaluation, which is done by "eval" internally. So your proposal of Range initialization without eval is contradicting.

Do you get it?

matz.

Making Eval Safe (JimWeirich, 2002-04-19 08:23:25)

You can make eval safe as well as convenient. Just validate the string with a simple regular expression before giving it to eval. If the expression consists of a number, 2 or 3 periods followed by another number, then it is safe to give to eval.

Reason for rejection

This will be another form of eval anyway.

RCR 87: Alternate method return value (sodell, 2002-04-19 15:11:05)

Status: Rejected

Ruby returns values from methods in 2 ways: using the last evaluation made, or by an explicit return statement. I would like to see a third which allows you to say: return = "return value" ...which, when given, causes the value of the variable "return" to be returned no matter where else the method may return. This would override the "last evaluation" method, but not the explicit return statement, unless the "return" statement gave no value. Example: dosomething() return = "return value" dosomethingelse() somevar = "random string expression" return ...which causes "return value" to be returned, not "random string expression" nor an implicit nil from the return statement nor anything else. Since the return statement gave no value, the value of the variable "return" is used.

Comments

Could you give us the rationale? (matz, 2002-04-19 21:42:18)

I understand how it works, but I still don't get why we need this.

matz.

BASIC method return value (anonymous, 2002-04-21 06:48:39)

This is the way to do return values in BASIC (and Visual BASIC). This is NOT a language Ruby should ever borrow features from.

I have used the "set default return value" feature of BASIC and it has saved me a few lines here and there.

More important though is that I have come to rely on the automatic check of C++ compilers to warn me when I have forgot the "return ;" statement. In Ruby I tend to forget a return here and there which makes me spend unnecessary time troubleshooting. Or if I build a method relying on the "last evaluation" return method, I sometimes add code at the end, forgetting to add an explicit return statement.

Any comments on this?

/rob

Why (sodell, 2002-04-21 13:22:49)

It's also from Pascal and it is VERY useful.

Suppose a method has to do an iteration to produce its return value. During an iteration of say 100, at some random point, it has found what it needs and can produce a return statement. However, it has to do something before it can return, so it has to break out of the iteration. Here's an example of how Ruby code would look currently:

returnval = nil

100.times do | i |
if(some check) then
returnval = something
break
end
end

dosomething else
return returnval

...that looks sort of messy

With a return variable, you could just say:

100.times do | i |
if(some check) then
return = something
break
end
end

dosomething else

...knowing WITH CONFIDENCE what the return value with be, no matter what the code below ends up doing.

It's strictly a matter of elegance and certainty. The code is cleaner and anyone seeing the keyword return will know where the return value is decided and won't be confused that perhaps the code below is implicitly generating a return value.

Use "ensure" (matz, 2002-04-21 23:47:50)

returnval = nil

100.times do |i|
  if(some check) then
    returnval = something
    break
  end
end

dosomething else
return returnval

...that looks sort of messy

Why don't you use "ensure" like:


begin
  100.times do | i |
    if(some check) then
      return something
    end
  end
ensure
  dosomething else
end

`Don't want it to execute if exception thrown (sodell, 2002-04-23 03:24:06)`

Using ensure forces the call in the event of an exception. We don't want to guarantee the code is called, we just want to guarantee the return value when the function returns non-exceptionally.

`C'mon (adde, 2002-05-03 02:16:13)`

But w-h-y add it to the language? Just do it yourself!

          

          def do_something

          result='some sort of result'

          ...

          ...

          return result

          end

          

          The existing explicit and implicit return methods complement each other.

          This would just add complexity.

`Reason for rejection`

This RCR buys you little, if any.

`RCR 88: allow { } (sodell, 2002-04-21 13:39:17)`

`Status: Rejected`


def allow
  begin
    yield
  rescue
  end
end

I use the above method a lot to make calls that either I don't care if they work or not or I'm going to ignore/test their success/failure in another way sometime after the call. Here's an example:


allow{File::unlink("file.tar")}
system("tar cxvf file.tar *.files")

I don't really care if file.tar exists when I try to delete it; I just want it gone before I make the system call to invoke tar.

I have only seen this keyword in one other language. I think this would be a good addition to the core distribution.

Why would we need this? Because I think it encourages elegant code. Some things should be added as libraries, some things should be left to the users to write for themselves, but some things should be given to them from the very beginning and I think this is one of those functions that should just be there for programmers from the get-go.

`Comments`

`preview (sodell, 2002-04-21 13:44:35)`

So...is there a reason the preview doesn't look anything like the final post?

`allow( *exceptions ) (anonymous, 2002-04-21 18:12:10)`

Interesting. I would though prefer a slightly more general definition wich by default would work as you wrote, but could optionally be given a list of allowed exceptions:


allow( SystemCallError, ScriptError ) {
  ...code...
}


  -- Nikodemus

`allow (dblack, 2002-04-21 20:51:02)`

Hi --

The thing is, though, as it becomes more multi-purpose, it starts to look like just a different conception of how to handle exceptions (as opposed to just using rescue clauses in the code).

`Bad idea (cout, 2002-04-22 08:01:02)`

Catching all (or even a subset of all) exceptions is a bad idea, since you might catch an exception that was caused by being unable to remove the file. If unlink were unable to remove the file, then it may silently fail.

A better solution is to:

Explicitly catch the exception you are expecting,
Check that the file exists and don't try to remove it if it does not, or
Use File::rm_f instead.

`Use "rescue" as a modifier (Nikodemus, 2002-04-22 09:37:20)`

Silly me. Something like this is already implemented: you can use "rescue" as a statement modifier.


def alert
  puts "Alert"
  raise
end

alert rescue
puts "ok"

Produces:


Alert!
ok

`Not "bad" (sodell, 2002-04-22 12:46:52)`

It's only a bad idea if failure would actually cause problems and if the failure were never checked for.

          

          Sometimes I make calls "just to grease the wheels" so to speak, and if the calls fail, it's ok because I'm doing other checks later, or failure doesn't matter.

          

          Here's another example. Let's say I'm doing a little logging routine, and I want the logging routine to start with a fresh log file every time the program starts. It's the program's job to delete the current log file when it exits so the next time it's started, a new one will be created. Well, let's say it failed to do that one time so the log file doesn't go away on exit. At exit, we don't want a lot of errors to pop-up, so if the old log file can't be deleted, no big deal; we can't do much about right then anyway, especially if the whole system is shutting down. However, on start-up, we DO want to check for an existing log file, and that's where the error of "not deleting the log file" will really be handled.

`Not "good", either (devEiant, 2002-05-03 16:08:59)`

Your scenario sounds suspiciously like programming by coincidence. If it can fail with no repercussions at all, why make the call?

`Reason for rejection`

If you want to ignore excpetions, use rescue modifier.

`RCR 90: "{|x|...}" as a Proc expression. (matz, 2002-04-21 23:55:46)`

`Status: Rejected`

It may be possible to implement "{|x| ...}" as a literal form of Proc object, which exactly work as "lambda{|x| ...}". I didn't implemented it, because I thought "lambda{|x|...}" is enough, and it would cause shift/reduce conflict. But I think mandatory "|" after opening brace can solve conflict. It may make Smalltalk lovers happier, but makes the language syntax complex. How do you think? matz.

`Comments`

`something like: (anonymous, 2002-04-22 01:25:24)`

Need some examples...

          

          So it could work something like:

          

          {| puts "Boo!"}.call #-> "Boo!"

          

          or would it be:

          

          {|| puts "Boo!"}.call #-> "Boo!"

`Examples (matz, 2002-04-22 02:35:17)`


{|| puts "Boo!"}.call #-> Boo! 
a = {|x| puts x}
a.call(5)             #-> 5
def foo(n)
  {|x| x*n}
end
b = foo(6)
p foo.call(4)         #->24
p foo.call(8)         #->42

matz.

hmm. (cout, 2002-04-22 07:50:00)

I have two concerns:

As you stated in [], performance is an issue. Creating a proc implicitly might hinder performance;
There is an ambiguity to consider:
```
  def foo(*x, &y)
    # ...
  end

  foo { |z|
    # ...
  }
```
Is this passing a proc in as the argument to foo, or is it passing a block in with no arguments?

Explicitly using Proc.new/proc/lambda is sufficient in most cases, and is not too inconvenient.

literal Proc (Rich_Kilmer, 2002-04-22 09:20:56)

I would rather have the option of specifying the | in a proc than a manditory | after the {

In the case of empty params requiring {||...} seems at odds with the Ruby method calling syntax of leaving off the () where a method accepts 0 or optional params.

So, I think your current methods of creating Procs (lambda or Proc.new) are sufficient.

mandatory || (pong, 2002-04-22 11:41:21)

Would || be mandatory in all blocks, so it would be:

def blah
yield
end

blah { || puts "bleh" }

If so I don't think it is conclusively better, and not worth the extra syntax complexity.

Re: mandatory || (matz, 2002-04-23 02:16:15)

No, it won't. It will be mandatory for Proc literals only, just to remove anbiguity.

matz.

Re: hmm (matz, 2002-04-23 02:22:46)

Your first concern (performance) is not an issue, because "literal Proc" is an alternative for "lambda". Their performance is almost same, rather "literal Proc" is little bit faster.

Your second concern is a serious one. Probably

foo {|x| ...}

will be the block to the method "foo", but it may still confuse programmers.

matz.

Mixed Reaction (JimWeirich, 2002-04-23 09:57:08)

I have mixed reactions on this. Overall, I like the idea of a literal proc syntax. But you only save two characters ("proc{}" vs "{||}").

I think the additional complexity and potential ambiguity weighs against this idea. Unless I see stronger arguments for this, I think I would pass on this suggestion.

less clear with no benefits... (anonymous, 2002-04-23 15:41:25)

I don't think writing proc { } is any trouble at all, and it is a lot clearer that there's a proc object being created than simply writing {|| blah}.

I am also uncertain why the || would be needed. Does a plain { ... } literal have some other meaning?

Alternatives (Nikodemus, 2002-04-23 16:19:38)

"&" is already used to convert blocks to procs and back. Why not use it?

&{ ...code...}

Would be rather unsurprising as a proc literal...

Same applies to:

proc |var|
  ...code...
end

and

lambda |var|
  ..code...
end

Re: less clear with no benefits... and alternatives. (HughSasse, 2002-04-24 04:09:28)

proc {} looks visually very like proc () -- indeed it looks more like a method call than an constructor of a literal.

{ ... } can mean creation of a hash, depending on what the ... is. However, this would just change the amount of lookahead needed in the parser, I suppose. Except that expr, expr is a valid statement...

What would happen if ALL blocks were procs? Why does there need to be a distinction? If it is for performance, that can be optimised away, can't it??
Would this simplify the design of ruby?
Would it ease refactoring code: "extract block as method"? def work(x,y) myblock for example....

As for the alternatives of proc...end and lambda...end, I could live with those, after all, def creates a method object.

Rather not (jtra, 2002-04-30 15:41:27)

Well, I like idea of {|x,y| ...} syntax being used for anonymouse function (I have actually used this in my experimental language jtpl, http://klokan.sh.cvut.cz/~jtra/ ). However it does more confusion when this is added to current ruby syntax. Requirement of using || where no parameter is given is strange and breaks POLS. It would also add more complexity for beginers that tries to learn ruby.

So, my vote was rather not. I have been thinking few times about doing "ruby from scratch" that would have clear grammar (i.e. not written in parser.y only) and almost same semantic (possibly simplified), but it would require a lot of effort.

literal procs (raganwald, 2002-08-01 10:29:12)

Two things come to mind.

First, this seems like 'syntactic sugar'. If that's true, then it seems to me the 'elegant' thing to do is to introduce a macro facility.

I've seen discussions elsewhere pointing out the advantage of compiling ruby to an intermediate form and manipulating the intermediate form.

This is exactly how Lisp 1.5 was first designed, but then they discovered programming the intermediate form was more powerful than having a front end syntax.

Reason for rejection

It seems to introduce another ambiguity in the language.

RCR 91: Numeric#prev (seanoc, 2002-04-23 17:03:39)

Status: Rejected

This is a simple patch. I do a lot of calcs going up and down a number series and would like a builtin way of quickly decrementing a counter.

i = 10
i.prev   &gt;> 9

In tight loops this is faster than writing:

i - 1


Index: numeric.c
===================================================================
RCS file: /src/ruby/numeric.c,v
retrieving revision 1.42
diff -u -r1.42 numeric.c
--- numeric.c   2002/04/10 08:45:22     1.42
+++ numeric.c   2002/04/23 22:02:49
@@ -952,6 +952,17 @@
 }
 
 static VALUE
+int_prev(num)
+    VALUE num;
+{
+    if (FIXNUM_P(num)) {
+       long i = FIX2LONG(num) - 1;
+       return rb_int2inum(i);
+    }
+    return rb_funcall(num, '-', 1, INT2FIX(1));
+}
+
+static VALUE
 int_chr(num)
     VALUE num;
 {
@@ -1656,6 +1667,7 @@
     rb_include_module(rb_cInteger, rb_mPrecision);
     rb_define_method(rb_cInteger, "succ", int_succ, 0);
     rb_define_method(rb_cInteger, "next", int_succ, 0);
+    rb_define_method(rb_cInteger, "prev", int_prev, 0);
     rb_define_method(rb_cInteger, "chr", int_chr, 0);
     rb_define_method(rb_cInteger, "to_i", int_to_i, 0);
     rb_define_method(rb_cInteger, "to_int", int_to_i, 0);

Comments

Post to the list (matz, 2002-04-23 23:42:01)

RubyGarden is not the best media for posting patches. Try again to the ruby-talk list.

matz.

Close this RCR.... (anonymous, 2002-04-28 06:02:18)

After having tested against 1.7, this RCR should be closed: my point is mute/no longer valid.

Reason for rejection

The author wants this RCR to be closed.

RCR 92: Const correctness (anonymous, 2002-04-29 08:07:54)

Status: Rejected

Coming from a C++ background i really miss the ability to declare a method/object/parameter/returnvalue as const.

In Ruby (as in Java) the ability to declare parameters and returnvalues as const are even more important than in C++ since everything is passed by reference.

Without const correctness, the encapsulation that a class provides can easily be broken by mistake by modifying the wrong reference(s).

I fully aware of the builtin ability to freeze an object, but that's a different thing alltogether. You don't wan't to freeze the object, you wan't to prevent modifications thru a reference to the object. Also, freeze can't be used to declare methods as non-modifying (const).

And no, it wouldn't clutter up the language. If you don't want to use it you wouldn't have to. Allthough i'd recommend you to at least consider it.

Some Examples:



def fun1 const param

  param.capitalize!

end



var="value"

fun1 var=> Error, const param can't be modified



---



class Foo

  def method1 param const

    ...

  end



  def method2 param

    ...

  end



  const def method3 const

  end

end



const bar=Foo.new



bar.method1 => OK!

bar.method2 => Error, non-const method 

called for const object



var=bar.method3

var.capitalize! => Error, const object can't be modified

Comments

Voting (anonymous, 2002-04-29 08:28:48)

I would really appreciate it if people who vote took the time to write a comment and explain why they think it's a good/bad idea, what experience they have etc.

Yeah, and this specially applies to the ones voting 'Really BAD idea' ;)

/Author

How could this work? (Dave, 2002-04-29 09:53:27)

In C+, type checking is a compile-time activity. In Ruby, there is no way the compiler could perform these checks, as variables are untyped. This means that the concept of 'const'ness would somehow have to be propogated with objects themselves, which seems to be roughly the same as freezing the object. What am I missing?

Re: Const correctness (anonymous, 2002-04-29 10:05:19)

Ach!

I am a ??? programmer
Why isn't Ruby more like ???
Make Ruby more like ???

Yes it's me again, if this keeps up I'll have to register.

Sv: How could this work? (anonymous, 2002-04-29 10:06:14)

Well, i don't know much about the internal representation of a reference in Ruby, but the 'const'ness has nothing to do with the object itself.

For example:

class Foo
def initialize
@bar="lowercase"
end

# Returns const reference to @bar
const def get_bar
return @bar
end

def capitalize_bar
@bar.capitalize!
end
end

foo=Foo.new
bar=foo.get_bar
bar.capitalize! => Error, can't modify const object
foo.capitalize_bar => OK!

So, internally, Foo can modify @bar.
But when get_bar is used a const-reference to @bar is returned, preventing modification of @bar through that reference.

Besides that you can't freeze method definitions and method parameters.

When you declare a method as const you guarantee that it won't modify the receiver.

When you declare a method with a const parameter you guarantee that the parameter won't be modified by the method.

Or something like that...

It _is_ the object though (Dave, 2002-04-29 10:14:41)

In your example:



  const def get_bar

     "hello"

  end



  b = get_bar

  b.capitalize!  #=> error



  a = b

  a.capitalize!  #=> error (presumably)

So the const-ness is propagated with the object. Remember in C++ you declare variables, and those variables denote the const-ness or otherwise of the things they reference. In Ruby you don't, so there isn't really the concept of const-ness that I can see. That's why objects would have to carry the state around with themselves, and I'm not sure I see that as being different to freezing.--Dave

how C++ const differs from freezing (cout, 2002-04-29 11:56:19)

In C++, I might have this:

  void foo(const int & x) {
    x += 1; // error: x is a const reference
  }
  void bar() {
    int x;
    foo(x);
    x += 1; // x was const inside foo, but can be modified here,
            // because it wasn't frozen
  }

I might also have this:

  class Bar {
    void modify(); // a non-const member function
  };
  class Foo {
  public:
    const Bar & bar() const { return x_; }
  private:
    Bar x_;
  };
  ...
  Foo f;
  f.bar().modify(); // error: f.bar() returns a const reference
  const Foo f2;
  const Bar & b(f2.bar()); // okay: Foo::bar() is a const member function

In C++ I have:

Const objects (the object itself is constant, and I cannot call non-const methods on the object or get a non-const reference to the object).
References/pointers to const objects (the object may or may not be const, but I cannot modify the object through this reference)
Const pointers (the pointer itself cannot be changed to point to a different memory location). This is unnecessary for references, because they can never point to a different object onece they are set.

Ruby does have something like const pointers (these are called constants in Ruby and are created by beginning a variable name with a capital letter). It also has const objects (frozen objects). Ruby doesn't have a concept of a const reference, though. Frustrated with this, I have written some C extensions that simulate const references:

Note that this is still not quite the same as C++, since when a reference is const in C++, so are all the object's members (in Ruby this would require temporary deep freezing, which has a significant runtime hit).

Merits (anonymous, 2002-04-30 01:59:42)

I happen to think that the concept of const correctness stands on it's own merits, and deserves to be discussed.

What i was asking was: 'Would const correctness make Ruby a better language?', not 'Can you make Ruby look like C++'.

But i guess evolution is not your thing, eh?

Is not (anonymous, 2002-04-30 02:06:35)

I totally understand what you're saying.
However, as i tried to show in my example the instance variable @bar isn't frozen, ie. it can still be modified from the inside where the reference is not const.

From your example:
a=b
a.capitalize!

This would cause an error, but not because the object is const (remember, it is the same object that can still be modified by other non-const references), but because a was assigned from a const reference.

The sollution in use today for many these problems is to just .dup the objects you wan't to protect (eg. when returning a reference to an instance variable). I happen to think this is a dirty hack, and it wastes resources.

Re: Merits (anonymous, 2002-04-30 03:48:38)

Sigh. Ok here we go:

"Coming from a C++ background i really miss the ability to declare a method/object/parameter/returnvalue as const."

Very interesting, I miss strong typing.

"In Ruby (as in Java) the ability to declare parameters and returnvalues as const are even more important than in C++ since everything is passed by reference."

I have never encountered the problem that this is supposed to fix when I have programmed in Ruby. I find Ruby easy to work with with almost no gotchas.

What I think the real problem here is that someone who is immersed in C++ is trying to program in Ruby and can't quite shake off the C++ way and embrace the Ruby way of doing things. Just like people typing ++ instead of += 1. It's a thing you do almost unconsiously, after much Perl programming (in which I earn my crust) I keep putting ; on the end of each line for the first 10 minutes of turning to Ruby.

Conversly after Ruby programming I leave off the $ off Perl variables.

Ruby is a good language, lets not just add baggage to it because it allows C++ programmers to write C++ in Ruby.

Otherwise I would like you to consider the following omissions from Ruby :)

Multiple Inheritance
; at the end of every line
$, @ and % as variable type specifiers
variable scope as a keyword prefix
yadda yadda yadda

These all work well in Perl et al, so it wouldn't be impossible to do them. Would they make Ruby a better language - well it dependeds on who you talk to, but it would make things much easier for Perl / Java / C++ programmers who cant be arsed to learn the Ruby way.

I would like to evolve FROM Perl, Java and C++ rather than TO Perl, Java and C++.

Is it that you don't understand? (anonymous, 2002-04-30 04:21:15)

Ok, I admit it, i got the idea from C++. But where do you think the other 'features' of Ruby came from? Some advanced civilization from another planet?

I think you have in fact encountered the problem this is meant to fix, but you may not even be aware of it.

Lets say you declare a class:

class Customer
...
def name
return @name
end
end

customer=Customer.new
name=customer.name
name.capitalize!

I just changed an internal attribute of customer (@name is now capitalized)that was probably never meant to be changed.

By doing this:

class Customer
...
const def name
return @name
end
end

I could effectively prevent anyone outside of class Customer to mess around with the internal attribute @name.

If you have any valid arguments against it (besides RUBY=GOOD => NOT RUBY=BAD) i urge you to share your wisdom. Otherwise go flame Windows users at ./ or something.

Re: Is it that you don't understand? (anonymous, 2002-04-30 07:22:21)

Me again.

It is not that the feature you describe is not available in Ruby as clearly it is not. But is it missing from Ruby?

Just because you can do it in C++ does not make it's omission from Ruby an omission in Ruby. Ruby is not lacking this feature just as it is not lacking multiple inheritance.

Just because you can do something in language ??? does not mean Ruby has to have it.

Strong typing is something I belive in but Ruby works without it and it has tripped me up less than I would have thought so I am not advocating strong typing for Ruby.

What does this feature add to Ruby and how great an advance would it be to have it added. Just adding a feature that happens to exist in another language is not a good enough reason.

Ruby is a pretty complete language and all this const, ++ type of 'improvements' really add nothing to the language. They are quite trivial, probably require a large amount of work and will sully the design.

Comparing to ++/-- (anonymous, 2002-04-30 08:30:54)

I agree that you don't have to add features when they don't fix a problem (like ++).

But this is a real, live problem we have here, regardless of what you can/can't do in other languages.

For comparison, take strong typing:
Errors caused by the dynamic typing in Ruby are usually pretty easy to spot.
Just make sure you test every path of the program and they'll pop up to the surface by themselfes.

When an encapsulated object gets modified thru a reference by mistake it can take forever to find the problem.

So, i argue that a mechanism to prevent write-access thru a reference would make Ruby an even better language.

Re: Is it that you don't understand? (peterhi, 2002-04-30 09:56:22)

Me again (I've registered)

Why not, in your example, do

def name
@name.dup
end

Now name cannot be changed!

Is this what you wanted?

performance (cout, 2002-04-30 10:12:14)

This has a significant performance hit, particularly if @name changes a lot and has to be accessed by many different parts of the system. It's better to have the compiler or interpreter enforce the constness of the returned object.

Re: performance (peterhi, 2002-04-30 10:24:50)

Let me get this right. You want @name to be a constant so it can't be changed and then you expect to change it a lot?

I do not understand what we are trying to fix here and the more we talk about it the more esoteric it seems.

Re: performance (cout, 2002-04-30 11:02:01)

No, the idea is that @name can only be changed by the owner. The way to do this is to return a reference through which @name can be read but not modified. This is not at all an esoteric concept in C or C++, but it's something that's not possible in (and is thus somewhat foreign to) Ruby.

What about using a wrapper? (jarhart, 2002-04-30 14:43:48)

If you want only the owner to be able to change an attribute and you don't want the overhead of returning a copy, you could return a wrapper that doesn't allow changes.

Re: What about using a wrapper? (cout, 2002-04-30 23:00:00)

Writing this wrapper is not simple. A goal of Ruby is to make writing code easier and to reduce the strain on the programmer. A generic const reference (i.e. wrapper) would not work for all cases, but would work in enough cases that it could be very useful. This wrapper could temporarily freeze the object before calling any methods on that object, then unfreeze the object (assuming it wasn't already frozen).

One problem with this is thread safety. Perhaps there is another way to write a generic const reference without temporarily freezing the object.

Unfreeze... (anonymous, 2002-05-01 13:08:20)

From what i understand an object can't be unfrozen...

But i may be wrong!

Re: Unfreeze... (cout, 2002-05-01 14:55:57)

It is possible unfreeze an object from a C extension by directly manipulating the "frozen" flag.

Problems... (anonymous, 2002-05-01 15:21:28)

I saw it mentioned in a discussion at the ruby-mailinglist.

Supposedly you would cause problems for the garbage collector if you allow the user to unfreeze strings, which is what the wrapper would have to do.

Re: What about using a wrapper? (jarhart, 2002-05-01 16:18:56)

Actually, it would be pretty simple to write a generic delegator that's instantiated with an object and a list of methods not to delegate to the object.

Something like:

class ConstWrapper

  def initialize(delegate, *methods_to_skip)

    @delegate = delegate

    skip_names = methods_to_skip.collect { |symbol| symbol.to_s }

    skip_names += self.public_methods

    skip_names -= ['to_s', 'to_a', 'inspect', '==', '=~', '===']

    for method_name in delegate.public_methods

      next if method_name[-1] == ?! or skip_names.include?(method_name)

      eval(

Re: What about using a wrapper? (jarhart, 2002-05-01 16:33:30)

Well, I don't how to get code to look right here, so see ConstWrapper for my example of a generic wrapper to make an object unmodifiable except by the owner.

Alternative Const Wrapper (adde, 2002-05-02 07:31:23)

Below is my take at a constwrapper.
It requires no c-extensions and does not waste resources (each object has at most one wrapper).

You have to specify in the class declaration which methods are const, as in C/C++.

A const-declared method can modify the receiver though (In C/C++ you would have to cast away the constness first), but i think this implementation offers many of the advantages without adding to much complexity.

What's missing is const-declarations for all inbuilt/library classes (only object is provided).

To const declare an object:



          const foo=Bar.new

To declare a method as const:



          class Foo

          const :bar

          

          def bar

          end

          end

To return a const ref:



          class Foo

          def bar

          const @foobar

          end

          end

---



          class NonConstMethodException

`Sourcecode (adde, 2002-05-02 07:58:26)`

It would be really nice if the preview would cut the comment to make it look like it will when you post it.

          

          I have no place to put the sourcecode, but you can email me (adde@restech.se) if you want it.

`owner? what owner?? (patsplat, 2002-05-02 12:25:41)`

Who or what is this owner? The programer writing the code? The class?

`I think it'd make Ruby better. (anonymous, 2002-05-02 18:55:54)`

I voted "good idea". The "opposition" in the long argument earlier didn't seem to understand the concept of a const reference at first, and then got so attached to his opinion that he didn't want to change it ;-)

Ruby is the first language I've found that cleanly and intuitively implements accessor methods (usually achieved with getFoo() and setFoo()) : by making them look like you're accessing the object's attributes directly, but at the same time allowing the class to control the access. But when a getFoo() method returns a reference to an instance variable (ie. any instance variable that's not an immediate value), the encapsulation is again incomplete: you can modify the object (which is part of another object's internal state), or even change the whole object with replace() in some cases.

The current way of preventing this is to return a clone of the instance variable. But this isn't a good solution in some cases. If the object is big and heavy, it takes a lot of resources to clone it (especially if it's done thousands or even hundreds of times), and it will have been for nothing if the user of the object doesn't even try to change it.

But instead of adding "const" keywords in illogical places all around method definitions, maybe there could be a lightweight "copy-on-write" cloning mechanism? It would first create just a light proxy that accesses the original object. This proxy would treat the original object as "virtually frozen", and if the caller tries to do something that changes the object, then it would clone the original and forward all subsequent calls to the new one.

But I'm not clear on how the checking for frozenness is implemented in the first place, so I'm not sure if this idea is doable. I'll leave that question to the philosophers (and creators of the language :-)

P.S. patsplat: the owner would be any code that has a "clean" reference to the object, and not a "const" reference.

Examples (adde, 2002-05-03 02:02:35)

You could say the owner is the namespace currently responsible for the object.

For example:

class Foo
def initialize
@bar=Bar.new
end

const def bar
@bar
end
end

foo=Foo.new
bar=Foo.bar

The class Foo is the owner and the reference you'd wish to prevent write access thru is 'bar'.

What about more complex objects? (pit, 2002-05-03 17:28:49)

What would you suggest for more complex objects, i.e. an array of strings? Should the strings be "const" too or just the array? What if you have a tree or other arbitrarily complex object structures?

dup is bad too (anonymous, 2002-05-05 05:59:53)

I don't think dup() really solves the problem. In addition to the speed hit, there's another issue, involving the expectations of the programmer.

a = b a.capitalize!

would capitalize a copy of the attribute, which sometimes isn't what the programmer wants (he might think he is capitalizing the attribute itself)... The compiler should complain about this, and simply NOT allow to modify the value returned by the accesor, nor to copy the reference to it.

If we had static typing, we'd check the constness of the references each time they're assigned from/to (at compile time, of course), but as we don't have it, maybe one solution would be to introduce a special constref dynamic type, which cannot be assigned to anything, period.

const = type (anonymous, 2002-05-05 06:01:54)

const looks like part of the type of some variable to me. (remember const_cast? aargh!) Ruby is typeless so const is out of the question as far as I'm concerned. That, and the fear that you will have to keep the compiler happy when some parameters are mistakenly not const but you have only got a const variable that you want to pass to it. Very satisfying when you get it right finally, but no real work done.

Ofcourse, there is a real issue here, I'm not against solving it. Just no const please :)

Depends... (anonymous, 2002-05-05 09:24:59)

Ideally you'd want the strings in the array to take on the constness of the array.
Since what you're saying when you const declare something is that you don't want anyone changing the object through that reference (and that conceptually includes the internal rep. of the object).

Would that be a bad thing? (anonymous, 2002-05-05 09:31:50)

I would love the ability to selectively specify types and have them enforced.

But, const really has nothing to do with types.
What you're saying when you const declare something is that it wasn't meant to be modified.

Const lets you catch mistakes early and communicate more of the ideas behind the code to the next one reading it.

Re: Depends... (peterhi, 2002-05-07 03:37:35)

This is completely counter intuative(?). You have an array that is const but you can add new elments to it, move elements around - ie sort? Sorry if an array were const I would expect the whole array to be const not just bits of it.

If you think freeze is difficult to grasp then what you are proposing only makes things worse.

"const" is static typing (matz, 2002-05-07 03:52:06)

The concept of "your" const is a part of static typing. And static typing does not work well with Ruby's design.

If you don't believe me, try to design "Ruby with const". You will find how big the change would be.

matz.

Why adding Const to a language as Dynamic as Ruby? (keroami, 2002-05-07 04:29:51)

I do not think a hyper-dynamic language like Ruby needs something like "const". It feels out of place and smells like "finalize" with which Java hit me in the face.

It is not like the black-box ideal that some OO proponents evangelize, but it allows me to extend your classes and objects, knowing you can not stop me.

If you need "const" for yourself, consider using some test library for Ruby (RubyUnit, Test::Unit or another of your choice)

PS: If speed is an issue, Ruby is not your language of choice anyway.

a big change, but not static typing (cout, 2002-05-07 08:08:17)

There are lots of ways I can think of implementing const, but all of them are a) incomplete, b) buggy, or c) involve large changes to the Ruby interpreter.

However, const is not necessarily static typing. It's an attribute of the reference or object. The trick is that in Ruby, objects have attributes, but references do not. Unfortunately, I do not know a good solution to this problem.

Problem (adde, 2002-05-07 08:09:26)

Have you considered any other solutions for the problems arising from returning references from inside a class and thereby breaking the encapsulation?

Besides .dup, that is.

Const wouldn't change that... (adde, 2002-05-07 08:17:33)

Excuse me if i'm wrong, but, this sounds like another case of the "I really don't understand how it works, therefore it must be bad"-syndrome.

I can't see how const could/would be used to enforce any black-box-ideals in Ruby.

The errors caused by mistakenly modifying some internal attribute of a class are usually very hard to find, and unittests won't help much.

Same, same, but different (adde, 2002-05-07 08:20:38)

What i was saying, though with other words, was this:
Yes, ideally, the strings in "the" array would be const if the array is const.

And no, i don't think freeze if difficult to grasp. Never found any use for it, though.

Re: a big change, but not static typing (matz, 2002-05-08 01:08:01)

It depends on the definition of the "static typing". I consider "static typing" as attributes of references. By this definition, "const" is part of static typing.

"const" we are talking about is NOT an attribute of objects. The attribute to prevent object modification is "freeze" and the poster refused it.

matz.

Re: Problem (matz, 2002-05-08 01:10:50)

Have you considered any other solutions for the problems arising from returning references from inside a class and thereby breaking the encapsulation?
Besides .dup, that is.

"freeze"?

matz.

Back to square one... (adde, 2002-05-08 02:59:08)

But here we go...

Freeze modifies the original object.
While enforcing encapsulation, i.e. the object can't be modified from the outside, it also prevents further modifications by the owner.

So, it renders any attributes accessed once from the outside read-only forever.

Unfreeze would help, but i've understood from the discussions at the mailing-lists that it would be near-impossible to implement.

/Author (registered)

Implementation (adde, 2002-05-08 03:19:14)

I don't know how hard it would be to add any attributes to an ordinary reference in ruby, since i don't know how references are implemented internally.

But conceptually, this is what needs to be changed:

1) A const attribute must be added to references.

2) Some sort of const-enforcement would need to be implemented to prevent calling an non-const (modifying) method on a const object.
If there was an unfreeze method, the object could be frozen before calling the method and unfrozen after.

3) The standard classes would have to be updated to return const references instead of ordinary references where the returned object is not meant to be modified. To prevent users from (mistakenly) breaking the encapsulation.

4) The standard classes would have to be updated to take const parameters instead of ordinary parameters where the parameter is not meant to be modified. This way you can be absolutely sure that an method won't (mistakenly) modify the parameters you send in.

But, none of these changes would affect any existing code (except for revealing bugs, that is).

Const doesn't alter the behaviour of the language, it adds an ability to enforce encapsulation and contracts between a user/class.

Re: Const wouldn't change that... (peterhi, 2002-05-10 04:45:17)

We do understand how it works, many of us have many years experience in a large variety of languages (Cobol, Fortran, Tabol, FCS, Icon, Pascal, Lisp, Prolog, Basic, Perl, Java, Python, C/C++, Assembler (various) ... I could go on, and thats just me). So I'm a language junkie.

We do understand, we have the knowledge and the experience and from that standpoint we are saying that we can't see the point and that it would be harder to do than you seem to think.

If you are determined that it is easy to do then DO IT YOURSELF, the source to Ruby is available and you obviously have the insight and talent. When you prove us all wrong then Matz can merge your work into the main stream and you can bask in the warm glow of rightousness (sp).

I look forward to seeing ConstRuby V1.0, say lunchtime Wednesday?

Offended? (adde, 2002-05-10 05:23:45)

What i was saying was that your arguments had little to nothing to do with the ongoing discussion.

Surely, with such vast knowledge, you would be able to present a couple of valid arguments for your opinion!?

Again, i think you've misunderstood me. I'm not asking for somebody to do it, i'm simply discussing the problem.

Yet i have seen no valid arguments against adding const to the language.
Except maybe, 'It would be very difficult', which isn't exactly the kind of mentality that's going to make Ruby a better language.

Re: Offended? (peterhi, 2002-05-10 05:48:46)

'...which isn't exactly the kind of mentality that's going to make Ruby a better language...'

When Matz, the creator of the language, is telling you that it will not work the way you think it should because Ruby is not structured the way you think it is and you ignore his considerable knowledge and experience in this matter then I have to ask 'are we dealing with an idiot or a genius'.

Rather than be closed minded about this I await proof of your genius.

Matz (anonymous, 2002-05-11 12:20:24)

The thread in which Matz participated is an ongoing discussion where i can't remember having seen any valid arguments against the proposal as such.

I tend not to take anything anybody says as the one and only truth, regardless of who is saying it.
I was born with a brain of my own and i think that not using it to draw my own conclusions would be a waste.

You can stop waiting right now, as i have no intention to go ahead and try to implement anything before i'm sure that it's a good idea. Also, i have no experience at all from implementing compilers/interpreters, so i'd probably do a piss-poor job anyway.

I have to assume that either you have nothing at all of value to add to this discussion, or you're keeping it to yourself.

/Yes, it's me, though not logged in

Re: Matz (peterhi, 2002-05-13 10:06:37)

Taking it from the top one last time.

-- 1 --

You "I would really appreciate it if people who vote took the time to write a comment and explain why they think it's a good/bad idea, what experience they have etc."

A hopefull start, some very experienced and knowledgeable people are going to talk with you.

-- 2 --

Dave "In Ruby, there is no way the compiler could perform these checks, as variables are untyped."

You "Well, i don't know much about the internal representation of a reference in Ruby, but the 'const'ness has nothing to do with the object itself."

Your modesty is shortlived, note that Dave said variables, which are references, you then mentioned objects. Why? Do you perhaps think that references are objects.

Dave again, trying to clear things up for you "So the const-ness is propagated with the object. Remember in C++ you declare variables, and those variables denote the const-ness or otherwise of the things they reference. In Ruby you don't..."

-- 3 --

You "...maybe one solution would be to introduce a special constref dynamic type, which cannot be assigned to anything, period."

Here you combine type and constness, you will forget this later on.

You "But, const really has nothing to do with types."

Oops.

-- 4 --

Matz "The concept of "your" const is a part of static typing. And static typing does not work well with Ruby's design."

Matz again "I consider "static typing" as attributes of references. By this definition, "const" is part of static typing. ... The attribute to prevent object modification is "freeze" and the poster refused"

You "Have you considered any other solutions for the problems arising from returning references from inside a class and thereby breaking the encapsulation?"

Matz ""freeze"?"

You "i don't know how references are implemented internally."

We have noticed.

-- 5 --

You "this sounds like another case of the "I really don't understand how it works, therefore it must be bad"-syndrome."

You "Surely, with such vast knowledge, you would be able to present a couple of valid arguments for your opinion!?"

You have been presented with valid arguments from knowledgeable and experienced figures in the field of Ruby, Matz and Dave. You have rejected them. If you wont listen to them them you will probably not listen to anybody.

You "Yet i have seen no valid arguments against adding const to the language."

You "i can't remember having seen any valid arguments against the proposal as such."

This is not the same as not being presented with valid arguments.

-- 6 --

You "I tend not to take anything anybody says as the one and only truth, regardless of who is saying it."

You "I have no experience at all from implementing compilers/interpreters"

This is a fatal combination if you intend to start a dialog about the feature set of a compiler / interpreter.

-- 7 --

Ruby is not for you but take a look at Sather (http://www.icsi.berkeley.edu/~sather/index.html) which may be more to your liking. It even has templates of sorts along with types and other good stuff.

Honest you could get on with Sather rather well.

Re: Matz (adde, 2002-05-14 02:44:50)

Have you even notices how you turned this discussion from what you don't understand (const) to why i am such a bad person? You should probably try to get some help for this behaviour, it's not normal.

* You seem to have mixed up some other peoples comments with mine just to get to the results you wanted.
This is the kind of thing that usually happens when you know what you want to find before you even start looking.

* A lot of the above comments comes are taken out the original context and placed in your own selective context (where everything i say is wrong).
Context is everything, don't mess with it.

* Freeze DOES NOT solve the problem. Period.

* Real knowledge comes from listening, thinking and questioning. Not from listening and nodding.
This process requires thinking though, something you might want to try sometime.

* Ruby is for me allright. I've used it for over a year, and i'll continue using it. It's really my descision, not yours.

What if I want to manipulate your const result? (keroami, 2002-05-14 18:27:49)

I hope I understand how it works...

Ruby enforces no blackbox at all. It is so wonderfully dynamic I can screw up the langauge by definging some proper method in Object, I guess. Great, I love that!

An internal attribute that should not be modified, is private and not declared attr_writer or attr_accessor.

So, any result that is accessible outside a class, should be modifiable.

My argument is that when I extend your class, fit for a particular purpose, I do not want to be restricted in the way I deal the objects/references/whatever I get, simply because you can not predict the way I might extend your class.

You may not like the argument. You may think it is fragile, 'coz you've never seen something like this happen. Frankly, I haven't seen it very often, either. But it is an argument and it has not been mentioned by anyone else, afaik.

Invariants (adde, 2002-05-20 04:50:23)

Ok, say you write a customer class:

class Customer

          attr_reader :name

          

          def initialize name

          @name=name

          end

          end

cust=Customer.new
cust.name.capitalize!



          

          This would prevent any extensions of the class Customer where you need to modify the ID.

          But since the class wasn't designed for that, it's probably a bad idea anyway.

`Reason for rejection`

This RCR (regardless its static typing or not) will change the language too much.

`RCR 94: Optional type checks (chrris, 2002-05-06 13:21:06)`

`Status: Rejected`

Hi all,

Often I find myself writing manual type checks as the first thing in my methods. e.g:


class Foo
  def Bar(some, other)
    raise SomeException unless some.is_a? Enumerable
    # do stuff
  end
end

Would it be revolting to allow for optional specification of argument types and let the ruby interpreter to handle these checks and spit out nice error messages (with the calling line) if a mismatch is encountered?

E.g.:


class Foo
  def Bar(Enumerable some, other)
    # rest assured some is Enumerable and do stuff
  end
end

2

`Comments`

`Not convinced (Freaky, 2002-05-06 19:00:32)`

I'm typically more worried that an object responds to a particular set of methods than if it's derived from a particular class.

          

          For instance, String and IO both respond to each_line; I typically don't care if it's actually an IO or a StringIO or a String, or a Flurble object.

          

          Your proposal introduces extra syntax to the language for something I don't think is a particularly good idea in many cases, and for which a shorthand can easily be implimented in the current language, e.g:

def bar(some, other)

    some.is_a! Enumerable

end

Where is_a! raises TypeError rather than just returning true/false.

How typing works in Ruby (anonymous, 2002-05-07 07:41:22)

Ruby is not a statically typed language. In Ruby type and class are not related.

The type of an object is defined by, and only by, the way that it responds to messages. Classes and modules define reusable implementations but do not constrain the types of subclasses. It is perfectly reasonable for subclasses to override the implementation of an inherited module or class and to therefore act as a different type. It is also reasonable for a class to implement a type "from scratch" and to therefore be entirely unrelated by inheritance to other implementations of the same type.

For example, if I implemented a class that conformed to the Enumerable type, your code would flag a type error even though the program was type safe.

style issue.... (patsplat, 2002-05-07 11:41:29)

I think this kind of short cut would be easy to add to the language. It would also sort of answer (some) of the fear of static that people have.

However, as other posters have pointed out, Ruby is completely dynamic. That an object is a kind_of? Enumerable in no way ensures that it will act the way you want to. Even if we put on these extra wheels, those who prefer static typing would discover that they still aren't getting the assurance that they would from Java.

Ruby isn't a staticly (sp) typed language; it shouldn't pretend to be one. Adding this feature would create a misleading appearance of stability, and could be more harm than good. If you want security, Ruby's dynamic typing makes it very easy to write unit tests. This provides much more confidence (in Ruby) than any false static typing constructs.

Perhaps method availability would work better (anonymous, 2002-05-09 02:10:19)

Rather than trying to enforce typing (not the Ruby way), perhaps there could be a way to easily check that a particular object offers a certain set of methods expected. Perhaps a method that accepts an array of strings and will test that there is a method available for each name in the array.

there is an easy way... (patsplat, 2002-05-09 14:44:09)

see aObject.methods, aObject.respond_to?

also,

(aObject.methods & expected_methods) == expected_methods

That doesn't actually help (anonymous, 2002-05-13 08:06:06)

See on the wiki for more details.

type checking (anonymous, 2002-05-13 17:35:20)

Type specification could be optional or we could explicitly specify when we don't want it. If ruby doesn't do it first, perl 7 will!

Type != Class (anonymous, 2002-05-15 17:26:00)

Since Smalltalk, the argument "type and class are not related" has been repeated over and over again. While this is true (from a certain point of view), the original question was about type and not class checking (although the example in it was unluckily chosen).

Staticly typed languages used to treat types and classes the same, but the notion of interfaces changed that. Interfaces enforce what Smalltalks protocols merely describe.

The notion of a type is obviously very important for both static and dynamic languages. Static languages, however, at least try to embed that concept into the language explicitely.

So I'll refrase the original question as:

Would it make sense to introduce the concept of an optional type into a dynamic language?

Speaking for myself:
I feel much better if I know a message arguments conformance to a required type is (or can be) enforced, even if this is limited to a syntactical conformance.

Interfaces do not enforce types in polymorphic languages (anonymous, 2002-05-16 10:26:56)

Static typing based on names can only be used to check types at compile time in languages without polymorphism (e.g. pascal, modula-2, etc.). In languages with polymorphism, an interface type is separate from its implementations, and therefore it is up to the implementors of that type ensure that their implementation conforms to the type. Static typing can only enforce part of this conformance: it can catch simple errors but misses more complex errors such as those caused by invalid implementations of the protocol associated with a type. Thus typing in polymorphic languages with interfaces is only a little more than a convention that implementors must follow, rather than a rigorous check.

This is not much more secure than how typing works in Ruby, and a lot less flexible. In Ruby implementors of a protocol must ensure that their class conforms to that protocol. Unit tests are used to check that the implementation actually does conform. Mock objects are used to check that an object or method correctly uses other objects according to their protocols. Those unit tests and mock objects should be provided by whoever defines the protocol. This gives you more flexibility than static typing, and more rigorous checks.

Finally, explicit declarations of protocol conformance such as Java's "implements" clauses, are overly inflexible. If an class implements a protocol but does not declare so, the type system considers the class incompatible with the protocol, even though it is. Also, interface declarations make it hard to work with objects that only require part of a protocol -- just the readchar method of an IO stream or just the each method Enumerable. An example of the difficulties this causes is evident in the Java collection libraries where one must implement a large number of methods just to make something act as a list or collection even if one only needs the size() and get() methods.

This is a specific case of Design By Contract (anonymous, 2002-05-17 18:18:05)

I think it would be more useful to apply design-by-contract than shoehorn what looks like static typing onto Ruby.

How about .. (anonymous, 2002-06-12 01:47:41)

A definition like:

def method(Type a)

would make the interpreter call the method Type passing a as argument, wich should return an object of class Type.

design by contract (raganwald, 2002-08-01 10:35:47)

I agree with this 100%.

Static type checking is an excellent language feature. It doesn't hinder great programmers, and it does make programs more readable and reliable. Constraints such as type checking and unit tests are the foundation of agile development.

BUT

Rather than catching up with C++, Ruby can leap frog it and consider more powerful constructs such as Design by Contract.

Reason for rejection

By this, Ruby will no longer be Ruby. Try UnitTest you program.

RCR 95: require (transami, 2002-05-06 15:24:36)

Status: Rejected

when ruby looks up a file specified by require (and perhaps load) it should search, as it does, in the predefined paths, the RUBYLIB paths, and -I paths (all of which are in $:), but it should also recursively search all sub-directories in those paths as well.

the reason for this is simple. to keep libraries organized it is nice to group related files into sub-directories rather then just dumping them all into one of the predefined directories. but by doing this one is required to manually add the subdirectory to the load path or use "require 'subdir/file'" in one's programs. either way this creates problems with cross-compatability because my libaries then have to be organized in the same manner as yours. to see a good example of how this problem manifests, look at dbi. there's actually a file called dbi.rb that contains one line: "require 'dbi/dbi'". how silly is that!?

if ruby searched sub-directories (giving precedence to files higher-up in the directory tree) these problems would go away. 2

Comments

Re: require (vjoel, 2002-05-06 15:46:44)

The problem seems to be that this opens up a broad global namespace for required files.

One use of subdirs is for a library be able to have some files in a dir that is not on the global search path. The library can require them explicitly, but the files won't conflict with any other library's files. How would you do this if subdirs were searched?

precedence (transami, 2002-05-06 18:08:23)

through the simple use of precedence.

ruby would search from the top of the directory hiearchy down, as soon as it comes across a matching file that's the one it uses. you could still specify files explicitly using an exact path in the require statment, i.e require 'path/file', if needed, which would take care of those rare conflicting cases.

Slow, violates POLS, makes children cry (Freaky, 2002-05-06 18:30:16)

There could be hundreds of directories in $:, do you really think it's sensible to walk them all and possibly hit the wrong file?

Also, no other require implimentation I have seen works this way, and it is not in any way expected behavior.

People will get upset if they do a require 'foobar', foobar isn't there, and end up loading a completely different foobar. Little children will probably cry as they get frustrating and difficult to trace errors.

If you really want this behavior, impliment your own method to do it, but you can be sure it's not what most people want.

Re: precedence (vjoel, 2002-05-06 18:45:22)

What if my-lib has a file called setup.rb, which is in the my-lib/lib subdir. Of course I can require it by explicitly giving the path. But what if another lib also has setup.rb. That lib has to use the explicit path to its own steup.rb, or else it breaks. So now we've fallen back to the old style of require...

In the current require semantics, I can hide setup.rb in a "namespace" (a subdir), and I can structure these "namespaces" isomorphically with Ruby namespaces.

okay point taken, but what about a solution? (transami, 2002-05-06 19:11:06)

i understand the points given against the a recursive search, but we still have the ugliness of the problem i was trying to solve with it.

what about the dbi example i gave?

and another one, that i just delt with which provoked this idea in the first place: i installed xslt4r. xslt4r comes with xmlscan, which it requires. i didn't want to just dump all the the xmlscan files into site_ruby/1.6/. i wanted to cleanly seperate them into a subdirectory 'xmlscan', but this caused me to have to go through every xmlscan script and the xslt.rb script and change every require statement to add the path.

perhaps part of the problem lies with library developers not putting there files into a directory in the first place. but how can you force them to do so?

so, while i see the downside of recursing the load paths, i ask you what is the right solution?

What's the real problem? (root, 2002-05-06 19:23:21)

In the dbi case, that's the organization the author chose. It would also have been possible to put the functionality in the top-level file, but the author prefered grouping it at a the same level.

In terms of forcing them to use directories for libraries: I'm not sure how the recursive scan would help. That's just a matter of making polite suggestions to the authors.

okay let me summuraize the problem (transami, 2002-05-06 20:13:08)

what would happen if i tried to install every libray in the RAA? while i can't be 100% on this since i haven't tried it, i can already tell you from the libraires i have installed that most of them just expect all their files to be dumped into a loadable path, like 'site_ruby/1.6'. now tell me that won't cause any name conflicts! so there's the heart of the problem. it doesn't really matter if there are potential name conflicts with a recursive require, because we already have potentail name conflicts with the way things are. all smart developers should put their libraries in a subdirectory anyway and avoid the problem. and perhaps no library should be allowed onto the RAA if it dosen't.

then again, there is antoher level of thought to this issue. wouldn't it be nice if a library could have private files? but that's probably too much to ask.

anyway, as it stands, to keep manually installed libaraies organized, avoiding name conflicts, one has to potentially define their own subdirectories and alter require statements for every related script. NOT very convienent or time efficient, or portable.

Possible solutions (anonymous, 2002-05-07 11:29:09)

The cure would be worse than the disease, I think, if we recursed subdirectories. Requires take long enough already. Maybe some discipline in naming directories in site_ruby would be better. It might take longer to type require 'foo/bar', or 'foo/bar/baz', but it still isn't quite as horrible as the java com.rubygarden.www.gee.whiz.this.is.long.

If namespace is the issue, recursing directories worsens the problem; it becomes even harder to know just where ruby picked up the library.

Re: require (patsplat, 2002-05-07 11:49:02)

This sort of is related to a suggestion I made to allow require 'aDir'

http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-talk/39025

However, the consensus from the list was that it was not necessary.

I think the point is that if the libraries are disorganized, require will always be difficult. Instead of beefing up require, the libraries should be organized.

~ Patrick

you're right (transami, 2002-05-07 16:52:42)

you're right, it would be worse. just wish there was an easy solution.

any ideas, anyone? (transami, 2002-05-07 16:55:05)

so does anyone have any rock'n ideas on how to get the libraries orgranized?

here's one way: (patsplat, 2002-05-08 17:09:04)

> perhaps part of the problem lies with
> library developers not putting there
> files into a directory in the first
> place. but how can you force them to
> do so?

Don't force them, fork them.

Take the project dir, reorganize the source, make sure as much of the build scripts continue to work, etc. Make it clear that the purpose of this fork is to improve some problem with the way the files are organized.

then...

Politely present it to the maintainer:

"Here's a fork. Here's why I did it."

If they disagree with your suggestions, politely announce the fork publicly:

"I love everything about this library except for the way the src is organized. If you feel the same way, you can download the fork from ______."

If other people agree that you have an important point, the fork will be used. Actual use of a fork is a easier way of justifying a code change. Instead of "I think this, but you think that", substitute "This portion of the community at large agrees with these changes."

After announcing the fork, there are several positive outcomes:

Not many people really care about using the code. Maybe it wasn't a major issue; no bother, you at least fixed it for yourself.
Many people use the fork, and the maintainer agrees that the fork should be merged. You've helped the maintainer by demonstrating the existence of a problem, and a viable solution to that problem.
Many people use the fork, but the maintainer doesn't want to merge the code. Oh well, no biggie. You've made life easier for the people using the fork. Even if you don't want maintain the fork, there's a chance that someone else might want to maintain it.

Open Source developers do not work for you; they work for themselves. It is your own responsibility to make things work for yourself. Chances are, you will generate code that is worth releasing publicly.

why is "com.rubygarden.www.gee.whiz.this.is.long" bad? (anonymous, 2002-05-17 16:57:46)

Hi!

New to Ruby... just thinking about diving into it and decided to read a bit on it here.

Coming from Java, I have a comment on this. To begin with, I really hated all those import statements in Java. However, after I started using IDEA from IntelliJ.com, I realised that this is really not something that a developer should be worried about. Since I started to use a proper editor, then I have never had to write a single import statement or package statement, for that matter.

The point? Why live with the risk of incorrectly resolved "requires" when this is something that can be automated by the editor? Couldn't this be solved for Ruby like IntelliJ has solved it for Java.

Note: never really developed anything in Ruby, so at least I don't have any experience in organizing my code in directories.

Cheers...

Reason for rejection

this can cause unexpected name conflict.

RCR 96: class/module name must (NOT) be a CONSTANT (transami, 2002-05-06 18:58:02)

Status: Rejected

Wouldn't it be nice if mixins could be variable?

i'm trying to build a class that needs two sets of methods to be passed into it. at frist i put the two sets of methods into seperate classes and passed them into the main class as part of the initialize method. but doing so dosen't allow one set of methods to call the methods in the other set. then it occured to me, a-ha!, what i need to use are mixins. but this didn't work because the include statement requires a CONSTANT, and i don't know the method's name before hand! oh-no! is this a limitation of ruby? can ruby be changed to accomadate this? or is there already a way to work around this? here's example code to help:

class MainClass
  def initialize(class1, class2)
    method_set1 = class1.new
    method_set2 = class2.new
  end

  def say_stuff
    method_set1.say1
    method_set2.say2
  end
end

class MethodSet1
  def say1 
    puts "Hello World!"
  end
end

class MethodSet2
  def say2 
    puts say1  # won't work
  end
end

main = MainClass.new(MethodSet1, MethodSet2)
main.say_stuff

on the other hand:

class MainClass
  def initialize(class1, class2)
    include class1  # not a CONSTANT error :(
    include class2  # not a CONSTANT error :(
  end

  def say_stuff
    say1
    say2
  end
end

module MethodSet1
  def say1 
    puts "Hello World!"
  end
end

module MethodSet2
  def say2 
    puts say1  # this would work, if not for the CONSTANT errors
  end
end

main = MainClass.new(MethodSet1, MethodSet2)
main.say_stuff

Comments

you can get at these objects (dblack, 2002-05-06 19:35:22)

Hello --

Here's one drop-in replacement for your initialize, and there are probably slicker ways to do it:

  def initialize(class1, class2)
    (class 


Please don't take offense at this, but it's probably best to bring things like this up on #ruby-talk/comp.lang.ruby, rather than start by suggesting a change to the language.  You'll find lots of help, and often lots of different idioms for doing what you want to do.

(repost) you can get at these objects (dblack, 2002-05-06 19:38:08)

(whoops, the post came out very different from the preview.... here's a plain-text version, not very nice-looking)

Hello --

Here's one drop-in replacement for your initialize, and there are probably slicker ways to do it:

def initialize(class1, class2)
(class <<self;self;end).class_eval <<-EOE
include #{class1}
include #{class2}
EOE
end

Please don't take offense at this, but it's probably best to bring things like this up on #ruby-talk/comp.lang.ruby, rather than start by suggesting a change to the language. You'll find lots of help, and often lots of different idioms for doing what you want to do.

thanks (transami, 2002-05-06 20:21:10)

yes, you're right. no offense taken. but ummm, well...how does one participate in this #ruby-talk business? i can figure it out if you could just point the way. thank you. i know it might seem a dumb question, but sometimes even dumb questions need to be asked!

anyway, your solution is about the uglist piece of ruby code i've ever seen, but if it works, it'll have to do. thanks for that too.

extend (anonymous, 2002-05-07 00:35:37)

class MainClass
  def initialize(class1, class2)
    extend class1
    extend class2
  end

  def say_stuff
    say1
    say2
  end
end

module MethodSet1
  def say1 
    puts "Hello World!"
  end
end

module MethodSet2
  def say2 
    puts say1  # this would work, if not for the CONSTANT errors
  end
end

main = MainClass.new(MethodSet1, MethodSet2)
main.say_stuff

$ ruby 1.rb
Hello World!
Hello World!
nil

perfect (transami, 2002-05-07 03:45:42)

tried and true. thanks.

re: thanks (dblack, 2002-05-07 04:53:04)

Hi --

I always forget 'extend'. Glad someone else is on the ball :-)

There's information about subscribing to #ruby-talk at http://www.ruby-lang.org/en/ml.html. This mailing list and comp.lang.ruby mirror each other.

Reason for rejection

This RCR is based on misunderstanding, I guess.

RCR 97: Assume quotes for simple hash keys (anonymous, 2002-05-06 19:18:02)

Status: Rejected

Allow simple hash keys to be auto-quoted. For example:

days = Hash.new
days[Apr] = 30 # identical to...
days['Apr'] = 30

Coming from Perl, you don't realize how much you miss this until it's not available.

This isn't a big issue, I just thought I'd see what people thought.

Comments

Not in the spirit... (root, 2002-05-06 19:20:47)

Ruby doesn't know that days[Apr] is a hash reference: the resolution of the [] operator is performed at run time. So this RCR really asks for any name between []'s to be stringified, which would be unexpected.

Apr might be a variable (transami, 2002-05-06 19:29:02)

what if you had:

Apr = 'May'

what would you get then? (please ignore that i am overlooking the use of the capitalized first letter.)

produces:
31

A better reason for rejection... (anonymous, 2002-05-07 08:43:41)

...to my mind would be the fact that unlike in Perl, in which hases are only indexed by strings, Ruby hashes can be indexed by anything. E.g., all the following are legal:

h = {} h["the answer"] = 42 h[42] = "the answer" h[[1,5,10]] = "My key is an array" infile = File.new("mystuff") h[file] = "my stuff. keep out!"

Obviously, having the hash index operator automaticly quote it's contents would completely defeat this capability.

BTW, to save a keystroke in hash indexes, use symbols (:key) instead of strings ('key').

Nay (whytheluckystif, 2002-05-08 18:22:36)

I find the non-quoted version so much more confusing (in Perl and PHP also). If it's a single word, I'll often hunt around for the variable it's referencing to no avail. It also seems that Ruby would encourage use of symbols here, so I would much rather see days[Apr] mean day[:Apr], but I'm happy if gets by.

Reason for rejection

Unlike Perl, bare words might be variable names in Ruby.

RCR 100: Array.{assoc,rassoc}_index() (HughSasse, 2002-05-13 11:16:25)

Status: Rejected

1

Array.rassoc and Array.assoc are a great way to use an Array of Arrays as an ordered Hash. However, if you want to update the data in this structure, assoc and rassoc won't tell you where it is, just what it is. You can use find (etc) to locate it, but this means that you don't make use of [or "leverage" if you prefer :-)] the fact that (r)assoc "knew" where the data was when it found it.

Since (r)assoc just returns the whole subarray, what about rassoc_index() which would behave exactly like (r)assoc, but return the index, so you can pass that into Array.[] to read and/or change the data there?

I think it would ease the creation of these nested structures, which would make assoc and rassoc more useful.

Comments

even more (transami, 2002-05-17 13:51:46)

while this would be useful, i think it would be better if an ordered hash class were implemented. i have ran across a couple of needs for such an entity, and have thus had to use assoc arrays instead, but as you point out here, they aren't nearly as convienent. becasue of this i have often ported my assoc arrays back to hashes if they no longer require ordering. but sadly i could find no .to_h that converts assoc arrays to hashes even though there are easy ways to convert the hash to assoc arrays. anyway, lets get past this whole deal and implement ordered hashes as a built-in class, please!

three options (cout, 2002-05-18 09:31:09)

- needs to be a balanced binary tree
- an interface to libavl
- seems to be a wrapper around Array

Reason for rejection

It's not how assoc works. a = [[1,2],[2,4]] b = a.assoc(1) b[1] = 5 p a # => [[1,5],[2,4]]

RCR 101: a little more hash please (transami, 2002-05-17 14:08:25)

Status: Rejected

using hashes quite a bit, i have thought of two simple improvements:

1. add a << method synonymous with update
2. add a collect method like that of the array's

both of these would improve my code.

in general it seems like the array class is much more developed then the hash class. so it would be nice to see the hash beefed up. yum!

a built-in ordered hash class would be very nice too, along with a side of scrambled eggs ;-)

and sure wouldn't hurt if there was a .to_h method to convert assoc arrays to hashes either.

have at it! 2

Comments

Re: a little more hash please (cout, 2002-05-17 15:38:37)

a built-in ordered hash class would be very nice too, along with a side of scrambled eggs ;-)

If it's ordered, then I don't think it is still a hash. I know of no easy way to get O(1) element access and still retain order. The closest thing I know of is a balanced binary tree, like C++'s std::map which has O(log n) element access.

I would definitely like to see a built-in Map.

and sure wouldn't hurt if there was a .to_h method to convert assoc arrays to hashes either.

Hash[] is similar:

irb(main):001:0&gt; x = [ [0, 1], [5, 6] ]

          [[0, 1], [5, 6]]

          irb(main):002:0&gt; y = Hash[*x]

          {[0, 1]=>[5, 6]}

enhancements to Hash (dblack, 2002-05-18 09:28:52)

I'd like to start by suggesting that you use a more precise subject line. Also it might be a good idea to separate out the various things you're suggesting into separate posts.

In any case, a couple of things to note:

Hash#collect already exists:

irb(main):031:0> h={1,2,3,4}
{1=>2, 3=>4}
irb(main):032:0> h.collect do |k,v| 10 * k  end
[10, 30]

There's been a lot of discussion in the past about #to_h possibilities. A good starting point to research these discussions is .

PHP has an ordered "hash" (Freaky, 2002-05-21 06:57:11)

Might be worth a sniff around the Zend sources.

I did find myself in need of an ordered list the other day; a case where a hash-like structure would have make it conciderably cleaner than an array of arrays.

Reason for rejection

Separate your proposals into individual RCRs.

RCR 102: a rand method for Enumerable (repeater, 2002-05-21 17:07:24)

Status: Rejected

a recent post to the rubytalk mailing list has again emphasized a thing i sense as a necessity. apologies if this has already been considered...a search revealed nothing

the best posted answer to a password generation query, were something in the line of:

def generate_password(minlength=5, maxlength=10)
        chars = ("a".."z").to_a + ("A".."Z").to_a + (0..9).to_a
        wordlength = rand(maxlength - minlength + 1) + minlength
        (1..wordlength).collect { chars[rand(chars.length)] }.to_s
end

to me, the following is more elegant, and it illustrates the modification that i request at two places:

def generate_password(minlength=5, maxlength=10)
        chars = ("a".."z").to_a + ("A".."Z").to_a + (0..9).to_a
        #usage 1: wordlength equals a random value in the range
        wordlength = (minlength..maxlength).rand
        #usage 2: a random character is retrieved from chars array
        (1..wordlength).collect { chars.rand }.to_s
end

i therefore request that a rand method is added to Enumerable. the retrieval of a random element from an enumerable construct is a very common practise, and therefore i believe that it is worthy of presence as part of the Enumerable mixin. seeing arr[rand(arr.length)] is a burden to behold

an alternative is to overload the rand kernel function so one can say: rand((10..20)) in addition to rand(int). this would seem more consistent, but less beautiful

Comments

Re: a rand method for Enumerable (anonymous, 2002-05-21 17:38:00)

seems a very reasonable request and the Ruby way

Except... (root, 2002-05-21 17:46:21)

Enumerable does not imply finite: IO classes such as file are Enumerable.

all things a finite (transami, 2002-05-21 18:04:15)

okay so maybe some things are really big, but that's your own silly choice if you try to rand an indefinite IOStream or something. seems to me if you can sort, max, min, etc. an enumberable there's no reason it can't return a rand element.

Re: Except... (patsplat, 2002-05-21 20:36:23)

I don't know if I mind this.

Enumerable already has this risk (patsplat, 2002-05-21 20:39:19)

to_a is already part of Enumerable, and has the risk that rand would.

~ Patrick

True... but (root, 2002-05-21 20:51:17)

I personally think that the Enumerable interface should be split into Iterable and Enumerable, where Iterable means you can access elements one after the other and Enumerable means the elements are countable (and reasonably efficiently so). Methods such as to_a and rand have serious side effects: on infinite streams they'll loop, and on single-pass streams (such as files) they consume the content the first time and then fail.

So... I'd rather not see more methods added to enumerable until the semantics of the module is clarified.--Dave

Re: a rand method for Enumerable (peterhi, 2002-05-22 03:38:30)

If the enumerable object was a stream wouldn't fred.rand have to read in the whole stream before picking an element?

I have a prime number class that is enumerable but has no realistic upper limit. primestream.rand would never return in the life of this universe or any other.

How about rand on range objects (10..13).rand only or is this heading into the Enumerable / Iterable division?

Is there a shuffle operator?

Both are nice ideas (glv, 2002-05-22 10:20:32)

I like this answer, Dave. And once that's done, adding 'rand' to Enumerable sounds really great to this old Icon hacker ...

excellent notion (transami, 2002-05-22 11:04:39)

that's even better. make the split!

Reason for rejection

I'm not sure how useful this is. How about something like "Array#choice"?

RCR 103: nested classes (transami, 2002-05-21 17:56:04)

Status: Rejected

nested classes seem of great usefulness, expect for the fact that they are rather useless ;-) in fact the only use they seem to have is for code organization, which is too bad. for they would be of great use if they were tied to their "nest-parent". if a nested class could not be initiated without first initiating an instance of the nest-parent class, this could allow the nested child to access the instance variables and methods of the that nest-parent. in other words the initialization of a nested class should be within the context of an instance of its nest-parent.

i do not know how hard this would be to implement. it would probably require a proxy of some sort. but its utility would be quite worth it. i have already run across numerous uses and sadly i have been forced to pass the parent's self to the nested class to gain access to the parents internals. ugly. and in one instance that didn't seem to cut it either. frustrating. 2

Comments

namespace organization is nice :-) (patsplat, 2002-05-21 20:34:47)

in fact the only use they seem to have is for code organization

this is not trivial

for they would be of great use if they were tied to their "nest-parent"

This would tie namespace organization with the implementation. Currently, User could be a data object, and User::Login could be a statement object which logs in the user. Currently, these two classes can have totally different interfaces appropriate to their different purposes. Binding them together would break this.

i do not know how hard this would be to implement

This behavior sounds like a form of inheritance to me. implementing it might be easier than you think :-)

later,

~Patrick

Re: nested classes (matz, 2002-05-22 00:01:00)

As patsplat stated, namespace issue is very important. Your proposal may be useful for some cases, but narrows application. For example, File::Stat class will not work any longer with your proposal.

matz.

Nifty in Java; Superflous in Ruby (anonymous, 2002-05-22 09:38:35)

The reason inner classes are so useful in Java is that Java doesn't have closures. Ruby's closures have all the power of inner classes, with considerably more flexibility.

If you're trying to emulate Java's anonymous inner classes (nameless inner classes defined within a method), you can do something very similar with singleton classes defined within methods.

thanks patrick! :-p to the rest of you! (transami, 2002-05-22 11:01:32)

class Class

        INEW_MISSING = %q{
                def method_missing(m, *a)
                        if a.empty?
                                eval("#{m}", INEW_BINDING) 
                        else
                                eval("#{m} *ObjectSpace._id2ref(#{a.id})", INEW_BINDING) 
                        end
                end
        }

        def inew(b, *args)
                const_set("INEW_BINDING", b)
                module_eval(INEW_MISSING)
                alias_method :initialize, :instance_initialize
                obj = self.new(*args)
        end
        
end


class P

        attr_reader :x, :c

        def initialize
                @x = 10
        end

        def y
                puts "self.name"
        end

        def z(n)
                puts "self.name #{n}"
        end

        def make_c
                @c = C.inew(binding)
        end


        class C
                
                def instance_initialize
                        puts "Instance initialize with whomever's binding."
                end
                
                def all
                        puts "#{x}"
                        y
                        z 20
                end
        
        end

end

p = P.new
p.make_c
puts p.c.x
p.c.y
p.c.z 20
p.c.all

notice the lack of @ on the x in class C! spooky. only downside is having to pass the binding with Class#inew. also notice that C dosen't have to be defined in P for this to work. if Class#inew were to be implemented as a standard part of ruby these short-comings could be remedied. okay, someone prove to me why this is a bad idea. ;-)

Try it with Delegate (JimWeirich, 2002-05-22 12:10:34)

What you are asking for sounds a lot like delegation. Have you checked out the delegate.rb file that comes with Ruby? Your example could be rewritten as this ...

require 'delegate'

class P
  # ...
  # Most of P stays the same ... see the original example 
  # ...

  def make_c
    @c = C.new(self)
  end
  
  
  class C 

What is missing from this technique that is included in the RCR?  I see two things ...

Self needs to be explicitly passed to new. Annoying, but not a show stopper.
No access to instance variables. Since Ruby (like Eiffel) practices object based protection rather than class based protection (like C++ or Java), this seems like a reasonable restriction.


I haven't seen a big need for this idiom in my own code.  Would you mind posting some examples where this would be a big win?

Try it with Delegate (JimWeirich, 2002-05-22 12:14:59)

Yuck, my previous posting got scrambled when posting ... I'm trying again. -- Sorry

What you are asking for sounds a lot like delegation. Have you checked out the delegate.rb file that comes with Ruby? Your example could be rewritten as this ...

require 'delegate'

class P
  # ...
  # Most of P stays the same ... see the original example 
  # ...

  def make_c
    @c = C.new(self)
  end
  
  
  class C &lt; DelegateClass(P)
    def all
      puts "#{x}"
      y
      z 20
    end
  end
end

p = P.new
p.make_c
puts p.c.x
p.c.y
p.c.z 20
p.c.all

What is missing from this technique that is included in the RCR? I see two things ...

Self needs to be explicitly passed to new. Annoying, but not a show stopper.
No access to instance variables. Since Ruby (like Eiffel) practices object based protection rather than class based protection (like C++ or Java), this seems like a reasonable restriction.

I haven't seen a big need for this idiom in my own code. Would you mind posting some examples where this would be a big win?

delegator helps and example explination (transami, 2002-05-22 13:21:40)

well the Delegator class is sufficient for my needs. i can make that work. i looked at the code and its much more long-winded then the missing_method hack i did, but appears the more appropriate means.(forgoing my RCR)

as for an eample, the two i have are too big to fit in this little comment box, so i'll just describe them. in both, the need for the idiom arises from the use of REXML's stream parser. if you've ever used it you'd probably understand. one must define a class, called a listener, which defines the required parse methods. that listener class fits most nicely as a nested class inside the class the calls on it, and as such could very well use easy access to the nest-parents methods and instance variables -- in my case a validating xml document and three "piecemeal" parsers that the listener could use but i also want publically available in the main class. does that explain a good use?

Re: nested classes (transami, 2002-05-22 22:18:57)

hi matz. i was just thinking about this a bit more. the kind of nested class i am describing seems just to be an object-specific class with a name, so that it can be initiated, but only from the nest-parent class. would it be possbile to have another "type" of class, say an instance_class or modifier like class Parent ...(parent_stuff)... end p = Parent.new class <<p ...(child stuff)... end takes on the form

class Parent
  ...(parent_stuff)...
  private
  instance_class Child
    def instance_initialize
      ...
    end
  end
end

or the instance_class line could read, "class Child
~tom

delegate didn't cut it (transami, 2002-05-23 14:45:00)

so i tried delegate with the example i was siting and it wouldn't work. it kept telling me that i was passing 1 argument for 0 when trying to create my child class. in other words it wouldn't except the self argument. go figure. i looked it over again and again to make sure i had it all just so, and could not find the flaw. oh-well. i scrapped delegate and worked around my problem another way.

p.s. i had said that the lack of @ was spooky in a previous post. my mistake, at the time i hadn't realized the child was simply accessing the attribute method.

Initialize problem? (JimWeirich, 2002-05-24 11:43:07)

Sounds like a problem with initialize in your delegating class. Did your delegating class have an initialize method? Did the initialize method explicitly pass the "delegatee" object to super? It should probably look like this (warning ... untested code):

  class Nested 

The example in my original post didn't have an initialize method, so the default one was inherited from DelegateClass(Parent).

Initialize problem? (JimWeirich, 2002-05-24 11:44:51)

Beware of < characters when you post in HTML format. Using them will cause your post to look crazy. Sorry, this is the second time in this thread I've done that

Sounds like a problem with initialize in your delegating class. Did your delegating class have an initialize method? Did the initialize method explicitly pass the "delegatee" object to super? It should probably look like this (warning ... untested code):

  class Nested &lt; DelegateClass(Parent)
    def initialize(delegatee, other_args)
      super(delegatee)
      # more code
    end
  end

The example in my original post didn't have an initialize method, so the default one was inherited from DelegateClass(Parent).

that explains it, but sort of defeats it, and a better idea (transami, 2002-05-24 18:12:38)

that would explain the problem, but then it sort of defeats the purpose. i can always just pass the parent's self and access it that way, which is what i did in fact. ex-


class P

  attr_reader :x

  def initialize
    @x = 10
    c = C.new(self)
  end

  class C
    def initialize(p)
      puts p.x
    end
  end

end

my original thought on all this was that passing the parent's self should be implicit since it is a NESTED class. seemed to me they should do more than just organize code, but i digress.

perhaps that's what i should really be asking for: a given built-in parent object for private nested classses, or hell, why not just have a built-in object called parent for all classes which points to the object from which it was created. that would solve this whole issue and then some. what do you think of that?

Anonymous Class (devEiant, 2002-06-07 07:47:59)

If you want a class that only the "parent" can see that inherits from itself, try this:

class Parent

        @@innerClass = Class::new( Parent ) {
                self.class_eval %{
                        def initialize
                                puts "I'm a child"
                        end
                }
        }


        def initialize
                @myChild = @@innerClass.new
                puts "Created inner child #{@myChild.inspect}"
        end
end


p = Parent::new
p2 = Parent::new

hmm :o (anonymous, 2002-11-27 05:51:06)

ive run into some situations where a parent object would be very useful. It would help simplify writing objects that react to the context of their creation. although, an object can exist out of its parents scope, which might skew the users understanding of the childs functionality. I can live with just passing the parent to its child.

scratch that... (anonymous, 2002-11-27 05:53:03)

didnt see the 'private' part.

Reason for rejection

Incompatible, too big to change.

RCR 104: New methods: Array#rotate{,!} (nobu, 2002-05-22 22:28:50)

Status: Rejected

Array#rotate!(index) rotates self and places the element was at index at first position.
Array#rotate(index) returns rotated new array.

[1,2,3].rotate(1)  #=> [2,3,1]
[1,2,3].rotate(-1) #=> [3,1,2]
[1,2,3].rotate(3)  #=> IndexError

Comments

Relative version of rotate... (Stephan, 2002-05-23 02:14:57)

When I see rotate(n), I would interpret it a rotate this n times:
[1,2,3].rotate(3) #=> [1,2,3]
That is, I would expect rotate(i) to relatively rotate the elements n.abs times right/left (depending on the sign of n).

Re: Relative version of rotate... (peterhi, 2002-05-23 03:56:45)

Yes, this is what I would expect too.

both kinds (transami, 2002-05-23 14:38:41)

could have a rotate(n) which is relative, as i would expect it to be as well, and a rotate_to(i) for the first notion.

Re: New methods: Array#rotate{,!} (anonymous, 2002-05-23 23:46:15)

Well what about using the standard mathematical term ``transposition''.
(Modulo Argument exception)

class Array
def transpose(i,j = 1)
tmp = self[i]
self[i] = self[j]
self[j] = tmp
self
end
end

/Christoph

Sounds good (Stephan, 2002-05-24 02:30:52)

I was thinking about having both methods but din't find a good name for the 'absolute' version.
rotate_to(i) sounds good.

No need for two methods (marko, 2002-05-24 04:51:57)

rotate(n) and rotate_to(n) would be the same for nsize is a special case that doesn't justify a method on its own.

marko

true, although lacking convenience (transami, 2002-05-24 17:52:03)

so your saying:

arr.rotate_to(i) = arr.rotate(arr.length - i)

is that the right calculation?

transposition is not a rotation (transami, 2002-05-24 17:54:31)

this would be another method all together.
might not be a bad addition on its own, but it is not a rotation.

Re: true, although lacking convenience (marko, 2002-05-25 08:01:31)

No, if Array#rotate() rotates to the "left" (pushing members that fall of the beginning unto the end), then if i < arr.size



          arr.rotate_to(i) == arr.rotate(i)

The difference between them is merely in what happens when i becomes too large (and maybe too small).

If they rotate in a different direction, I would rather call them rotate_left and rotate_right

Note: Sorry, my previous post got mangeled, because of < and > in it.

marko

Re: both kinds (nobu, 2002-05-29 21:36:06)

What do you mean by relative and what is absolute?
I guess rotation is always relative.

Reason for rejection

Do we REALLY need this?

RCR 105: Change do...end to support exception handling (ser, 2002-05-24 18:44:33)

Status: Rejected

I'd like to see the Ruby syntax to be extended to support blocks-as-exception-handling. This would be purely syntactic sugar, but it would also allow easier handling of exceptions in some specific cases.

Ruby already allows you to shortcut the begin...rescue...end syntax in method definitions with:

def method
  #...
rescue Exception
  #...
end

I suggest extending this syntax to do...end, so that do...rescue...end becomes legal.

The motivation for this pertains to threads. Right now, exceptions occurring in threads are silently swallowed by Ruby, so exception handling in threads becomes:

my_thread = Thread.new do
  begin
    #...
  rescue Exception
    #...
  end
end

With the new block syntax, this would become:

my_thread = Thread.new do
  #...
rescue Exception
  #...
end

This syntax of course would be usable anywhere blocks occur, and I believe it would make the overall syntax cleaner and encourage responsible exception handling.2

Comments

What about {} style blocks (JimWeirich, 2002-05-28 09:46:01)

Should {} style blocks also support this?

For example ...

my_thread = Thread.new {
  #...
rescue Exception
  #...
}

It depends. (ser, 2002-05-28 18:55:53)

At first blush, I'd say (IMHO) no, just because the syntax looks funny to me. However, if it were easier to implement than to not implement -- that is, if the block-parsing algorithm in Ruby is the same for {...} as for do...end, and if Matz would have to specifically add code to disallow {...rescue...} -- then I'd say it should be allowed.

As to implementation. (nobu, 2002-06-01 01:29:24)

There's no problem to modify do...end syntax only, by changing just 2 lines in parse.y.

It should. (anonymous, 2002-06-05 10:26:23)

Since it's repeatedly stated in documentation (and Programming Ruby) that the only difference between do/end and {} is the precedence, making them different in this way would break the Principle of Least Surprise.

Reason for rejection

rescue etc in "do .. end" seem weird when a block consists a loop.

RCR 106: built-in init alias for initialize (transami, 2002-05-26 17:25:48)

Status: Rejected

can we just type "init" instead of "initialize", pretty please.

i know i can add an alias to Class myself, but then i'd have to include that change with every program i make public. besides, i think the whole community would be pleased to share in this very simple convenience. its the simple things, you know. 2

Comments

Pleased community? (marko, 2002-05-27 14:14:32)

The first couple of votes don't seem to be from a pleased community. ;-)

Honestly: I think there are too many aliases already.

marko

Use your editor (anonymous, 2002-05-28 06:56:39)

Every decent text editor allows you to define shortcuts to make typing more convenient - may I suggest to do it this way (I'm sure you can define a shortcut in EMACS and use it by pressing just seven keys at once ;-))

I see two important disadvantages of your approach (even the alias added to Class):

1. It makes code unreadable - the next one will use "int" instead of "initialize", another one may prefer "Klasse" instead of "Class". Obsfuscated ruby is difficult to write, but you are on the right way...

2. It will break old code, where a class may have a method named "init".

#2 is a good answer (transami, 2002-05-28 10:00:55)

#2, that's a very good point --that it may break old code.

it isn't a big deal obviously, i was up quite late over the memeorial day weekend banging out ruby code, and i had to type initialize once again, and thought "why was 10 a letter word used for something that every class has, and is generally referenced by a completly different method name of just 3 letters?

i think if i ever create a language i'll be sure to use Supercalafragalistic as a required method name. ;-)

so anyway just thought i'd throw it out there and see what would get thrown back.

but i always like constructive answers like yours. thank you.

how many? (transami, 2002-05-28 10:03:37)

curious. i didn't realize their were so many. how many would you say? is there a list of all the built-in aliases somewhere? that would be interesting to see.

I prefer #1... (patsplat, 2002-06-04 02:58:37)

making code readable is as important as backwards compatiblity (if not more...).

:initialize is a bear to type. I don't think the idea of an alias is bad, but :init isn't a good one.

:setup reads better than :initialize, but it's too entrenched in the ____Unit frameworks.

Any good synonyms?

~ Patrick

Existing aliases (marko, 2002-06-04 06:27:24)

I don't know how many there are, but here are some methods, that are synonyms for each other (not sure whether they are strictly aliases):

class Object:
- __id__ = id
- __send__ == send
- is_a? == kind_of?
- type == class
class Array
- indices == indexes
- map! == collect!
- length == size
- [] == slice
- to_a == to_ary
As in Array you will also often find methods which are only convinient shortcuts (first, last) and other methods which are similar but nor quite the same:
- at != []
- reject! != delete_if

As a hint: Searching for the word 'synonym' in /usr/lib/ruby/1.6/ri/ gives me 75 occurences in 42 (core) class and module descriptions.

marko

Re: built-in init alias for initialize (peterhi, 2002-06-05 09:58:57)

Are you really saying that typing initialize once per class is really that much of a chore?

How about all the keywords in every possible language so that we non american speakers do not have to learn a random collection of letters to program.

This is not an unreasonable request. If your first language is chinese then learning to program is much harder when it is a matter of stringing meaningless symbols from an alphabet you don't use.

Given the choice I would vote for the pre/post increment operator rather than this (possibly even const correctness :-) ).

synonyms (transami, 2002-06-13 11:47:01)

new, make, set, create, start, begin, launch, instate, teeoff

'new' makes the most sense but i believe comes into conflict with the class method. so if 'new' can't work there's plenty of others, though 'init' is pretty readable to me. and there's 'teeoff' if you like golf :-)

synonyms (haldane, 2002-06-19 06:47:30)

Perhaps we could have the UK English spelling "initialise"?

Actually I would vote for being able to use "init" partly because I always forget to use the american 'z' instead of an 's' in the unabbreviated word.

Reason for rejection

shorter name may cause more conflicts.

RCR 108: nil.to_f (transami, 2002-06-14 00:19:59)

Status: Rejected

NilClass#to_f always returning 0.0.

Comments

Explain (anonymous, 2002-06-14 15:51:33)

Why do you want this?

very useful, try typecasting cgi values (transami, 2002-06-14 19:00:49)

currently NilClass has .to_a, .to_s and .to_i. why not .to_f?

the utitlity of these methods comes up often. there are many times when a method returns a nil, for one reason or another. if it weren't for these above methods you'd have to write an exception routine in every case. these methods thus allow you to continue-on without missing a beat in many situations.

want a specific example? working with cgi: what if you're typecasting a cgi value that's not there? as with all hashes you get back a nil. in most cases you just want it to become 0 or '' or 0.0 and what have you.

i think its always good to offer ways for a program to move seemlessly through the most common cases. since nil arises so often, being able to deal with it in ways common to many other objects is quite helpful in this. in fact i modify NilClass to include a size and length method as well which also returns 0.

Alright (anonymous, 2002-06-15 04:18:11)

That makes sense. Either you have all conversions or none. I think I would prefer none, but having all of them is always better than having only some of them.

All object have to_a and to_s (cout, 2002-06-16 11:24:19)

The Object class defines default implementations for to_a and to_s. It does not define one for to_i, and I'm not sure why NilClass defines to_i. I think it should not define anything that Object does not unless there is a clear reason.

Why not do it yourself? (anonymous, 2002-06-17 05:04:22)

As you write, you know how to modify NilClass. Why don't you simply add the method yourself?

This is not worth an RCR.

i have and not so (transami, 2002-06-17 23:52:18)

i have done it myself! and do so for nearly all my projects. i have it in a general library which i call on.

i disagree. this is most certainly worth an rcr. in fact, it is quite basic and useful as i have explained earlier. not to mention that it is ridiculously easy to implement.

from a purely "idea" perspective what does NIL mean? zilch, zero, nothing, nata, etc. under different contexts this translates differently: a nil array [], a nil string '', a nil integer 0, a nil float 0.0. dosen't it make sense to have this "nil" representation for all the basic types?

the utility of this is obvious. why have to_s, to_a, or to_i in the first place? there's a good reason for those, because it makes a programmer's life easier.

instead of this:

if obj
obj = obj.to_s
else
obj = ''
end

one only requires:

obj = obj.to_s

when typecasting, the MOST COMMON CASE for dealing with a nil is to turn it into the "nil equivalent" as mentioned above. and that's why this makes a damn fine rcr.

Use Float (flori, 2002-06-18 09:36:39)

Why don't you use Float(nil)? It isn't necessary to implement to_x-Methods for every possible type x in NilClass.

that's a solution, but is also indicative (transami, 2002-06-18 14:44:03)

while not an object-oriented approach it certainly works. i did not know about these Kernel methods before. thanks for the spark.

but funny thing about these methods, they are indicative of this rcr. you'll notice that there are four such methods: Array, Float, Integer and String
Array and String use .to_a and .to_s verbatium. Integer, on the otherhand differs from .to_i to deal with leading radix indicators. (why dosen't .to_i handle them?) and our Float, well it uses .to_f in all cases EXCEPT NIL! in that case it returns 0.0. The fact that this EXECPTION is made is indicative that the NilClass should get a .to_f method.

so if you think about this for a moment, you realize that what we have here is just a slight bit of inconsistency and non-elegence which has led-to or come-from an unbalanced set of type conversion methods in NilClass (i.e. the lack of .to_f) and four nearly redundent methods in the Kernel Module. i thus suggest cleaning this up a bit by adding the .to_f to the NilClass and finding a better way to deal with Integer's radix handling. in so doing we'll have a consistent mapping between the Kernel methods and their .to_x counterparts.

on the contrary (transami, 2002-06-18 14:51:44)

on the contrary Object could very well use a .to_i method aliasing .id, consistant with the functioning of .to_a and .to_s. but it dosen't really matter if the Object class has these methods or not. they are intended to be overridden anyway.

Strong disagreement (anonymous, 2002-06-25 12:12:09)

I strongly disagree that NilClass should get a to_f method; I'm fairly unhappy that it has to_anything methods already.

This should NOT be the common case; the common case should be not dereferencing nil, ever.

When you get implict conversions from nil to anything, you end up with potential cascade errors. Ideally, you want to catch all errors at the points where they are made. Making nil non-convertible by default makes this happen in a lot of cases. Making it convert to 0.0 removes a lot of error cases in order to allow "drop through" syntax for the abnormal cases where it is correct. Yes, it is the most common of the correct cases, but it also removes a bugblocker.

I'm tempted to file the opposite RCR - remove the to_xxx methods NilClass already has - but I think that would probably be bad due to deployed code.

Eivind.

whose bug is it anyway? (transami, 2002-06-25 15:39:54)

you know, i've never understood this "bug" mentality. is this something you learn in academia? you're saying you'd rather an exception error be raised then have a default case?

i've been writing code for 20 years, in all varieties of language, and of the general principles i've developed, one of the highest on the list is simply not to raise errors! if i write a piece of code such that i arrive at a place where i need to raise an error, then i've written it wrong! of course, the real world is not always ideal and sometimes i let it go, but all in all i beleive this adage holds true:

if you have to raise an error, you've made an error.

so i say, give me as many common case defaults as possible. i'll deal with the excpetions.

p.s. i can only imagine what you would think of my modifications to the NilClass. how about this one:



          def []

          &nbsp;&nbsp;nil

          end

no more:
undefined method '[]' for nil (NameError)

Push the logic into an abstraction (RustyF, 2002-08-07 16:45:38)

I think in this instance I would probably create an abstraction for a "nullable field" that hides this recurring logic (been there, done that too). Such an abstraction may indeed be a good home for other behaviour such as default values, etc.

Reason for rejection

type conversion should be explicit.

RCR 109: Kernel conversion methods to use to_flt, to_int, to_ary, to_str (Nikodemus, 2002-06-25 03:07:42)

Status: Rejected

The four functional style Kernel conversion methods (Float, Integer, String, Array) are ideologically stricter than to_f, to_i, to_s, to_a, but there is no way to capture this difference in user-defined classes.

If functional style methods would use to_flt, to_int, to_str, and to_ary methods to convert their arguments users could utilize this distinction in their own classes.

Comments

more details (transami, 2002-06-25 03:31:45)

could you explain that a little more? what are the distinctions?

according to Programming Ruby by Thomas and Hunt, Array(arg) return arg.to_a, String(arg) returns arg.to_s, and Float(arg) returns arg.to_f with exception to nil. Integer(arg) is the only one that seems different in that it recognized radix indicators. is this not correct?

thanks.

details in ruby-talk (Nikodemus, 2002-06-25 06:26:33)

See for example:

Clean up the language (anonymous, 2002-06-25 13:58:15)

For me this is a bad idea. These four functions are completely weird and should be removed - so no attempts to make them more popular please.

If they were removed, the "new" shortcut could be implemented: "SomeObject.new(1,2)" would be "SomeObject(1,2)" instead.

ruby-talk leaves me searchless (transami, 2002-06-25 15:09:21)

you know, i've tried to search ruby-talk and i never get any results. don't know what i'm doing wrong. tried a number of things, but it never seems to work.

yes, i can go through the messages one by one, or even topic by topic, but that sure is alot of manual searching to find something.

thanks for the links on this topic though.

Ruby is not Python (anonymous, 2002-06-26 07:14:47)

IMHO, the "new shortcut" described above would also be a wart on the language. Keep the language as simple as possible, please!

Objects being allocated by class methods and initialised by instance methods is an elegant mechanism. It results in readable code. It also naturally supports multiple different "constructors" per class, unlike Python's allocation syntax.

Ruby should be C++ :) (tsuihark, 2002-06-26 12:48:06)

Actually, I like it because this way of constructing is used in C++ here and there and I used to like C++ :)

Hmmm, I just noticed that the shortcut might eventually lead to C++ madness. Never mind :)

Reason for rejection

user customizable String etc. is an interesting idea, bot not in the way described in this RCR.

RCR 110: IO subclass for string (anonymous, 2002-06-26 14:35:18)

Status: Rejected

I'd like to see a sub-class of IO that used a String for internal storage.

Something like:

a = IOString::new
a.puts "Hello, world!"
str = a.string # maybe a.to_s ?
puts str
b = IOString::open(str)
str = b.gets
puts str

I use this kind of structure to store parts of files in place, for parsing later... I created my own class to do this (which was easy, since I knew exactly which methods of IO I'd be talking to), but it seems like something that belongs in the standard library.

Comments

Try StringIO (matz, 2002-06-26 19:48:19)

..from RAA. It is bundled with 1.7 CVS snapshot.

matz.

Reason for rejection

use StringIO.

RCR 112: Ruby/Tk should not need env vars (anonymous, 2002-07-04 23:22:02)

Status: Rejected

(somewhat bluntly...): Neither Python nor Perl need to have the Tk libs in a env variable on Windows. Ruby should not have to either.

Comments

Oh, is THAT the solution?! (anonymous, 2002-08-25 17:11:29)

This isn't obvious from the docs, either. On linux, it just worked.

Reason for rejection

I'm not an Windows guy. Report your problem to comp.lang.ruby as a bug report, not RCR.

RCR 113: overload pack() method to return template length if receiver is empty (djberg96, 2002-07-12 16:51:50)

Status: Rejected

Currently, Ruby chokes if you try to call pack() on an empty receiver. However, there are times when you may simply want to get the template length for a future read() operation on binary data without having an array defined.*

# Desired behavior
template = 'A8 A8 A16 l'
[].pack(template) -> 40
# or perhaps just allow [].pack(template).length

# ...and later
f = File.new('/var/adm/wtmp')
while f.read(len)
...

Hardcoding the length in a read operation would be a bad idea since different platforms will return different values. Using my example above, Solaris returns 40 while BeOS returns 36.

See "Perl for System Administration, p.296" for an example of why this would be useful.

*It has been pointed out that Array.new(4).pack(template).length works, but that still requires that I know the record count in advance, which may not always be the case.

Comments

Is this the right solution? (cout, 2002-07-14 16:20:53)

It's not immediately obvious to be that [].pack(str) is going to return the length (if anything, I would expect it to return an empty string). Having a method return two different types depending on the state of the object it is called on (in this case Integer for an empty Array, and String otherwise) is generally not a good idea.

There are also some templates (such as 'M') for which the size of the returned string changes depending on the contents of the array; what should happen in a case like this?

I definitely see a use for what you propose, but I think that another solution (perhaps Array.packlen()) might be more appropriate.

If more methods are added to support packing/unpacking, it might also make sense to move these operations into their own module, instead of splitting them between the Array and String classes.

I see your point (djberg96, 2002-07-15 08:51:07)

Ok, that makes sense. Hmm...what about making pack templates their own class?

t = PackTemplate.new('A8 A8 A16 l')
t.length -> 40 (or whatever)

Thus, if you just send a string to 'pack()' it would automatically be understood as a PackTemplate object.

However, I can't think of any methods except for 'length' at the moment, so perhaps that's overkill. Feedback/ideas welcome.

perhaps... (cout, 2002-07-15 11:30:55)

p = Pack.new('llll')
s = p.pack([42,42,42,42])
p s #=&gt; "*

last post didn't work right (cout, 2002-07-15 11:35:11)

p = Pack.new('llll')
s = p.pack([[0x42424242,0x42424242,0x42424242,0x42424242])
p s #=&gt; "BBBBBBBBBBBBBBBB"
a = p.unpack(s)
p a #=&gt; [1111638594, 1111638594, 1111638594, 1111638594]
p p.length #=&gt; 16

This would probably work well as an extension.

I hereby nominate cout (djberg96, 2002-07-15 12:44:58)

This would probably work well as an extension

Yes, probably. Get started. :)

Seriously, this should probably be submitted as a separate RCR. I don't want to submit it because I don't think I'm qualified to write it.

Any takers?

Found something interesting in ruby 1.7 (cout, 2002-07-15 13:20:07)

require 'dl'
p DL.sizeof('C8C8C16l') #=&gt; 36

The spaces are not allowed, and I used 'C' instead of 'A', because 'A' can vary based on input.

Reason for rejection

'dl' may work for you.

RCR 117: File::Stat structure returned by FileTest methods (jfh, 2002-10-11 08:42:25)

Status: Rejected

Would it cause any problems if the methods in FileTest were modified to return a File::Stat object instead of 'true'? E.g., when doing a recursive comparison of two directory trees, it's nice not to essentially call stat(2) twice: Find.find(".") { |afile| st = File::stat(afile) newfile = "../othetree/" + afile if (nst = File.exists?(newfile)) if (st.ino != nst.ino) # do stuff end end } Same goes for directory?, etc...

Comments

File::Stat structure returned by FileTest methods (jfh, 2002-10-11 08:48:47)

Gak, sorry, I goofed

Would it cause any problems if the methods in FileTest
were modified to return a File::Stat object instead of 'true'?

E.g., when doing a recursive comparison of two directory trees,
it's nice not to essentially call stat(2) twice:

Find.find(".") { |afile| 
    st = File::stat(afile) 
    newfile = "../othetree/" + afile
    if (nst = File.exists?(newfile))
        if (st.ino != nst.ino) 
          # do stuff 
        end 
    end }

Same goes for directory?, etc...

?-suffixed methods should be predicates (flori, 2002-10-21 15:36:06)

I'm against this proposal. Methods with ?-suffix should be predicates and return boolean values. You could perhaps use exceptions to solve your problem:

Find.find(".") do |afile|
    begin
        st = File::stat(afile)
        nst = File::stat("/somewhere/#{afile}")
    rescue Errno::ENOENT
        next
    end
    if (st.ino != nst.ino)
        # do stuff 
    end
end

Reason for rejection

I think it ambiguate the function.

RCR 118: String.subs (jfh, 2002-10-11 09:28:59)

Status: Rejected

Would anyone be interested in seeing "subs", "subs!", "gsubs", and "gsubs!" methods?

Basically, they return false if the substitution doesn't happen:

class String
    def subs(pat, sub)
        tmp = self.sub(pat, sub)
        if tmp == self
            return false
        else
            return tmp
        end
    end
end

foo = "foo"

foo.subs("foo", "bar") # -> bar
foo.subs("moo", "bar") # -> false

The above ruby code would probably best be replaced with smarter C code in string.c .

Comments

sub! and gsub! (cout, 2002-10-15 10:20:38)

sub! and gsub! already return nil when the substitution does not occur:

[cout@localhost cout]$ ruby -v
ruby 1.6.7 (2002-03-01) [i686-linux]
[cout@localhost cout]$ irb
irb(main):001:0&gt; s = 'foo'
"foo"
irb(main):002:0&gt; s.sub!('foo', 'bar')
"bar"
irb(main):003:0&gt; s.sub!('moo', 'bar')
nil

Instead of using str.subs, you can instead use str.dup.sub!.

hmm (anonymous, 2002-11-27 07:59:12)

i think what he meant was a string - string substitution, even quoted the pattern is converted to a regular expression. and i agree with him, sometimes using a regular expression is overkill, and you just want to substitute a string with a string.

Reason for rejection

do not be afraid regex too much.

RCR 120: Should Ruby have static typing? (anonymous, 2002-11-18 13:44:09)

Status: Rejected

Should Ruby have static typing?

Note: please refer all future ruby-talk threads on this subject to the results of this RCR.

Comments

This has been discussed many times before (ysantoso, 2002-11-18 16:36:53)

Please see , and

No (anonymous, 2002-11-18 20:15:23)

If you want static typing then there are plenty of OO statically typed languages available to you. Java and C++ come to mind.

Start with the basics (anonymous, 2002-11-19 04:20:33)

To have static typing in Ruby you first need typing. Ruby is a dynamic language that does not type in an enforceable way (no const types, no typed method declarations and no static types).

And lots of people like it that way and it has been done to death so explore the links that have been suggested and if you have something further to add then go ahead.

Importantly why do you feel you need a static variable when you have instance variables?

Re: Start with the basics (anonymous, 2002-11-19 04:22:49)

Oh and as a final point you might like to look at Sather which has strong typing na dis also very OO.

A nice language (not sure about statics though).

static typing != static variable (cout, 2002-11-19 13:21:42)

A static variable is analgous to a class variable.

Static typing just means that the type of the object it known at compile-time; in Ruby it is not known until well into run-time.

Sather and others (cout, 2002-11-19 13:27:20)

Sather does look really nice. It's almost like a cleaner C++.

I hear that OCAML also has static typing, but has a type system that makes static typing more than bearable. I've never used it, though, so I can't elaborate.

Reason for rejection

No, it should not.

RCR 122: uncatchable Deadlock exception (aidenc, 2002-11-21 15:34:56)

Status: Rejected

It currently looks like a 'fatal' (i.e. uncatchable) exception is raised when a thread deadlock condition occurs... Of course, this makes debugging these problems a real chore. Would it be possible to change the behavior to raise a Deadlock exception in every locked Thread around these conditions? It'd make it a lot easier to diagnose what's gone wrong.

Here's some example code:

#!/usr/local/bin/ruby

require 'Thread'

m = Mutex::new
m.lock
Thread::new(0) { |i|
  puts 'before'
  sleep(2)
  puts 'after'

  this_will_raise_an_uncatchable_exception

  m.unlock
}

begin
  m.lock
rescue
  puts "Hey!"
end

Reason for rejection

I don't understand this RCR. What is the difference between fatal and uncatchable Deadlock exception?

RCR 124: New Root for Class Hierachy (anonymous, 2002-11-25 13:51:26)

Status: Rejected

I sometimes would like to have another, more pure, root for class hierachy than Object. Object is pretty fine for everday usage and the predefined methods are needed in there, but sometimes they interfere with what I try to do. Sometimes I would like to base on a class with no predefined methods.

This is USEFULL! I would like to create a mock object, which simply notifies me when a method is called and then hands this call over to the real object the programm was expecting. I can do this with a hack similar to delegates.rb. But it would be much easier and elegant if I could just inherit from, say, Entity and override a private method method_call(:method_name,*args) and state there what to do.

Comments

Second (anonymous, 2002-11-26 16:46:43)

I second the notion of a "clean" object to build upon. i imageine a few methods would still need to remain, but a bare minimum object would indeed be useful.

for the hell of it i'll also throw in here that i think a networkable proxy object would be cool too.

agreed! (anonymous, 2002-11-27 05:24:28)

Yeah, a clean slate would be nice. the ruby flavored root is nice, but theres always a special case.

for all intensive purposes... (anonymous, 2002-11-27 07:48:45)

you can use remove_method, and an iteration through .methods, would make a suitible basis for a mock object, i guess.

SmallTalk (anonymous, 2002-11-27 20:23:46)

Seems like a good idea.

Doesn't SmallTalk have something like this?

FYI (anonymous, 2002-11-28 12:00:01)

It's "intents and purposes", not "intensive purposes".

KernellessObject (devEiant, 2002-12-01 03:41:47)

Check out the KernellessObject hack from the excellent RubyTreasures collection:

This may do more than what you want (ie., not only does it not inherit from Object, but it doesn't include the functions from Kernel, either), but it may give you a point from which to start.

This makes code unreadable and difficult to maintain (anonymous, 2002-12-12 17:19:39)

Are the kernel method names really so difficult to avoid? If you want more elegant delegation, request more elegant delegation. There's no need for an atomic bomb where an axe will do.

Reason for rejection

See technique used in delegate.rb for such purpose.

RCR 125: regex search in Array of Strings (anonymous, 2002-11-25 13:54:47)

Status: Rejected

I'd like to see an instance method in Array called =~ (just like String) that returns the array index that the string/regex is found in.

OK, first off... I'm a new user, but I didn't see this posted previously... sooooo:

I'd like to see an instance method in Array called =~ (just like String) that returns the array index that the string/regex is found in.

That way I can keey the file in an array of strings form from beginning to end instead of flattenning and splitting.

eg (ommitting error-handling). Old style:

data = IO.readlines(filename)
data.flatten!
startindex = (data =~ BEGIN_BLOCK) + BEGIN_BLOCK.length
endindex = (data =~ END_BLOCK) - 1

data = data[startindex..endindex].split("n")

yadda...yadda...yadda

New style:

data = IO.readlines(filename)
startindex = (data =~ BEGIN_BLOCK) + 1
endindex = (data =~ END_BLOCK) - 1
data = data[startindex...endindex]

This isn't anything major, it would just make stuff slightly cleaner. Additionally, you could do something wacky like this...

lineIndexArray = (data =~ search)

Where lineIndexArray[0] is the index into the parent array, lineIndexArray[1] is the index into the subArray, etc. And if the the Array is a single dimension, it would simply return an integer instead of an array...

Comments

I'm not sure what the problem is you're solving (DavidBlack, 2002-11-25 18:17:08)

Hi --

I'm not sure what you mean about flattening and splitting and all that. If you read a file in with readlines, flattening it won't do anything; that is, you've got a strictly one-dimensional array of strings, so you can't flatten it.

If you're looking to get part of an array based on regex matches, you could use the "flip-flop" operator:

a = %w{one two three four five}
a.select {|e| e if /two/.match(e) .. /four/.match(e)}
# => ["two", "three", "four"]

So many choices (whytheluckystif, 2002-11-26 14:05:09)

See, if I saw usage of Array.=~, I would think that it should somehow return both the index in the Array and the index in the String of the match. Or simply returning the matching string:

  class Array
    def =~( re )
      detect { |x| x =~ re }
    end
  end

If I were doing a BEGIN_BLOCK and END_BLOCK sort of matching, though, I can think of a lot of other Ruby methods that could work out nicely. The flip-flop in David's example seems exactly what you're looking for. Use with each_line:

  buf = ""
  File.open( filename ) { |in|
    in.each_line { |l|
      buf 

You also don't need to flatten.  It's alot more efficient for you to use IO#read directly (depending on your file sizes you may want to read in blocks).

So many choices (whytheluckystif, 2002-11-26 14:05:49)

See, if I saw usage of Array.=~, I would think that it should somehow return both the index in the Array and the index in the String of the match. Or simply returning the matching string:

  class Array
    def =~( re )
      detect { |x| x =~ re }
    end
  end

If I were doing a BEGIN_BLOCK and END_BLOCK sort of matching, though, I can think of a lot of other Ruby methods that could work out nicely. The flip-flop in David's example seems exactly what you're looking for. Use with each_line:

  buf = ""
  File.open( filename ) { |in|
    in.each_line { |l|
      buf &lt;&lt; l if BEGIN_BLOCK.match(l) .. END_BLOCK.match(l)
    }
  }

You also don't need to flatten. It's alot more efficient for you to use IO#read directly (depending on your file sizes you may want to read in blocks).

extending to other RE related methods (anonymous, 2002-11-28 08:53:51)

If you'd ever put something like this in Ruby itself, methods like scan() should have a proper meaning, too.

Returning indices by themselves is not the way to go. The matching strings would be required, at least.

index() should return the array index and the string index... It gets a bit unclear what the benefits are for this.

Reason for rejection

use "grep" or "select".

RCR 127: What ever happened with 'Design by Contract' being implemented in the Ruby inter (ktethridge, 2002-12-01 00:47:31)

Status: Rejected

There are a few references to 'Design by Contract' in Ruby on the web. I was wondering if anything is being done with that. Is this maybe going to be implemented into the Ruby interpreter in a future release?

Comments

Where? (anonymous, 2002-12-12 23:54:03)

Could you point out an example of these references? Frankly, I'd be happy with a kernel method called "assert(&block)". Only thing: Assertions must be checked by default, and you should need to provide an explicit "don't check assertions" flag.

Also, it would be really cool to have some way to prevent assertions from having observable side effects. Of course, this MAY be slammed as tantamount to the halting problem, but it's not quite. There are steps you can take. For example, don't allow IO access or writes to non-local variables. Attempts could unwind the check and raise an AssertionSideEffectError at the point of the assert statement.

example usage:

class Fraction
def initialize(numerator, denominator)
assert {denominator != 0}
end
end

proof-of-concept hack (anonymous, 2003-05-08 10:30:47)

http://www.pragmaticprogrammer.com/ruby/downloads/dbc.html

Reason for rejection

As long as it can be done in the library level, I won't put it in the core. If you come up with great idea which should be in the core, submit new RCR please.

RCR 128: require default index.rb (anonymous, 2002-12-08 16:35:02)

Status: Rejected

a nice shorthand for require would be to allow it to accept a directory as an argument such that a library located in a subdirectory would be required either by defaulting to a file with the same name as the directory or by looking for an index.rb file/link.

a nice shorthand for require would be to allow it to accept a directory as an argument such that a library located in a subdirectory would be required either by defaulting to a file with the same name as the directory or by looking for an index.rb file/link.

For example:

mylibrary/
  mytool1/
    mytool1.rb
    index.rb --> mytool1.rb

rather than having to do this:

require 'mylibrary/mytool1/mytool1'

one could simply do:

require 'mylibrary/mytool1'

how is this useful? i've just compiled a set of libraries into a single group call xmltoolkit which has a number of subdirectories for the various differnt tools. so the above would make it less redudant to require the specific tool.

i've also noticed that often ruby libraries have an include file that does nothing more that require another file located in a subdirectory. the above would generally alieviate the need for this.

Comments

Re: require default index.rb (matz, 2002-12-09 02:55:34)

Why not prepare mytool1.rb instead of index.rb, which requires mytool1/mytool1.rb? i.e.

mylibrary/
  mytool1.rb
  mytool1/
    mytool1.rb

No extension, no new feature, rather simpler.

efficiency (minor but nonetheless) (anonymous, 2002-12-09 11:35:11)

funny thing about your example: that's one of the things that i was saying i thought was so redundant. look at what you've typed, i see mytool1 typed three times and this technique involves a whoele extra file just to work such that ruby has to require a file just to require another file.

i know that its not that big of a deal, but wouldn't it simply be more efficient for ruby to have some sort of default behavior like i described? finding the file with the same name as the directory that is being required is rather trivial. currently this is something that would throw an error. its always nice see a uselessness become useful. and in this case it has some advantages, albiet minor, without backward incompatability.

so in short, this is useful both as a shortcut for coders and in execution efficiency compared to requiring a file to require another, and it is very simple to implement without breaking backwards compatability.

If you feel that strongly about it.... (chadfowler, 2002-12-10 09:52:10)

You could always just add something like this trivial bit to your code (untested):

alias :require_copy :require

          

          def require(file_or_directory)

          if File.stat(file_or_directory).directory? then

          require_copy "#{file_or_directory}/index.rb"

          else

          require_copy file_or_directory

          end

          end

Re: efficiency (matz, 2002-12-11 23:14:19)

But you can reduce only several strokes by this feature. I don't think it's worth for a new feature.

matz.

thanks (anonymous, 2002-12-22 00:20:19)

i'll add that to tomslib/rubylib.

thanks!

the lcd code standard in ruby libs (patsplat, 2003-01-31 17:56:15)

is to have a file with the same name as the directory:

mylibrary/mytool/otherfiles*.rb
mylibrary/mytool.rb

then:

require 'mylibrary/mytool'

~ Patrick

Reason for rejection

No need to add this. Use top .rb file to require files in the directory.

RCR 131: debugging END{} section (anonymous, 2002-12-19 07:27:33)

Status: Rejected

It would be nice to be able to set breakpoints in BEGIN{} and END{} sections.

I've tried to debug the BEGIN{} and END{} sections of source file. I can step into the BEGIN one but never the END one. I think it could be interesting to allow to step into or put valid breakpoint in the END{} section. For example I've tried to debug the rubyunit.rb by two way : putting a breakpoint in the END{}section or trying to step into it, but none of the solutions run. it's a pity because it's an entry point and I've lost a lot of time to understand that it was there.

Reason for rejection

It's a bug to be fixed, not RCR.

RCR 133: Require quirks (anonymous, 2003-01-11 20:50:00)

Status: Rejected

I would like ruby to be able to 'require' files from the dir where the currently active file is located without specifying the full path.

Let's say i have the following files:

foo/foo1.rb
foo/foo2.rb
bar/bar.rb

Now, if i wan't to require foo1.rb from bar.rb i can just write:

reqire '../foo/foo1.rb'

The problem is that if foo1.rb requires foo2.rb like this:

require 'foo2.rb'

(Which it should be allowed to do since that's how you normally require files that resides in the same dir as the current file.) Ruby tries to search for the file foo2.rb in the directory where bar.rb is located. Ofcourse, i could just add everything i $LOAD_PATH or whatever, but then how can i descide which file to require when i have two files with the same name in different dirs? I can use module to create namespaces on a class-level, but i can't use dirs to create namespaces on a file-level. I've seen some discussions on this in the mailinglists, but no conclusions.

Comments

reasonable solution? (anonymous, 2003-01-12 03:13:56)



  alias :require_basic :require

  def require(file_or_directory)

      fp = File.dirname(File.expand_path(file_or_directory))

      $:

note (anonymous, 2003-01-12 03:16:25)

by the way, i posted this on the mailing list just today. in this version i removed the 'directory require' part that is in the original, if your wondering why the argument is called file_or_directory.

lets try this again (anonymous, 2003-01-12 22:50:20)



  alias :require_basic :require

  def require(file_or_directory)

      fp = File.dirname(File.expand_path(file_or_directory))

      $:

keeps cutting off, once again (anonymous, 2003-01-12 22:52:08)



  alias :require_basic :require

  def require(file_or_directory)

      fp = File.dirname(File.expand_path(file_or_directory))

      $: &lt;&lt fp

      r = require_basic file_or_directory

      i = $LOAD_PATH.index(fp)

      $LOAD_PATH.delete_at(i) if i

      return r

  end

finally! (anonymous, 2003-01-12 22:53:21)

okay there's the code.

Re: Require quirks (cout, 2003-01-12 23:25:36)

Ruby tries to search for the file foo2.rb in the directory where bar.rb is located.

Actually it tries to search for the file in the load path ($:). The current directory ('.') is in the load path, so that's one of the places it looks.

I think that being able to require a file from the same directory as the file doing the requiring is a good idea. I've been too lazy to write an RCR, so I'm glad someone else did.

Note that this RCR is not without a caveat. You shouldn't be able to write require 'foo2.rb' to get foo2.rb. It would be too easy for someone to place a rogue socket.so in the same directory as foo1.rb; then when foo1.rb goes to require 'socket', it would get the rogue socket.so instead of the real one. That is why '.' is at the end of the load path; it's a security measure.

I think a reasonable solution to this shortcoming is to use a different name than require. In RubyTreasures, I used requirelocal:

requirelocal does a few things:
- It loads files from the same directory as the file doing the requiring.
- If that file is a symlink to another file, and other file has already been loaded, the result will be a no-op.
- After loading loaders.rb, all files get required with their full path. Thus, if you requirelocal a file, then require the same file, the require will be a no-op.

Helpful for web sites and testing (anonymous, 2003-01-14 12:28:44)

This is a great idea for situations where multiple developers are working on a single server with something like mod_ruby. It can be inelegant or difficult to set $: for each sandbox.

Maybe the "local" behavior could be an additional modifier to require.

The downside of it is that the normal $: behavior is really more appropriate in most cases, and it would be bad if it crept into library modules by accident.

maybe this... (anonymous, 2003-01-14 23:36:01)

require ((File.dirname __FILE__) + "/blah.file.thing.rb")

(This is actually my normal file naming convention. I still can't find work for some reason :-/)

I'm afraid (matz, 2003-01-16 01:15:40)

that this proposal may increase chances for potential name crash.

matz.

name crash? (djberg96, 2003-01-16 14:20:50)

Can you please elaborate on how this would happen? An example perhaps?

This hasn't been a problem with Perl, and in the case where people choose generic names, the problem is solved by created a toplevel module and putting the .rb file in its own directory.

e.g.

Bar::Foo would go into the sitelibdir under bar/foo

Cab::Foo would go into the sitelibdir under cab/foo.

This requires a bit of responsibility on the part of the Ruby community when it comes to their install scripts, but I'm the trusting sort...

updated version, for what it's worth (anonymous, 2003-01-17 20:19:40)



  alias :require_basic :require

  def require(file_or_directory)

    begin

      fp = File.expand_path(File.dirname(caller[0]))

      $: &lt;&lt; fp

      r = require_basic file_or_directory

      i = $:.index(fp)

      $:.delete_at(i) if i

      return r

    rescue

      if FileTest.directory?(file_or_directory)

        require "#{file_or_directory}/#{file_or_directory}"

      else

        raise

      end

    end

  end

conflict (anonymous, 2003-01-21 22:47:01)

i have just run into a potential conflict. i named a file to require 'DBI.rb'. of course the problem with this is that it could require the typical DBI library rather than my file. of course i could rename my file to avoid the conflict, but that means being very aware of the fact that DBI exists. so i have taken up the alternate suggestion:



  def import(file_name)

    require File.join(File.dirname(caller[0]), file_name)

  end



  alias :require_basic :require

  def require(file_or_directory)

    begin

      require_basic file_or_directory

    rescue

      if FileTest.directory?(file_or_directory)

        require "#{file_or_directory}/#{file_or_directory}"

      else

        raise

      end

    end

  end

Reason for rejection

this proposal may increase chances for potential name crash.

RCR 134: propagating comparisons like Python (slukejones, 2003-03-06 13:55:51)

Status: Rejected

It would be handy sometimes if Ruby had the kind of comparison-result propagation that Python has, where a == b and b == c and c == d can be written as a == b == c == d.

Comments

Re: propagating comparisons like Python (anonymous, 2003-03-07 03:33:44)

Icon has this too and it took quite some time to get out of the habit of writing

if 1 <x <= 100 then

It would improve code readability if we could do this. Im not sure if this was a piece of sugar added to Icon or some property of the language.

Note that ranges dont help in the above example only in the 1 <= x <= 100 or 1 <= x <100 type conditions.

references (cout, 2003-03-11 16:26:36)

See: []
There was also another thread on ruby-talk in which this was discussed, but I wasn't able to find it quickly. Does anyone else know which thread it was?

propagating comparisons like Python (oinkoink, 2003-03-17 18:28:45)

This chaining of comparisons (especially
for inequalities) is really the only thing I miss about Python. Ruby is a
more interesting and more powerful language, but the ability to chain
comparisons is definitely cool!

Gotchas (lennon, 2003-03-18 07:40:45)

Personally, in my 3+ years of writing Python code, I never chained comparison operators the way you describe above. The reason was simple: I don't like forcing the language to resolve a statement that looks ambiguous to the human eye, and I don't like seeing shortcuts that eliminate important semantic cues.

Really, when you write a == b ==c, what you mean is probably a == b and b == c, which is much more explicit to someone reading your code, and doesn't force weird parsing magic to turn a simple binary operator like '==' into a "magic" operator that chains any number of operands when repeated.

Re: Gotchas (chaining comparisons) (oinkoink, 2003-03-18 12:08:55)

The idiomatic use of this is something
like:
if 3 <= x <y <5 then foo(x,y) end
This sort of chaining a <b <c <d as
a single statement reflects normal mathematical
practice (and is therefore desirable to
like me, who are chiefly interested in
mathematical applications).

Weird parsing magic (anonymous, 2003-05-09 07:57:08)

Really, when you write a == b ==c, what you mean is probably a == b and b == c, which is much more explicit to someone reading your code,

I don't think so. For example, if you say 1 twenty dollar note is the same as 2 ten dollar notes or 4 five dollar notes, you're saying they're all the same. You're not saying merely that '(1 twenty == 2 ten) && (2 ten == 4 five)'. After all, '==' means an equality relationship and equality is meant to be transitive, so that (a == c). Which implies that what you really mean is '(a == b) && (b == c) && (a == c)'. And suddenly that's more to type, and harder to read. To anyone who knows high school maths well, (a == b == c) expresses it most succinctly.

and doesn't force weird parsing magic to turn a simple binary operator like '==' into a "magic" operator that chains any number of operands when repeated.

Yes, it might be awkward, but I'd imagine plenty of Ruby syntax forces weird parsing magic. The point is that languages are better when they express things naturally. And in maths, you naturally express 2 <x <12 without worrying about the fact that "<" is a binary relationship.

It's the kind of thing that we don't need really that often, but makes code less like its own syntax that we have to learn for no reason.

Junctions? (tcfelker, 2003-05-24 00:46:19)

You could do something like this with junctions ala Perl 6: all(a, b, c) == d. Is there anything like this in Ruby? (If not, it might be easy to add to Object.)

Reason for rejection

It's hard to define semantics of chain comparison in "OO" way. Use ranges instead.

RCR 135: Versioning support in 'require' (ser, 2003-04-08 12:57:40)

Status: Rejected

Versioning support in 'require' would be useful. There are hacks that can be used to support multiple versions of the same library on one machine, but they generally do not support dynamic ranges of versions, and the support is not native to Ruby.

Portage's versioning mechanism is an excellent model:

        # In all examples, the require mechanism attempts to use
        # the most recent (latest) version installed that satisfies
        # the requirements.

        # Require exactly version 1.0.1 of package1
        require 'package1', "=1.0.1"
        # Require a version of package2 that is greater than 1.0, but
        # less than or equal to 2.1.1
        require 'package2', '&gt;1.0, &lt;=2.1.1'
        # Require a version of package3 less than version 2.0
        require 'package3', '&lt;2.0'
        # Use the latest version of package4 installed
        require 'package4'

This syntax is basically a simplified version of:

        require 'package2', Version::range("&gt;1.0, &lt;=2.2.2")

or

        require 'package2', ( Version.new("1.0") .. Version.new( "2.1.1") )

This should be part of the standard Ruby installation, rather than an add-on, since it affects core Ruby functions (locating and loading resources), and is of most use to third-party applications (as opposed to local scripts).

Comments

I second the nomination.... (anonymous, 2003-04-08 14:28:11)

Versioning should be included in the standard installation, and soon.

In the meantime, why not distribute your versioning code?

Suggestion: What if we made a 'require' that not only takes a file to require, but can also take a block:

require 'package' { |ver| ver> 0.1.1 }

where 'ver' comes from the filename', like:

package-0.0.1
package-0.1.0
package-0.1.1

All three of these would be evaluated for requirement, but package-0.1.1 would be chosen.

So essentially, this overidden 'require' would look in the standard lib (and other libs which are specified by -I, for example) for 'package*' and there would be an array of package names built:
['package-0.0.1','package-0.1.0','package-0.1.1']

The filenames in this array would be split on '-' and the version number passed to the block (it could be some kind of special Version type that gets passed to the block). The block is then used to determine which one of these packages is chosen.

I think this approach would:
1) be easy to implement (because there is not much parsing to do)
2) be powerful because you could put any kind of expression you'd like in the block.
3) be the Ruby way to do it (using a block) ;-)

As a policy, we would have to require that the versioning of a filename should be specified after the '-'.

versioning, will this help? (HughSasse, 2003-04-11 10:35:23)

I wrote a while back, if that helps...

Hugh

Code (anonymous, 2003-04-12 13:34:22)

Here's some code for doing this...

#####################################################33
# Version - takes a string in the form: 'X1.X2.X3...Xn'
# (where 'Xn' is a number)
# #####################################################
class Version
  include Comparable
  def initialize(str)
    @vs = str.split('.').map!{|i| i.to_i}
  end

  def [](i)
    @vs[i]
  end

  def to_s
    @vs.join(',')
  end

  def (other)
    if other.class == String
      other = Version.new(other)
    end
    @vs.each_with_index { |v,i|
      unless v == other[i]
        return v  other[i]
      end
    }
    return 0
 end

end

module Kernel
  def require_with_ver(file,&b)
    if b 
      dir = ""
      files = []
      $:.each {|dir|
        files = Dir[file+"*"]
        if files.length&gt; 0
          break
        end
      }
      p files
      files.each { |f|
        if b.call(Version.new(f.split('-')[1].split(/.rb/)[0]))
          puts  "require '#{f}'"
          require f
          return
        end
      }
    else
      require file
    end

  end
end


if $0 == __FILE__
  $:  '0.1.1' }

end

Let's try that again (anonymous, 2003-04-12 13:42:38)



#####################################################33

# Version - takes a string in the form: 'X1.X2.X3...Xn'

# (where 'Xn' is a number)

# #####################################################

class Version

  include Comparable

  def initialize(str)

    @vs = str.split('.').map!{|i| i.to_i}

  end



  def [](i)

    @vs[i]

  end



  def to_s

    @vs.join(',')

  end



  def (other)

    if other.class == String

      other = Version.new(other)

    end

    @vs.each_with_index { |v,i|

      unless v == other[i]

        return v  other[i]

      end

    }

    return 0

  end



end



module Kernel

  def require_with_ver(file,&b)

    if b #block_given?

      puts "block given"

      dir = ""

      files = []

      $:.each {|dir|

        files = Dir[file+"*"]

        if files.length&gt; 0

          break

        end

      }

      p files

      files.each { |f|

        if b.call(Version.new(f.split('-')[1].split(/.rb/)[0]))

          puts  "require '#{f}'"

          require f

          return

        end

      }

    else

      require file

    end



  end

end





if $0 == __FILE__

  $:  '0.1.1' }



end

Native to Ruby (ser, 2003-04-14 15:26:09)

In the meantime, why not distribute your versioning code?

There are numerous third-party solutions to this problem. I could have simplified the RCR by simply requesting "Native support for 'require' versioning in Ruby", but I thought I'd mention how I'd like it to look :-).

The problem with third party add-ons is that this really is something that should be native to Ruby, since the final solution will probably affect how packages are installed. Any short-term solution (that I can think of) would break existing Ruby apps. Consider your proposed solution (or mine, for that matter): how would an older app that simply called:

require "package"

work? Wouldn't installing this system require that all libraries be re-installed with a special directory structure?

Reason for rejection

Although I admit usefulness of library versioning, I feel this RCR itself won't work well. Need more investigation.

RCR 137: break/continue accept numbers (anonymous, 2003-05-14 11:39:40)

Status: Rejected

I would like to see something like this:

while (true) {
  do something
  lines.each do |x|
    break if isA
    break 2 if isB
  end
end

"break" or "break 1" will exit from the innermost loop, and "break 2" will jump out two loops.

This should also apply to "continue". "continue" or "continue 1" will re-enter the innermost loop, and "continue 2" will re-enter the second-innermost loop.

Exceptions should be thrown when the number is invalid. This might be dangerous in some cases. However it also buys us some flexibility.

Comments

Interesting... (cout, 2003-05-14 16:48:13)

But why would we want to add "break 2" to the language when throw/catch is already sufficient? Is there another case that I am missing?

Alternative Suggestion (JimWeirich, 2003-05-15 15:50:44)

Numbered breaks are too fragile. Adding or removing a nested loop would require the numbers on all the breaks to change.

I would prefer named breaks, where a block of code gets a label and the break names the block of code that it will exit. I propose spelling this new break as 'throw :label', and to spell the block label as 'catch(:label) { }'.

Matz, how soon can you have these changes ready. ;-)

Matz is finished already!? (anonymous, 2003-05-17 07:50:27)

Don'T we have this functionality already?

not about throw/catch (anonymous, 2003-05-19 11:27:13)

Even "break 2" can be achieved by a lot of different techniques, "break 2" is more intuitive about what the code is doing.

different story (anonymous, 2003-05-19 11:40:24)

"Adding or removing a nested loop would require the numbers on all the breaks to change." is not what I would predict.

let's look at this code:



loop1: for(...) {

  loop2: for(...) {

    beak 2

  }

}

"break 2" will need change only when you insert another loop between loop1 and loop2 to make it



loop1: for(...) {

  newLoop: for(...) {

    loop2: for(...) {

      beak 3

    }

  }

}

IMO, This doesn't happen as often as we imagine. And even if it happens, people definitely need to carefully review the code that jumps out of the outer loop.

New feature in Ruby 1.7 (anonymous, 2003-05-21 05:53:02)

Found something somewhat related on:
http://www.pragmaticprogrammer.com/ruby/new_features.html

1.7 -- 'break' and 'next' now take an optional expression, which is returned by the enclosing block

Could be used to signal the outer loop to break from the inner loop without using variables.

Too Subtle? (JimWeirich, 2003-06-02 11:14:34)

Sorry, I was trying to make a point by showing that the functionality already existed. (It was also a very indirect reference to Guido's time machine, which I assume Matz must have borrowed on occasion.)

Same Story (JimWeirich, 2003-06-02 11:20:29)

[...] "break 2" will need change only when you insert another loop between loop1 and loop2 [...]

Break 2 would still need to change anytime an enclosing (is that a better term than nested?) loop is inserted between the loop targetted for exit and the break statement.

Actually... (anonymous, 2003-06-10 06:57:38)

IMO, This doesn't happen as often as we imagine. And even if it happens, people definitely need to carefully review the code that jumps out of the outer loop.

I frequently find myself refactoring code in a way that would cause this kind of change - a classic example is splitting a loop into two subloops:
for val in 0..99 do
becomes:
for digit1 in 0..9 do for digit2 in 0..9 do val = 10*digit1 + digit2
Except then I have the individual digits available. Another example happens when I restructure a container object to become a list-of-lists, rather than one big list.

Besides, symbolic names are always nice to let you know why you're going back to a specific spot. After an appropriate choice of symbol names, you may not even need comments.

However, catch/throw doesn't quite do what I want. I personally think perl does the right thing in this regard - I want optional symbol labels on break, redo, next and retry. I have found that these features of perl (labeled redo and next) enable very readable code in certain situations.

Reason for rejection

use catch/throw for this purpose.

RCR 141: Matrices and Matlab syntax to Ruby (ruby_on_wall_st, 2003-05-29 17:36:58)

Status: Rejected

I would like Ruby to resemble Matlab as far as possible in terms of the syntax for dealing with vectors and matrices. In particular the operators ':' and 'end' as indices are useful. It would be nice if the notion of at least of a 2D matrix (like NArray) were a native (for speed) Ruby type like Float. The standard operators should also work with 2D matrices as they do in Matlab. Combining Matlab's built in matrix language with Ruby's networking and database access capabilities would make it even a better data analysis language.

Comments

Examples? (anonymous, 2003-06-03 18:44:49)

For those of us who haven't used Matlab before, can you offer some examples?

matlab syntax (anonymous, 2003-06-11 13:41:48)

Matlab has an incredibly expressive syntax for manipulating multidimensional arrays, which,
I think it is fair to say, hinges around the concept of using another *array* as the indexing variable. For example, in 1D, we
can make an array

> N=1:10
N=[1 2 3 4 5 6 7 8 9 10]

Then lets do an elementwise squaring:
> M=N.^2
M=[1 4 9 16 25 36 49 64 81 100]

Now consider the expression M(10:-2:2).
The -2 is the step used in going from 10 to 2
so the 'index' evaluates to the array
[10 8 6 4 2]. These are then used as indices into M:
> M(10:-2:2)
[100 64 36 16 4]

The arrays don't have to be ranges and they
don't have to be one dimensional. Eg,
a literal 2D matrix can be written as
[1 2; 3 4] where the ; means 'next row'.
So you can write

>M([1 2; 3 4])
1 4
9 16

You can even do assignment in the same way:
say we want to negate certain elements of M:

>M([3 6 9])=-M([9 6 3])
M=[1 4 -81 16 25 -36 49 64 -9]

All of this generalises to higher dimensional
matrices very neatly and make Matlab
very good for writing complex numerical
algorithms. It's it pity the rest of the language
is so primitive; hence the need for matlab
like syntax in more powerful languages like
Ruby, Scheme or Python.

it should go in a library, not new ruby syntax (denis, 2003-06-12 08:03:03)

I dont like the idea of extending ruby syntax for supporting such features. Ruby syntax should stay simple.

What is needed for that is a separate library, maybe a mixin module that would extend Array like classes (NArray, Vector...) with better indexing methods.

Reason for rejection

it should go in a library, not new ruby syntax

RCR 142: Return multiple values from subroutines (anonymous, 2003-06-04 09:44:52)

Status: Rejected

IMHO, if a subroutine can conveniently return multiple values to the caller, some code can be more intuitive and optimized. Besides the return value, a sub sometimes produce other artifacts which interest the caller. Currently, to obtain those byproducts, the caller either have to make a call to another method which will be redundant, or store the byproducts in a class/instance method, or create some dummy class to store both the returning value and byproducts. None of those are good from the code readability or performance perspective.

Here is what I expect to do:

def isSomethingAvailable
  something = a_long_call
  produce("something", something)
  return !something.nil? 
end

def caller
  if (isSomethingAvailable) 
# not necessary to call a_long_call to get it.
    something = consume("something")   
  end
end

Comments

Don't see the need for change (root, 2003-06-04 09:47:31)

In your existing example, the method can simply return "something" -- if it is nil, then it isn't available.

In the general case, methods can already return multiple values: they come back as an array:



   return 1,2

let's look at this (anonymous, 2003-06-04 14:28:43)

The example I gave is not good. there happens to be a good alternative approach.

Your suggestion of returning array is valid regarding to offer a solution. However, my concern is the subroutine semantics (signature, returning type) remain the same, meanwhile, it has more flexibility to pass information back to the caller.

Re: let's look at this (HughSasse, 2003-06-05 08:30:20)

Well, I'm not clear what more information you would want, but don't forget that for a given method (or subroutine) you can do



   if block_given?

      blockresult = yield more information

   end

   return result

then you can use a block to do the extra testing.

non fatal errors (anonymous, 2003-06-05 10:25:18)

The information I want to pass back include:

1. reusable resources obtained in the called method like DB connections;

2. non-fatal error code / message, the called method doesn't have to know how they should be logged, reported;

3. various byproducts which might interest some caller;

...

Stateful objects and OO design (lennon, 2003-06-05 17:42:59)

I'm confused about why you wouldn't want to just use instance variables for this -- the kinds of information you're talking about storing is exactly what instance members and accessors encapsulate very well.

Perhaps a better way to factor the above code, or anything following a similar pattern, is like this:

class MyClass

attr_reader :something

def isSomethingAvailable
  @something ||= a_long_call
  return !something.nil?
end # isSomethingAvailable

end # class MyClass

That way, you get the benefit of seperately accessing the truth value of isSomethingAvailable, and don't require any funky changes to the language semantics.

Or, just implement produce and consume as module-level methods which use a singleton dictionary/queue/other collection of your choice, and allow arbitrary values to be "tagged" and passed from call to call, object to object, or module to module.

I could definitely see applications where that sort of top-level messaging system could be useful, but I don't think it requires language extensions. You could look at the global DataBus in FreeRIDE for one example of how to implement such a system on top of "vanilla" Ruby.

instance variables work for some cases. (anonymous, 2003-06-06 12:07:40)

First, I got the idea when I'm mainly using Java. If Ruby has some features working the similar way, or it can be easily done by adding methods to Object, then please explain it.

Second, I mentioned this can be achieved by using instance/class variables, creating dummy class, or making a second call to get "something". Those work but I want a native way to get the intermediate result in the called method. Someone might think that's not good, but definitely it's useful in some cases.

For example, in the project(Java) I'm working on, we have some util class for accessing ldap. It should not be instantiated so that use instance variable is not ok. and it'll be used concurrently, so class variable won't fit. We do have a method called hasEntry(dn). It'll try to read a entry and return true if it succeeds. Please don't tell me to use getEntry(dn) because we don't want the boolean expressions like getEntry(dn) == null. So now please tell me a way to reuse the LDAPEntry object generated by hasEntry().

I do think above case is normal and a high level design (DataBus) don't fit in.

you could use arrays (anonymous, 2003-06-12 05:17:37)

why not this way?

def func()
return [1,2,"foo"]
end

a,b,c = func

Re: non fatal errors (anonymous, 2003-06-18 08:36:01)

...all of which are more than capable of being passed as arguments to yield(...) or returned in an Array.

There are solutions. I want another one (anonymous, 2003-07-01 12:20:03)

Using array is one solution. However, it's not an answer to my question. I'm not asking how I can do this. Just like most of other feature requests, there are solutions/workarounds for the problem I'm tackling. What I'm asking for / proposing is a new approach. Please discuss its pros and cons more instead of arguing it can be done as I know that.

Reason for rejection

this would cause serious semantic incompatibility.

RCR 143: Add a 'Boolean' superclass for TrueClass and FalseClass (djberg96, 2003-06-08 12:37:31)

Status: Rejected

Currently, in order to type-check a variable that is true or false, you must do something like:

if val.kind_of?(TrueClass) || val.kind_of?(FalseClass)

# or

if val == true || val == false

I would like to request a "Boolean" superclass, of which TrueClass and FalseClass would be a subclass. Then, I could simply do:

if val.kind_of?(Boolean)

This would both look better and ease my carpal tunnel. I could also then do silly stuff, like create a subclass called "MaybeClass". :)

Comments

It just makes sense... (anonymous, 2003-06-08 18:03:41)

I never understood why things aren't how you describe. It seems to make a lot of sense for TrueClass and FalseClass to be subclasses of a Boolean class.

make it an RCR...

Reason for rejection

true and false are only typical values. Everything but "false" and "nil" is truth value. So I think there's no need for Boolean class.

RCR 144: supply a block to Array#join (neoneye, 2003-06-11 20:12:07)

Status: Rejected

[1, 2, 3].join(", ") { |v| "(#{v})" }    
# "(1), (2), (3)"

See the discussion of it, here:

Comments

Not a very good idea (androflux, 2003-06-11 22:28:49)

I don't like the idea because it combines two things that don't need to be combined: a #collect and a #join.

Re: Not a very good idea (neoneye, 2003-06-12 17:49:44)

Yes it can be solved by using collect+join.. please read the discussion.

One problem is when #collect and #join is located far from eachother (several lines), then its difficult figuring out what is going on. Array#join which takes a block, can rule out such confusion.

Another problem is that I am lazy and guess others is lazy as well. When im in a hurry, I do like this:

"XXX "+data.join("
XXX ")

Which is short, but repeating yourself is bad. I could use collect... but I don't.

Isn't Ruby about making programming easy?

I'm still not convinced (androflux, 2003-06-13 19:15:02)

"please read the discussion."

I have. Even threw in a couple of comments. I just don't see what the big deal is. #join does one thing: it joins arrays together into strings. #collect does another.

The point is, saying the a block for #join should be like collect is arbitrary: Somebody else might want a block to #join to act like a #reject and then #join. Or maybe #partition or #grep or.....

Granted, making it act like #collect makes the most sense. But #collect has nothing to do with joining.

In that thread, Joel VanderWerf threw out the idea that join might take two args, and the return string would be what goes between the two args given. I don't really see a need for that either, but it's closer to what join does.

"Isn't Ruby about making programming easy?"

:-)

I (Jason Creighton) said almost the exact same thing in that thread.

Re: I'm still not convinced (neoneye, 2003-06-13 19:50:50)

At this moment #join does not take any block. I don't think it would break existing code if you could supply a block to #join. Question is, what kind of behavier would make sense within that block?

collect-join behavier could be very useful, therefore my RCR proposal. In the discussion I show several example of how it can be used.

pair-join behavier is only little useful.

"I (Jason Creighton) said almost the exact same thing in that thread."
Yes I ripped it from you.. :-)

Re: I'm still not convinced (androflux, 2003-06-14 01:09:48)

"At this moment #join does not take any block. I don't think it would break existing code if you could supply a block to #join. Question is, what kind of behavier would make sense within that block?"

Well, in the rest of Ruby, a block is usually a "dynamic" version of the normal method. What you propose isn't a dynamic version of join: You want a collect and then join. And, IMO, there's no reason to overload the join method that way.

"collect-join behavier could be very useful, therefore my RCR proposal. In the discussion I show several example of how it can be used."

Yes, it could, but I don't want #join overloaded to mean two different things like that.

Re: I'm still not convinced (neoneye, 2003-06-14 04:50:22)

"Well, in the rest of Ruby, a block is usually a "dynamic" version of the normal method. What you propose isn't a dynamic version of join:"

true... But in Ruby there is already (al least one) example of methods, which behaves radical different if they are supplyed with a block. If given a block to File#open then it closes the file afterwards.

IANYM == I am not Yukihiro Matsumoto

Radically different method if block supplied? (anonymous, 2003-06-19 06:09:00)

Actaully File::open is a class method. The file is not open when this method is called. It is opened and passed to the block for the block to use and then closed after the block exits. This actually isn't radically different if you think about it. It does assume that if you gave a block then you want to have the block establish a transaction space for operating on the file and when the block is done, then you are done with the file, so it closes it. It is actually the right place to put it as well.

As for extending join to take a block, I don't think that it is ideal. Why? join is about what the seperator string should be between elements of an array that you want converted to a string. So your converting array elements to strings and then wanting to place the join string between them.

Your example:
[1, 2, 3].join(", ") { |v| "(#{v})" }
# "(1), (2), (3)"

is more about having a block transform/operate on the data then give it to join to concate on the seperator string. To me that is a different operation. It's not just a join! A join only does a to_s on the data so that it can be printed.

Collect, select and those methods are about appling a transform and getting the result.
Join is about getting it as a string...

Just an opinion....

Sam
staypufd at mac dot com

yes, exactly! (androflux, 2003-06-19 16:36:43)

"To me that is a different operation. It's not just a join!"

Exactly what I have been trying to say.

Re: yes, exactly! (neoneye, 2003-06-19 19:20:54)

The behavier is enhanced if a block is supplyed. The seperator-string behavier is preserved. I don't think it will break existing code.

Its this enhancement which can make many things simpler (avoiding typing collect all the time). I think a block supplied to #join is only a minor change.

Im terrible at defending myself in a language which is not my native language. Every time I bring up ideas then everyone is against me.. It would be nice if people would take my side and help me :-)

"To me that is a different operation. It's not just a join!"
Yes.. its an enhanced join which is highly useful :-)

Reason for rejection

This is not intuitive enough for my eyes.

RCR 145: Automated dynamic type-checking for methods (anonymous, 2003-06-12 11:35:30)

Status: Rejected

I'd like some way to automate dynamic type checking for method calls. Instead of having to do something like:

   def foo(a, b, c)
      if !a.kind_of?(Numeric)
         raise ArgumentError.new ("Expected argument a to be of type Numeric")
      elsif !b.kind_of?(Numeric)
         raise ArgumentError.new ("Expected argument b to be of type Numeric")
      end
      # etc...
   end

if we had a built-in mechanism to throw these exceptions, it would be great. For example:

   def foo(Numeric a, Numeric b, c)
      # etc...
   end

could throw an ArgumentError exception if a or b did not return true for kind_of(Numeric).

The problem with this way of handling things is that the exception cannot be caught within this method itself. I was thinking that maybe to allow more flexibility, we could have a keyword called 'type_check' that would check the arguments when invoked. This would allow the exception to be caught within the method itself if desired and also allow some pre-processing code to be added before the type-checking. For example:

   def foo(Numeric a, Numeric b, c)
      a = a.to_f if a.kind_of(String) # If strings were just as good for a

      begin  # Now do type-checking
         type_check
      raise ArgumentError => e
         p e
         return
      end
      # etc...
   end

So what do you think?

Comments

You can already do that in Ruby! (anonymous, 2003-06-12 15:50:04)

This can be done entirely in Ruby. I implemented multi method dispatching in Ruby which allows you to do this and some other nice things like dispatching based on whether an argument responds to given methods or the like. (There might be a better (faster) implementation for class-only dispatching however.)

My implementation isn't entirely done yet (there's still some documentation missing), but I think that it can serve as an example of the power Ruby gives you nevertheless. See .

Of course there are also benefits of having this build into the language like nicer syntax.

Ryan Pavlik's strongtyping does this (markvwilson, 2003-06-14 19:18:52)

See the RAA. A better name for this class would be ArgCheck (in my opinion).

Ruby already has "Strong" typing (chadfowler, 2003-06-14 23:29:48)

"Dynamic" and "loose" aren't synonymous. Maybe it could be called "StaticishTyping" :)

types.rb implements this (anonymous, 2003-06-16 10:50:50)

I don't immediately like the idea of having the method do pre-ops or catch the type exceptions - it "smell dangerous", particularly as it stops you from being able to fully view the types as invariants, and being able to do contract thinking around them. Instead, I think the thing to do is to make a flexible type system that allows the CORRECT set of restraints to be specified as types.

See http://people.freebsd.org/~eivind/ruby/types/ for my take on this.

It pre-dates strongtyping and is more powerful. I have not released it to the RAA because I have not yet found any good way to implement co-variant typing in types.rb.

The library implements

typesig Numeric, Numeric, Object
def foo(a, b, c)
# etc
end

for the case you show above. It also allows more extensive type declarations, like

typesig [Numeric, String], Numeric, Object
def foo(a, b, c)
# Allows a to be either a Numberic or a String
end

There are a bunch of different types; off the top of my head I remember Type::Or (which has a shorthand like above), Type::And (all of the type predicates must match), Type::Regexp (obvious, isn't it?), Type::Hash (allows type testing on individual hash keys; not sure the API for this is quite good enough), Type::Multi (allows multiple arguments to match a group, so you can represent type sets like

typesig Type::Multi(String, Numeric)
def foo(*args)
# args must consist of pairs of String and Numeric
end

There are more than these, but those are characterized by me remembering them without having to think much or look them up ;-)

Feedback is welcome.

Eivind.

Re: types.rb (anonymous, 2003-06-17 08:08:58)

I find this very interesting and it looks very nice and modular, too. :)

What I'd like to find out is whether this is a subset of my dispatch.rb from above or if it is able to do everything that dispatch.rb does.

I think I'll have a closer look at this and mention it as an alternative in the documentation of dispatch.rb with a link to http://people.freebsd.org/~eivind/ruby/types/ if you're okay with that.

type != class (any more :-) (DavidBlack, 2003-06-20 05:09:28)

Hi --

This is an on-going topic of discussion, and will probably continue to be for a long time... but anyway, a couple of comments.

Numeric, String, MyClass, etc. are not really types; they're classes. Unfortunately, in versions of Ruby up to 1.6 (I think), 'type' (the method) was a synonym for 'class'. As I understand it, this was because of problems getting 'class' (which is also a keyword) to parse as a method name.

The 'type' method is gone (or at least deprecated), but the habit of thinking of an object's class as its type is pretty widespread. The disadvantage of this (i.e., why it's not just a matter of Ruby pedantry) is that the class of an object doesn't actually/necessarily tell you what you need to know about the object -- namely, whether it will respond to the messages you send to it. (Nor does it tell you how it will respond, but I don't think anything can do that.)

Obviously Ruby has kind_of?, and objects are willing to tell you their class and ancestors. But depending on this to determine an object's interface is somewhat illusory, since Ruby objects can change dynamically. Also, while in practice this will often work (if you can be certain that objects remain unextended in relevant ways, classes unchanged, etc.), thinking of an object's class as its type means that one has ruled out other, possibly more dynamic and useful ways of characterizing type. (I'm not entirely sure what I think those are, but I think they are there :-)

David Black

writing from Karlsruhe, eagerly awaiting the European Ruby Conference!

I'm looking for this, but can it be done more nicely? (anonymous, 2003-07-31 13:45:20)

I'm looking for a solution like:


def foo(n:Numeric, s:String)
end

def foo(s:String, n:Numeric)
end

A Type Is An Expectation (matju, 2003-08-10 19:36:06)

A type is an expectation, and a bug is a failure to meet expectations. type-checking don't prevent bugs from occurring, but they can help finding where the fault is. Type checking encodes expectations into the program.

Contract-checking encodes even more expectations into the program, but less people use this practice, because it's more cumbersome. I'd say it is at its most useful when library code is using a user-provided implementation of a given interface. Makes it easy to spot whether it's the library's fault or the user's fault.

the anti-type-checking cult is founded on misunderstanding of the concept of expectation.

Reason for rejection

It's against dynamic (duck) typing.

RCR 147: RUBYINIT environmental variable (androflux, 2003-06-24 17:14:11)

Status: Rejected

I think there should be a RUBYINIT environmental varible, that, if set, would be a script run when the Ruby interpreter starts before processing other scripts.

Comments

Re: RUBYINIT environmental variable (tsuihark, 2003-06-25 15:41:45)

What kind of script would be useful for each and every Ruby program you will ever run?

preferences (cout, 2003-06-25 16:15:55)

I might want to turn the verbosity level up for all programs so I don't have to add $VERBOSE=true to the top of every script.

There might be other variables someone would also want to change such as $/ or $ or $-K.

Option to disable (djberg96, 2003-06-25 18:50:30)

This could lead to some serious confusion, especially with shared code and/or debugging. I would definitely want some way to disable/ignore RUBYINIT for either all code or sections of code, should this be implemented.

Re: RUBYINIT environmental variable (androflux, 2003-06-25 20:18:47)

I was thinking mostly of adding more than one user-defined path to $LOAD_PATH. So maybe allowing a colon separated list in RUBYLIB would be better.

Use RUBYOPT? (Reimer_Behrends, 2003-06-25 23:48:31)

Doesn't RUBYOPT=-r/path/to/script work for you?

Re: Use RUBYOPT? (androflux, 2003-06-26 00:59:27)

"Doesn't RUBYOPT=-r/path/to/script work for you?"

Okay, I feel stupid now......

Reason for rejection

I think it's a feature blongs to application, not language.

RCR 165: #inject, #partition expand array if arity> 2 (neoneye2, 2003-10-29 14:38:05)

Status: Rejected

It would be awesome, if #inject could do splitting when arity> 2. For instance converting an array of pair into a hash (arity==3):

x=[["name", "john"], ["age", 20]]
p x.inject({}){|h,k,v|h[k]=v;h}
#=> {"name"=>"john", "age"=>20}

An possible implementation could be:

Also #partition cannot deal with arity> 2..

--
Simon Strandgaard

Comments

You can do it now (matz, 2003-11-20 05:02:39)

x=[["name", "john"], ["age", 20]]
p x.inject({}){|h,(k,v)|h[k]=v;h}
#=> {"name"=>"john", "age"=>20}

Reason for rejection

split explicitly by using parentheses.

RCR 166: Allow Enumerable#* to use user-specified iterators (pcdavid, 2003-10-31 03:04:17)

Status: Rejected

All the utility methods in Enumerable have a hard-wired assumption that the iterator to use is always named #each. What if it's named something else? What if I have multiple iterators in the same class? For example, String has #each_line (aliased to #each) and #each_byte. Currently, I can not use #sort and firends on #each_byte.

I propose to add an optional parameter to most Enumerable methods to specify which iterator to use. Of course, it would default to #each so as not to break existing code.

Example usage for String:

"foo
bar".sort # => [ "bar", "foo" ] (uses #each)
"foo
bar".sort(:each_byte) # => "
abfoor"

Comments

I like it (anonymous, 2003-10-31 03:22:47)

I think this is a great idea.

What about extending it even further and allow not only passing in an iterator method name, but also (alternatively) allowing a proc that can be called?

doesn't work for me (dblack, 2003-10-31 11:21:34)

Hello --

This idea doesn't do much for me. Using :each_byte as an argument makes a kind of dead body out of the name of a method, and even in terms of the logic or semantics, "sort by 'each_byte'" does not sound right and does not sound like it means the same thing as "sort by byte". It's possible of course just to decide that it means that, but it seems inelegant and awkward to me.

(I also don't think sort uses each, or any other iterator.)

Also, most Enumerable methods already *are* iterators. I think you're pointing toward a different language design with an impedance mismatch with respect to Ruby. The mechanism already exists to define 'each' for a given class or object, and the other Enumerable methods (those that do use 'each') will behave accordingly. So if you want 'detect' to only search even-numbered elements (or whatever), you can change your object's notion of 'each' (or write your own special-purpose iterator -- you're not limited to what Enumerable gives you).

David Black

re: doesn't work for me (cout, 2003-11-03 10:27:32)

(I also don't think sort uses each, or any other iterator.)

[pbrannan@zaphod tmp]$ cat test.rb
class Foo
  def each
    yield 5
    yield 4
    yield 1
  end

  include Enumerable
end

p Foo.new.sort

[pbrannan@zaphod tmp]$ ruby test.rb
[1, 4, 5]

Re: doesn't work for me (pcdavid, 2003-11-04 10:48:38)

> [...] "sort by 'each_byte'" does not
> sound right and does not sound like it
> means the same thing as "sort by byte".

Agreed, using sort as an example was not a good idea. However, for some other methods, I think the proposed syntax sound very natural:

aString.select { |line| ... }
aString.select(:each_byte) { |byte| ... }

And if I write a text-processing package and extend String to make it aware of paragraphs:

aString.select(:each_par) { |par| ... }

> The mechanism already exists to define 'each'
> for a given class or object [...].

But the whole idea of this RCR is that it often makes sense for a class to define multiple iterators. For example, an object representing an XML element might have #each_child and #each_attribute. Nothing in the language prevents you to do this, but it means only one of these can benefit from Enumerable. It's like Enumerable had decided "There shall be only one iterator worthy of my services, and it shall be named #each." I'm just asking to remove what I consider a limitation of Enumerable.

Of course, you can rewrite you own versions of #collect for each additional iterator:

def collect_attributes attrs = Array.new self.each_attribute do |attr| attrs But don't you think just being able to say elt.collect(:each_attribute) {|attr| ... } is simpler and more expressive ?

Just create a new enumerable that delegates the iterator (jarhart, 2003-11-04 12:36:29)

e.g.

class EnumerableProxy

  include Enumerable

  def initialize(iterator)
    @iterator = iterator
  end

  def each(&block)
    @iterator.call(&block)
  end

end

s = "foo
bar"

e = EnumerableProxy.new(s.method(:each_byte))

s.sort  # uses #each
e.sort  # uses #each_byte

iterators (dblack, 2003-11-06 09:23:43)

Hi --


> def collect_attributes
> attrs = Array.new
> self.each_attribute do |attr|
> attrs

> But don't you think just being able to say

> elt.collect(:each_attribute) {|attr| ... }

> is simpler and more expressive ?

Why wouldn't you just do:

  elt.attributes.map {|attr|... }

since what you want is a mapping across the attributes anyway? Or if you want an #each-style, rather than #map-style, traversal:

etl.attributes.each {|attr| ...}

or if you've defined the separate #each_attribute method:

elt.each_attribute {|attr| ... }

BTW, the arr.map(:meth) {...} syntax, used for a slightly different purpose, has been rejected as an RCR. (May or may not be relevant.)

David

try 1.8.1 (riffraff, 2003-11-08 20:04:37)

I think this is actually easy in 1.8.1 (dunno if it is possible in 1.8.0):

require 'enumerator'
str = "xyz"

e=Enumerable::Enumerator.new(str,:each_byte)
a = enum.map {
|b| '%02x' % b
} #=> ["78", "79", "7a"]

Enumerator (matz, 2003-11-20 04:59:04)

use 'enumerator' bundled with the 1.8 distribution.

Reason for rejection

use 'enumerator' bundled with the 1.8 distribution.

RCR 169: resume after raise (transami, 2003-11-20 02:36:35)

Status: Rejected

Add a resume method that reenters execution right after the instigating raise call. example:

def resume_example(x)
  begin
    print x
    x = x + 4
    if x &lt; 10
      raise
    end
    print x
  rescue
    x = 10
    resume
  end
end
resume_example(5)  # -> 510

This would compliment retry which returns execution at the top of begin clause rather then after raise. It would also allow better seperation of concern, as you could write an interface on top of a library and report warnings and messages without putting interface code in library.

Comments

You can do it now (matz, 2003-11-20 04:49:35)

def resume_example(x)
  print x
  x = x + 4
  begin
    if x

Reason for rejection

You don't need it; use retry.

Rejected RCRs

Status: Rejected

Comments

Possible reason for the rejection. (HughSasse, 2001-08-02 06:50:45)

Reason for rejection

Status: Rejected

Reason for rejection

Status: Rejected

Reason for rejection

Status: Rejected

Comments

Re: Infix 'function composition' operator (Stephan, 2001-08-01 02:13:31)

Reason for rejection

Status: Rejected

Comments

use iterate { &lti, j> ... } notation (joe, 2001-08-29 12:31:29)

Reason for rejection

Status: Rejected

Comments

We need to flesh this out a bit... (Dave, 2001-08-10 00:18:30)

Warn about variables used only once (Moxon, 2001-10-14 07:43:58)

Reason for rejection

Status: Rejected

Comments

I'm not sure I see this one (Dave, 2001-07-30 17:25:56)

Vague; some issues not considered (matju, 2001-08-31 07:31:12)

Reason for rejection

Status: Rejected

Comments

That might introduce some ambiguities (Dave, 2001-07-31 23:04:14)

#foo would be only for variables (anonymous, 2001-08-08 03:30:12)

Reason for rejection

Status: Rejected

Comments

This would be useful (Dave, 2001-07-31 17:15:10)

Reason for rejection

Status: Rejected

Comments

here here (ianm74, 2001-07-30 16:20:50)

A must-have if we can resolve all the issues (Dave, 2001-07-30 17:22:43)

XML and Web Services (Rich_Kilmer, 2001-07-30 17:38:23)

expat (anonymous, 2001-08-01 08:08:21)

XML support essential (xen, 2001-08-05 23:54:30)

How about data-binding (anonymous, 2001-08-11 13:04:44)

Data-binding and Ruby (Rich_Kilmer, 2001-08-12 14:43:07)

fix to previous post... (Rich_Kilmer, 2001-08-12 14:45:30)

wave goodbye to expat (xen, 2001-10-01 23:42:48)

full DOM (tobi, 2001-10-09 15:04:42)

Xerces, expat, REXML ... why should I know? (fmitchell, 2001-10-24 10:23:44)

XML in the standard lib of Ruby (tobi, 2001-11-13 17:50:16)

Reason for rejection

Status: Rejected

Reason for rejection

Status: Rejected

Comments

Possibly because... (Dave, 2001-07-30 17:32:30)

Reason for rejection

Status: Rejected

Comments

Would this work for all kinds of numbers? (anonymous, 2001-08-01 07:32:37)

Non numeric method? (matz, 2001-08-02 02:38:30)

Reason for rejection

Status: Rejected

Comments

We don't need this. (matz, 2001-08-01 03:08:54)

Reason for rejection

Status: Rejected

Comments

suggestions for enhancements (anonymous, 2001-08-09 01:05:25)

Re: suggestions for enhancements (HughSasse, 2001-08-09 05:52:30)

Reason for rejection

Status: Rejected

Comments

Internationalisation (anonymous, 2001-08-17 12:30:10)

Re: Internationalisation (aj, 2001-08-17 13:39:04)

Re: Internationalisation (anonymous, 2001-08-17 20:43:27)

Sounds perfect for a mixin (anonymous, 2001-08-17 23:00:11)

Even better... a Behaviour (anonymous, 2001-08-18 10:15:29)

Reason for rejection

Status: Rejected