• Re: lisp idiom for processing each line in a file?

    From B. Pym@Nobody447095@here-nor-there.org to comp.lang.lisp on Mon Jun 30 11:12:53 2025
    From Newsgroup: comp.lang.lisp

    Rob Warnock wrote:

    | I'm trying to process each line in a file, and am wondering if there is
    | a simpler or more idiomatic way of doing it than the following:
    |
    | ;;; Simplified version of my real process-line function
    | (defun process-line (line)
    | (format t "~A~%" line))
    |
    | ;;; Process stream, calling per-line-fn on each line of the stream
    | (defun process (per-line-fn stream)
    | (let ((line (read-line stream nil)))
    | (if (not (null line))
    | (progn (funcall per-line-fn line)
    | (process per-line-fn stream)))))
    +---------------

    This can blow up if your file is large and your CL implementation
    doesn't happen to perform tail-call optimization on the tail call
    to PROCESS. Note: Common Lisp is not Scheme.


    Yes, CL (COBOL-Like) is so clunky that it can't even handle
    recursion properly. It requires a Lispy language like Scheme
    to do that. CL is used only when a COBOL-Like language will
    suffice.



    +---------------
    | And I kick off the process like so:
    |
    | CL-USER> (with-open-file (stream "../../words/test.txt")
    | (process #'process-line stream))
    +---------------

    The standard CL idiom omits the intermediate PROCESS function
    entirely, and does the whole thing in the WITH-OPEN-FILE call:

    (with-open-file (stream "../../words/test.txt")
    (loop for line = (read-line stream nil nil)
    while line do
    (per-line-fn line)))

    +---------------
    | Is there a simpler way of doing what I'm trying to do here? +---------------

    See above.


    Gauche Scheme:

    (with-input-from-file "data.bak"
    (^() (do ((line #f))
    ((eof-object? (set! line (read-line))))
    (print line))))


    +---------------
    | I've seen some loop-based solutions, but am staying away from
    | loop for the moment.
    +---------------

    Why?!? It's the most natural way to code this particular task, IMHO.

    Paul Graham:

    I consider Loop one of the worst flaws in CL, and an example
    to be borne in mind by both macro writers and language designers.


    [In "ANSI Common Lisp", Graham makes the following comments:]

    The loop macro was originally designed to help inexperienced
    Lisp users write iterative code. Instead of writing Lisp code,
    you express your program in a form meant to resemble English,
    and this is then translated into Lisp. Unfortunately, loop is
    more like English than its designers ever intended: you can
    use it in simple cases without quite understanding how it
    works, but to understand it in the abstract is almost
    impossible.
    ....
    the ANSI standard does not really give a formal specification
    of its behavior.
    ....
    The first thing one notices about the loop macro is that it
    has syntax. A loop expression contains not subexpressions but
    clauses. The clauses are not delimited by parentheses;
    instead, each kind has a distinct syntax. In that, loop
    resembles traditional Algol-like languages. But the other
    distinctive feature of loop, which makes it as unlike Algol as
    Lisp, is that the order in which things happen is only
    loosely related to the order in which the clauses occur.
    ....
    For such reasons, the use of loop cannot be recommended.


    Dan Weinreb, one of the designers of Common Lisp:

    ... the problem with LOOP was that it turned out to be hard to
    predict what it would do, when you started using a lot of
    different facets of LOOP all together. This is a serious problem
    since the whole idea of LOOP was to let you use many facets
    together; if you're not doing that, LOOP is overkill.


    Barry Margolin:

    My recommendation is based on seeing many question in the past
    of the form "What happens if you use both XXX and YYY in the
    same LOOP?" The unfortunate fact is that when we were writing
    the standard we didn't have time to nail down all the possible
    interactions between different LOOP features, so many of these
    are not well specified. And even if we did get it right in
    the standard, it's likely to be difficult to find them and I
    wouldn't trust that all implementors got it right (many of
    those questions were probably from implementors, trying to
    figure out what they were supposed to do). And even if they
    all got it right, someone reading your code may not be able to
    figure it out.

    So, with all those potential problems, my feeling is that if
    you have to ask, it's probably better to use something other
    than LOOP.


    Barry Margolin:

    3. Loop is very powerful, granted, and many people are trying to
    argue that "you can do so much with loop that it's unreadable."
    This is not an argument.

    But it is! Because any use of LOOP has the potential to be
    unreadable, the reader must read it carefully to verify that
    it's just one of the cases that doesn't require careful
    reading!


    Barry Margolin: (05 Apr 2002 20:57:48 GMT)

    This seems like a big change just to clean up the way LOOP is described.
    And LOOP will still be a wart, because it will be the only language feature that uses "per-macro keywords". Providing this interface and giving a name
    to them would encourage other macro designers to do something similar, and
    we don't want more things like LOOP.


    John Foderaro:

    I'm not trying to join a debate on loop. I just wanted to present
    the other side of [the issue so that] the intelligent people can
    then weigh the arguments on both sides.

    I'm not suggesting that loop can be fixed either by adding
    parenthesis or coming up with ways of indenting it to make it
    understandable. It's a lost cause.

    ...

    Another great example from kmp:

    === from kmp

    For example, you might think
    (loop with i = (random 100) for x from 1 to 10 do (print (list i x)))
    and
    (loop for i = (random 100) for x from 1 to 10 do (print (list i x)))
    meant the same in English, [but they don't do the same thing in loop]

    === end kmp

    loop lulls you into thinking that you understand the program since
    you understand English. Make no mistake about it, loop is its
    own language. If you use it you condem everyone who reads the
    code to also learn the loop language.
    --- Synchronet 3.21d-Linux NewsLink 1.2
  • From B. Pym@Nobody447095@here-nor-there.org to comp.lang.lisp on Mon Jun 30 16:34:56 2025
    From Newsgroup: comp.lang.lisp

    B. Pym wrote:


    Gauche Scheme:

    (with-input-from-file "data.bak"
    (^() (do ((line #f))
    ((eof-object? (set! line (read-line))))
    (print line))))

    One could do it this way, but the repetition of "read-line"
    is clunky.

    (with-input-from-file "data.bak"
    (^() (do ((line (read-line) (read-line)))
    ((eof-object? line))
    (print line))))

    Let's use "<>" (it's used in the macro "cut") to indicate
    duplication of the preceding expression.

    (with-input-from-file "data.bak"
    (lambda()
    (do. ((line (read-line) <>))
    ((eof-object? line))
    (print line))))

    (define-syntax do.-aux
    (syntax-rules ( <> )
    [ (do.-aux ((v init <>) more ...) seen stuff ...)
    (do.-aux ((v init init) more ...) seen stuff ...) ]
    [ (do.-aux (what more ...) (seen ...) stuff ...)
    (do.-aux (more ...) (seen ... what) stuff ...) ]
    [ (do.-aux () seen stuff ...)
    (do seen stuff ...) ] ))

    (define-syntax do.
    (syntax-rules ()
    [ (do. (things ...) more ...)
    (do.-aux (things ...) () more ...) ] ))

    --- Synchronet 3.21d-Linux NewsLink 1.2