Haskell/DG

Haskell	Dogelang
import System.Environment import Data.String import Data.List import Data.Char import Data.Ord data Tree a b = Node a b (Tree a b) (Tree a b) \| Nil deriving (Show) letters = "abcdefghijklmnopqrstuvwxyz" filter_letters = filter (flip elem letters) filter_words = filter (((<) 2) . length ) update t k def fn = case t of Node k' v' cl cr -> case compare k k' of EQ -> Node k' (fn v') cl cr LT -> Node k' v' (update cl k def fn) cr GT -> Node k' v' cl (update cr k def fn) Nil -> Node k def Nil Nil inorder t = case t of Node k v cl cr -> (inorder cl) ++ [(k, v)] ++ (inorder cr) Nil -> [] main = do args <- getArgs content <- readFile (args !! 0) let ws = filter_words $ map filter_letters $ words $ map toLower content t = foldl (\t w -> update t w 1 ((+) 1)) Nil ws print $ take 20 $ inorder t print $ take 20 $ sortBy (\a b -> compare (snd b) (snd a) ) $ inorder t	import "/sys/argv" import "/collections/namedtuple" import "/string/ascii_lowercase" Node = namedtuple "Node" ("k", "v", "cl", "cr" ) letters = ascii_lowercase filter_letters = (bind str.join "") <- bind filter (in letters) filter_words = bind filter ((> 2) <- len) update = t k def fn -> if (k', v', cl, cr = t) => if (k == k') => Node k' (fn v') cl cr (k < k') => Node k' v' (update cl k def fn) cr (k > k') => Node k' v' cl (update cr k def fn) t is None => Node k def None None inorder = t -> if (k, v, cl, cr = t) => (inorder cl) + [(k, v)] + (inorder cr) t is None => [] main = -> content = open $ argv !! 1 \|>.read! words = (bind str.split sep:" ") <- (bind str.join " ") <- str.splitlines ws = filter_words $ map filter_letters $ words $ str.lower content t = foldl (t w -> update t w 1 (+ 1)) None ws print $ list $ take 20 $ inorder t print $ list $ take 20 $ sorted (inorder t) key:snd reverse:True main!

Haskell

Dogelang

import System.Environment
import Data.String
import Data.List
import Data.Char
import Data.Ord

data Tree a b = Node a b (Tree a b) (Tree a b) | Nil deriving (Show)

letters = "abcdefghijklmnopqrstuvwxyz"
filter_letters = filter (flip elem letters)
filter_words = filter (((<) 2) . length )

update t k def fn = case t of
  Node k' v' cl cr -> case compare k k' of
                      EQ -> Node k' (fn v') cl cr
                      LT -> Node k' v' (update cl k def fn) cr
                      GT -> Node k' v' cl (update cr k def fn)
  Nil -> Node k def Nil Nil

inorder t = case t of
  Node k v cl cr -> (inorder cl) ++ [(k, v)] ++ (inorder cr)
  Nil -> []

main = do
  args <- getArgs
  content <- readFile (args !! 0)

  let ws = filter_words $ map filter_letters $ words $ map toLower content
      t = foldl (\t w -> update t w 1 ((+) 1)) Nil ws
  print $ take 20 $ inorder t
  print $ take 20 $ sortBy (\a b -> compare (snd b) (snd a) ) $ inorder t

import "/sys/argv"
import "/collections/namedtuple"
import "/string/ascii_lowercase"



Node = namedtuple "Node" ("k", "v", "cl", "cr" )

letters = ascii_lowercase
filter_letters = (bind str.join "") <- bind filter (in letters)
filter_words = bind filter ((> 2) <- len)

update = t k def fn -> if
  (k', v', cl, cr = t) => if
              (k == k') => Node k' (fn v') cl cr
              (k < k')  => Node k' v' (update cl k def fn) cr
              (k > k')  => Node k' v' cl (update cr k def fn)
  t is None => Node k def None None

inorder = t -> if
  (k, v, cl, cr = t) => (inorder cl) + [(k, v)] + (inorder cr)
  t is None => []

main =  ->

  content = open $ argv !! 1 |>.read!
  words = (bind str.split sep:" ") <- (bind str.join " ") <- str.splitlines
  ws = filter_words $ map filter_letters $ words $ str.lower content
  t = foldl (t w -> update t w 1 (+ 1)) None ws
  print $ list $ take 20 $ inorder t
  print $ list $ take 20 $ sorted (inorder t) key:snd reverse:True

main!

Fpp

Published: Thu 12 November 2015
python

Deciding on the pure-ness of a python function

Through the work with dogelang I am currently looking at pureness of python functions and on how to decide and annotate pureness so that I, in theory, could write a static checker that checks whether the annotated functions are pure.

We can take the definition of pureness from the wikipedia and it is as follows
1. The function always evaluates the same result value given the same argument value(s). The function result value cannot depend on any hidden information or state that may change while program execution proceeds or between different executions of the program, nor can it depend on any external input from I/O devices.
2. Evaluation of the result does not cause any semantically observable side effect or output, such as mutation of mutable objects or output to I/O devices.
Python makes few restrictions on side-effects and mutability and since we even can override basic operators (like the dot-operator for attribute access), it is hard to infer anything remotely useful or non-trivial from a python-program.

I have been, for example, responsible for code like that (paracoded for your convenience):
```
def cls(object):
  def __init__(self):
    self.d = {}

  def __getattr__(self, attr)
    if not attr in self.d:
      self.d[attr] = someobj(attr)
    return self.d[attr]

A = cls()
A.foo # this line has side effects
```
This violates the basic principle of python that an expression should either have a return value or a side-effect but not both. But we might have done this for good reason. In this case, the academic value of the "front-end" of the library outweighs the atrocities of the implementation. But it serves as a fine example that, it is hard to reason about even simple looking expressions, if they include the dot-operator (or any other operator for that matter), especially if you don't have the full class definition at hand.

But it isn't only user-made classes that don't hold true to that principle. Most low-level functions that interact with the operating system do that in some way. They have their side-effect within the realms of the rest of the system and still they have a return value that represents some aspect of the operation within our program.

os.write(fd, str)

Write the bytestring in str to file descriptor fd. Return the number of bytes actually written.

open(file, ...)

Open file and return a corresponding file object. If the file cannot be opened, an OSError is raised.

So we would want to mark any function as impure that has a reference to one of these impure functions. But we would also want to mark any functions defined within these impure functions to be themselves impure. Looking at the following code it would be hard to infere whether a function has a reference to, let's say 'open'.
```
o, f = open, "file.tmp"
def fun():
  o(f)
```
Given Rice's theorem, I'd think that properly tracking references would either be undecidable or at least a problem of exponential growth (tracking all possible assignments to some reference). So to be on the safe side, we state the following as our impurity rule number 1: A function is impure if it uses a reference from an impure function or has a reference to an impure function. We would also want an exhaustive list of builtin impure functions, applying the rules recursively.

In the former code example the surrounding block leaks (that's what we will call it) the reference to both o and f into the namespace of fun. This would not be a problem. We even want this behaviour. Let's look at this example:
```
def f():
  return x*x
x = 5
print(f()) # 25
x = 7
print(f()) # 49
```
This is contrary to our definition of purity, that the function, given the same arguments should evaluate to the same value. So this only happened, because the reference was updated. Let's add impurity rule number 2: A function is impure if a name is assigned twice and a reference to that name is leaked. This should cover all cases since shenanigans like the following cause a NameError.
```
def a():
  yield (lambda : x*x)
  x = 3
  yield None
x, i = 5, a()
g = next(i)
print(g()) # Name Error: free variable 'x' referenced before assignment in enclosing scope
next(i)
print(g())
```
Here we see the yield-keyword we have the case of suspended computation. The next-function is of course part of the set of impure builtin functions because it returns a value and advances the internal state of the generator. Since for rule 2 we introduced the leaked name, we can still allow multiple assignments within a function if the name is not leaked because we want to allow, for example, the generator-function infinity to be pure.
```
def infinity():
  i = 0
  while True:
    yield i
    i = i + 1
```
Here, I find, we arrive at an impasse. Every iteration over a generator "uses the generator up". We could say that we won't allow references to generator-objects but that seems arbitrarily strict. Also, we can't really know which name could be used by a generator-object which would lead to not allowing any bare names (only function calls). Disallowing generators seems impossible since map and filter and other functional primitives depend on it.

I would welcome any ideas for a rule that would solve this issue of leaking the internal state of a generator.
Published: Mon 09 November 2015
coffeescript javascript

CoffeeScript is a missed opportunity
- list comprehension
- missing primitves
- associativity of application
- missing bind
- prefixing/infixing operators/functions
Published: Thu 05 November 2015
Updated: Thu 05 November 2015
dg python functional conference

dogelang adding pattern matching

If you haven't read about dogelang, you really should. It is python with haskell-syntax where you can use all python modules. So basically coffee-script but for a language that is already useful. And python bytecode is emitted instead of first compiling to python itself, but that just as a side-node.

On my behest, the developer of dg added pattern matching so that in an if-condition a tuple-unpacking-expression does not raise an exception if the unpacking fails but returns false.

Therefore one can now implement their own zip and list-constructor in the following manner:
```
zip = x' y' -> if
  ([x, *xs], [y, *ys] = x', y') =>
    yield (x, y)
    yield from $ zip xs ys
  otherwise => yield from ()

list = x' -> if
  ([] = x') => []
  ([x, *xs] = x') => [x] + list xs
```
This looks almost like haskell's case-pattern-matching but not quite, so it is sure to annoy every haskell-programmer.

You might need to execute python3 -m dg -b once after installing/updating dg to rebuild the interpreter/compiler-bundle with this feature.

In other news: It is quite likely that I will hold a small talk about dogelang at Leipzig's next Haskell conference HaL-10.

« Page 4 / 4

Haskell/DG

Deciding on the pure-ness of a python function

CoffeeScript is a missed opportunity

dogelang adding pattern matching

links

social