Tips for golfing in Julia

21

3

What general tips do you have for golfing in Julia? I'm looking for ideas that can be applied to code golf problems in general that are at least somewhat specific to Julia (e.g. "remove comments" is not an answer).

Jonathan Van Matre

Posted 2014-03-17T17:09:06.287

Reputation: 2 307

Answers

19

NOTE: The below may contain some outdated tips, as Julia is not quite stabilised in terms of structure, yet.

A few tricks to save a few characters

  1. Overload operators with frequently used binary functions. For example, if you need to do a lot of integer divisions, and no need for inverse division, use \ =div, and you can then type a\b instead of div(a,b). Note the space - this is necessary to avoid it parsing as a "\=" operator. Also note that, if overloaded at REPL prompt level, use (\)=Base.(\) or \ =Base. \ to reset it. NOTE: some functions have existing UTF-8 operators pre-defined, such as ÷ for div, as noted by Alex A.
  2. Use ^ with strings for conditional output. That is, rather than a>0?"Hi":"", use "Hi"^(a>0) to save one byte, or for boolean a, use "Hi"^a to save three bytes.
  3. (sometimes) Store small fixed-size vectors as separate variables. For instance, rather than a=split("Hi there"," "), you may be able to avoid a[1] and a[2] by using a,b=split("Hi there"," "), which can be referenced as a and b, saving three bytes for each use, at the cost of just two extra characters at assignment. Obviously, do not do this if you can work with vector operations.
  4. Access first element of Array with [] - for arrays, the expression A[] is equivalent to A[1]. Note that this does not work for Strings if you wish to get the first character, or for Tuples.
  5. Don't use isempty for Arrays, Tuples, or Strings - instead, use ==[] for arrays and ==() for tuples; similarly, for the negative, use !=[] and !=(). For strings, use =="" for empty, but use >"" for not-empty, as "" is lexicographically before any other string.
  6. Use the right short-circuit boolean operator in place of "if". It might be a bit less Julia-specific, but it is worth bearing in mind. x<=1&&"Hi" can be written as x>1||"Hi", saving a character (so long as the return of the boolean isn't important).
  7. Don't use contains to check presence of character in string - if you're restricted to the basic ASCII, use in('^',s) rather than contains(s,"^"). If you can use other characters, you can save a bit more with '^'∈s, but note that is 3 bytes in UTF-8.
  8. Looking for minimum/maximum values in an array? Don't use minimum or maximum - rather than using minimum(x) or maximum(x), use min(x...) or max(x...), to shave one character off your code, if you know x will have at least two elements. Alternatively, if you know all elements of x will be non-negative, use minabs(x) or maxabs(x)
  9. Where possible and permitted by the challenge, use an actual newline instead of \n - note that this will make your code harder to read, and may mean that you need to provide an "ungolfed" version to make it possible for people to actually understand it.
  10. Put options after the regex string - if you want to have a regex string in multiline mode, for example, don't type r"(?m)match^ this", type r"match^ this"m, saving three characters.
  11. Reverse 1-D arrays using flipud - reverse(x) is one byte longer than flipud(x) and will perform the same operation, so the latter is better.
  12. Where possible, use array concatenation instead of push!, unshift!, append!, or prepend! - for normal arrays, this can be done easily. For arrays of type Any with array elements, you will need curly brackets around the added arrays (that is, {[1,2]}, not {1,2}) - for Julia 0.4, you would need Any[[1,2]].
  13. Use array indexing to get the size of an array or string - when you use end within array indexing, it automatically gets converted to the length of the array/string. So rather than k=length(A), use A[k=end] to save 3 characters. Note that this may not be beneficial if you want to use k immediately. This also works in a multidimensional case - A[k=end,l=end] will get the size of each dimension of A - however, (k,l)=size(A) is shorter by one byte in this case, so use it only if you want to immediately access the last element at the same time.
  14. Get an index iterator using array indexing - Similar to 13, you can also obtain an iterator matching the length of an array using A[k=1:end], in which case k will hold an iterator matching 1:length(A). This can be useful when you want to also use array A at the same time.
  15. Don't use collect to convert a string into a char array - instead of collect(A), use [A...], which will do the same thing and save 4 bytes.
  16. Need a number converted to a string? Use "$(s[i])" or dec(s[i]) for expressions or multi-character variables, and "$i" for single-character variables.
  17. Use ?: instead of && or || for conditional assignment - that is, if you want to perform an assignment only on some condition, you can save one byte by writing cond?A=B:1 rather than cond&&(A=B), or cond?1:A=B rather than cond||(A=B). Note that the 1, here, is a dummy value.
  18. Use union or instead of unique - union(s) will do the same thing as unique(s), and save a byte in the process. If you can use non-ASCII characters, then ∪(s) will do the same thing, and only costs 3 bytes instead of the 5 bytes in union.

Glen O

Posted 2014-03-17T17:09:06.287

Reputation: 2 548

You can split on spaces using simply split("Hi there") since the pattern argument defaults to a space. – Alex A. – 2015-07-15T14:23:18.133

@AlexA. - I know, but it's not the point of the tip, and the tip applies equally well either way. – Glen O – 2015-07-15T14:28:29.703

Point 12 has changed in 0.5. – Lyndon White – 2016-10-02T14:44:03.343

@Oxinabox - I'm not surprised, I'm pretty sure a few of them are outdated by now. I originally wrote most of the tips for 0.3, I think. – Glen O – 2016-10-02T16:19:51.030

?: seems to be less useful in recent versions of Julia, as both need to be surrounded by spaces, adding 4 useless bytes. – ETHproductions – 2018-09-26T18:20:52.147

@ETHproductions - yeah, there have been some changes that undoubtedly cause issues with some of the tips. I'll eventually update the tips to suit current behaviour. – Glen O – 2018-09-30T07:19:10.497

3Oh how I would love that first trick in Python. – seequ – 2014-06-04T18:04:48.603

15

Redefine operators to define functions

Redefining operators can save a lot of bytes in parentheses and commas.

Recursive unary operators

For a unary example, compare the following recursive implementations of the Fibonacci sequence:

F(n)=n>1?F(n-1)+F(n-2):n # 24 bytes
!n=n>1?!~-n+!(n-2):n     # 20 bytes
!n=n>1?!~-n+!~-~-n:n     # 20 bytes

Try it online!

The redefined operator retains its initial precedence.

Note that we could not simply swap out ! in favor of ~, since ~ is already defined for integers, while ! is only defined for Booleans.

Binary operators

Even without recursion, redefining an operator is shorter than defining a binary function. Compare the following definitions of a simple divisibility test.

f(x,y)=x==0?y==0:y%x==0 # 23 bytes
(x,y)->x==0?y==0:y%x==0 # 23 bytes
x->y->x==0?y==0:y%x==0  # 22 bytes
x\y=x==0?y==0:y%x==0    # 20 bytes

Try it online!

Recursive binary operators

The following illustrates how to redefine a binary operator to compute the Ackermann function:

A(m,n)=m>0?A(m-1,n<1||A(m,n-1)):n+1    # 35 bytes
^ =(m,n)->m>0?(m-1)^(n<1||m^~-n):n+1   # 36 bytes
| =(m,n)->m>0?m-1|(n<1||m|~-n):n+1     # 34 bytes
m\n=m>0?~-m\(n<1||m\~-n):n+1           # 28 bytes

Try it online!

Note that ^ is even longer than using a regular identifier, since its precedence is too high.

As mentioned before

m|n=m>0?m-1|(n<1||m|~-n):n+1           # 28 bytes

would not work for integer arguments, since | is already defined in this case. The definition for integers can be changed with

m::Int|n::Int=m>0?m-1|(n<1||m|~-n):n+1 # 38 bytes

but that's prohibitively long. However, it does work if we pass a float as left argument and an integer as right argument.

Dennis

Posted 2014-03-17T17:09:06.287

Reputation: 196 637

11

  1. Don't be too easily seduced by factor(n) The tempting base library function factor(n) has a fatal flaw: it returns the factorization of your integer in an unordered Dict type. Thus, it requires costly collect(keys()) and collect(values()) and potentially also a cat and a sort to get the data you wanted out of it. In many cases, it may be cheaper to just factor by trial division. Sad, but true.

  2. Use map map is a great alternative to looping. Be aware of the difference between map and map! and exploit the in-place functionality of the latter when you can.

  3. Use mapreduce mapreduce extends the functionality of map even further and can be a significant byte-saver.

  4. Anonymous functions are great! ..especially when passed to the aforementioned map functions.

  5. Cumulative array functions cumprod, cumsum, the flavorful cummin and the other similarly named functions enable cumulative operations along a specified dimension of an n-dimensional array. (Or *un*specified if the array is 1-d)

  6. Short notation for Any When you want to select all of a particular dimension of a multi-dimensional array (or Dict), e.g. A[Any,2], you can save bytes by using A[:,2]

  7. Use the single-line notation for functions Instead of function f(x) begin ... end you can often simplify to f(x)=(...)

  8. Use the ternary operator It can be a space-saver for single-expression If-Then-Else constructions. Caveats: While possible in some other languages, you cannot omit the portion after the colon in Julia. Also, the operator is expression-level in Julia, so you cannot use it for conditional execution of whole blocks of code.
    if x<10 then true else false end vs
    x<10?true:false

Jonathan Van Matre

Posted 2014-03-17T17:09:06.287

Reputation: 2 307

1Might I suggest adjusting tip 3? mapreduce is longer than either mapfoldl or mapfoldr, and can have varying behaviour depending on implementation. mapfoldl and mapfoldr are left- and right-associative (respectively) consistently, and thus are a better choice. This also applies more generally to reduce (use foldl or foldr). – Glen O – 2015-07-15T09:07:55.630

factor is gone for the standard library – Lyndon White – 2017-07-08T03:17:26.257

3How on Earth is "use the ternary operator" somewhat specific to Julia? It's relevant to every language which has it. – Peter Taylor – 2014-03-17T17:40:19.633

5It's relevant that it has it. Many languages also have map, anonymous or pure functions, some kind of shorthand for any/all, cumulative functions, etc. If we were to reduce every tips thread to only the features absolutely unique to that language, there would be very little tips content. – Jonathan Van Matre – 2014-03-17T18:04:18.153

There are probably almost as many languages which don't have comments as there are languages which don't have the ternary operator. – Peter Taylor – 2014-03-17T18:24:02.687

3Gosh, only all the functional ones for starters, where everything returns a value so a ternary op would be redundant. If you must have specific examples: Go, Haskell, Scala, Lua, VB, Prolog, PL/SQL...even Python didn't for many versions. Of the near-dozen languages whose tips threads mention their ternary operator is there some reason you only chose to come be provincial in mine? – Jonathan Van Matre – 2014-03-17T18:50:23.390

I haven't seen the others. – Peter Taylor – 2014-03-17T19:01:53.993

3Well, hey, at least you're an equal-opportunity curmudgeon. ヘ( ̄ー ̄ヘ) – Jonathan Van Matre – 2014-03-17T19:14:37.123

let us continue this discussion in chat

– Peter Taylor – 2014-03-17T20:39:38.953

9

Iterate over functions

This is also possible in other languages, but usually longer than the straightforward method. However, Julia's ability to redefine its unary and binary operators make it quite golfy.

For example, to generate the addition, subtraction, multiplication and division table for the natural numbers from 1 to 10, one could use

[x|y for x=1:10,y=1:10,| =(+,-,*,÷)]

which redefines the binary operator | as +, -, * and ÷, then computes x|y for each operation and x and y in the desired ranges.

This works for unary operators as well. For example, to compute complex numbers 1+2i, 3-4i, -5+6i and -7-8i, their negatives, their complex conjugates and their multiplicative inverses, one could use

[~x for~=(+,-,conj,inv),x=(1+2im,3-4im,-5+6im,-7-8im)]

which redefines the unary operator ~ as +, -, conj and inv, then computes ~x for all desired complex numbers.

Examples in actual contests

Dennis

Posted 2014-03-17T17:09:06.287

Reputation: 196 637

6

  1. Don't type return f(x)=x+4 is identical to but shorter than f(x)=return x+4. Julia always returns the result of the last statement.
  2. Use = instead of in. [x for x in 1:4] is 3 characters longer than, but equivalent to [x for x=1:4]

gggg

Posted 2014-03-17T17:09:06.287

Reputation: 1 715

6

  1. Keywords can sometimes immediately follow constants without the need for a space or semicolon. For example:

    n->(for i=1:n n-=1end;n)
    

    Note the lack of a space between 1 and end. This is also true for end occurring after a close paren, i.e. )end.

  2. Perform integer division using ÷ rather than div() or overloading an operator. Note that ÷ is worth 2 bytes in UTF-8.

  3. Use vec() or A[:] (for some array A) rather than reshape() whenever possible.

  4. Create functions rather than full programs when allowed in the challenge rules. It's shorter to define a function which accepts input rather than defining variables by reading from stdin. For example:

    n->(n^2-1)
    n=read(STDIN,Int);n^2-1
    
  5. Variables can be incremented inside the argument to a function. For example, the following is my answer to the Find the Next 1-Sparse Binary Number challenge:

    n->(while contains(bin(n+=1),"11")end;n)
    

    This is shorter than incrementing n inside of the loop.

Alex A.

Posted 2014-03-17T17:09:06.287

Reputation: 23 761

5

Use Broadcasting function calls.

Introduced in Julia 0.5. It is like map, but uses less characters, and does broadcasting behavour over it's args -- which means you can write less lambda's to deal with things.

Rather than:

  • map(f,x) -- 8 characters.
  • f.(x) -- 5 characters

Better still:

  • map(a->g(a,y),x) -- 16 characters
  • g.(x,[y]) -- 9 characters

Lyndon White

Posted 2014-03-17T17:09:06.287

Reputation: 1 021