<div dir="ltr">Hello dataHaskell!<div><br></div><div>I'd like to inaugurate our mailing list by showing something that I'm working on for our documentation site.</div><div><br></div><div>After checking some notebooks in Kaggle, <a href="https://www.kaggle.com/mrisdal/titanic/exploring-survival-on-the-titanic">this notebook</a> by Megan Risdal really caught my eye. It looks really complete</div><div>and also, accomplishes a lot of tasks ranging from easy ones as simple feature engineering by extracting titles from the</div><div>names, to more complex ones like applying a Random Forest. It also includes visualization, which is cool too.</div><div><br></div><div>I'm currently working on porting it to Haskell, so we can see what's missing and what's there. I'm using Eric Conlon's (@ejconlon)</div><div>Analyze library, and even though its still young, I absolutely love it. It allows easy CSV loading and there are no name clashes</div><div>like in Frames, nor you have to define your datatypes first as in Cassava.</div><div><br></div><div>The notebook itself can be found <a href="https://github.com/NickSeagull/ex01-exploring-titanic/blob/master/src/Lib.hs">here</a>, it's in a repo in my Github and it can be loaded in HaskellDO, although right now I'm using</div><div>Vim + Stack REPL, until error highlighting is implemented.</div><div><br></div><div>Cheers</div></div><div dir="ltr">-- <br></div><div data-smartmail="gmail_signature">nikita tchayka . software craftsman<br> { <a href="http://nickseagull.github.io">nickseagull.github.io</a> }</div>