6 things FELD M learned at useR!2019 in Toulouse

on 23.07.2019 by Linda Le

Hi, I´m Linda. I am part of the Data Science team at FELD M and was excited to participate this year’s useR!2019 conference, which took place in Toulouse.

That meant 4 days full of great

  • 3h tutorials
  • keynotes
  • 30 min blocks of 6*5 min lightning talks
  • 1,5h blocks of 5*18 min talks
  • sponsor talks
  • poster session
  • social events, …on up to 6 parallel tracks!

The complete list of talks including slides can be found here http://www.user2019.fr/talk_schedule/ and video recordings of the keynotes here: https://www.youtube.com/channel/UC_R5smHVXRYGhZYDJsnXTwg/videos. The video recordings of all talks are uploaded here: https://www.youtube.com/channel/UC_R5smHVXRYGhZYDJsnXTwg/videos.

Let me tell you about the conference’s input as I guide you through a typical project´s timeline. I took advantage of a nice Machine Learning Workflow Hexa-Diagramm and added a 6th Hexagram, adding ‘Communication’ of projects.

Let’s go through the 2nd, 3rd and 6th Hexagon to give some examples, what I took with me from useR! and where we now are taking some deep dives to improve our workflow.

 

  • {tidyr} by famous Hadley Wickham (a must read for everyone advancing in R is his recent 2nd edition of “Advanced R” book: https://adv-r.hadley.nz/index.html) is updated. In the area of web analytics we, at FELD M, receive raw data, in which all touchpoints of all visitors/customers are recorded in rows. In order to analyse customer journeys, we need to reshape our data, so that we have the customers in rows and all touchpoints per customer, i.e. the customer journey in another column. The transformation of reshaping the data from long format to wide format is therefore a regulary used transformation in Data Science projects. The current functions to reshape data are spread() and gather(), where many R-users had to strugggle with the logic. So, Hadley Wickham showed us the work in progress functions pivot_longer() and pivot_wider(), with a more intuitive function and arguments name to reshape data. https://tidyr.tidyverse.org/
  • When working with large data sets we usually use either data.table or SparkR (which we currently prefer over sparklyr because of its more similar syntax to PySpark and hence easier switch between Python and R). The latter two methods rely on RAM for their performance. Since our datasets often don’t fit into the RAM anymore but are still below real big data (calculations can’t be handle by a single machine anymore), the newly developed package {disk.frame} (https://rpubs.com/xiaodai/intro-disk-frame) offers an interesting possibility to store and process medium sized datasets. Data larger-than-RAM is split up and stored in chunks on the harddrive and {disk.frame} provides an API for manipulating these chunks. Unlike Spark, {disk.frame} does not require a cluster and can use any function in R.
  • Before we build a model, we first analyse the data on a descriptive level to decide what assumptions we make to build a model. Visualizing high-dimensional data can then be a cumbersome task. In a tutorial Di Cook showed us her packages like {tourr} https://github.com/ggobi/tourr, which visualizes higher-dimensional (>3) data in an animated rotation. You can take a variable and rotate it out of the projection and see if the structure persists or disappears. The package {nullabor} https://github.com/dicook/nullabor is a tool for graphical inference. Your data plot will be displayed among several random nullplots (plots representing your nullhypothesis). If the difference is visible, there is probably a statistical significane in the structure of the plot.
  • Due to the individual advantages of Python and R, at FELD M Data/Software Engineering is mainly done in Python, while the analysis (building models, statistical tests) by the Data Science Team is more focused on R. Our Data/Software Engineering- and Data Science Team is already working very closely together on Advanced Analytics projects to take the advantage of both expertises and both languages (Python and R). Of course, it is in general our goal to build our (data) products in one programming language. Nevertheless, sometimes we build prototypes, which have to live in both worlds and require to use both languages. The {reticulate} package https://rstudio.github.io/reticulate/ makes it possible to call Python out of RStudio. Rounded off by the GUI developments of knit Rmarkdown, it will be easier to bridge language silos.

 

  • When it comes to building a model, it is always important to know the cause of a variable, as we all know “correlation != causation”. Under the assumption, that causal relationship leaves a structure in the data, there are many procedures that detect this causation. Causaldisco summarizes the causal discovery procedures in R and filters the appropriate procedures for your data when you choose your properties. http://biostatistics.dk/causaldisco/.

 

All in all, the success of a project depends not only on the methods, such as those mentioned above, but also on the environment you create in your company. Julie Lowndres showed us in her keynote (https://www.youtube.com/watch?v=Z8PqwFPqn6Y&t=2806s), how she and her team work by embracing open data science, openess and the power of welcome.

FELD M is looking forward to take some deep dives into the learnings listed above now and to put them into practice to improve our workflow and smoothen the journey for our customers.

If you are interested in our work, come and check out our portfolio: https://www.feld-m.de/service/data-strategy-advanced-analytics/.

Or if you are a NGO/NPO, come and check out our contribution to Data Science for good with our “Data Ambulance”: https://www.feld-m.de/datenambulanz/

24 Antworten auf „6 things FELD M learned at useR!2019 in Toulouse“

더존카지노 sagt:

Hello, I think your web site may be having web browser compatibility problems. When I look at your site in Safari, it looks fine but when opening in Internet Explorer, it’s got some overlapping issues. I merely wanted to provide you with a quick heads up! Aside from that, wonderful site!
https://www.wooricasino.site/thezonecasino

카지노사이트 sagt:

Great post. The conclusion is one aspect of my writing that I need to approve upon. I’ll definitely incorporate some of these ideas.
https://txt2080.com/

Bryan Parker sagt:

An impressive share! I’ve just forwarded this onto a co-worker who has been conducting a little research on this. And he actually bought me dinner because I stumbled upon it for him… lol. So let me reword this…. Thank YOU for the meal!! But yeah, thanx for spending the time to talk about this matter here on your site.
http://www.charlestonstateuniversity.com/

Joanne Rush sagt:

Great blog here! Also your web site loads up very fast! What host are you using? Can I get your affiliate link to your host? I wish my web site loaded up as fast as yours lol
https://www.uniofdenton.com/

온라인카지노 sagt:

This is really good! One of the best ones i have found online

https://joinlive77.com/

바카라사이트 sagt:

Great post. The conclusion is one aspect of my writing that I need to approve upon.

https://joinlive77.com/

대구오피 sagt:

“Anyone wishing to study medicine must master the art of massage.”
https://opviewer.com/

Neurodrine sagt:

Through research with mice and humans, scientists have found that brainy activities stimulate new connections between nerve cells and may even help the brain generate new cells, developing neurological „plasticity“ and building up a functional reserve that provides a hedge against future cell loss.
https://ipsnews.net/business/2021/12/15/neurodrine-reviews-supports-healthy-brain-memory-function/

Lanta Flat Belly Shake sagt:

You can also expect some unsupportive behaviors it’s not uncommon for a partner or other companion to feel threatened as you lose weight. Your mother may feel hurt if you don’t eat her famous dessert, or your friends may beg you to skip a workout to go out for pizza. Remind your loved ones that while you’re changing your lifestyle, you’re not changing your feelings for them. Give them specific ways to help you reach your goals and share your journey with them.
https://ipsnews.net/business/2021/12/16/flat-belly-shake-reviews-metabolic-support-that-slims-waist/

Harmonium Sleep Support sagt:

Set aside no more than eight hours for sleep. The recommended amount of sleep for a healthy adult is at least seven hours. Most people don’t need more than eight hours in bed to achieve this goal. Go to bed and get up at the same time every day. Try to limit the difference in your sleep schedule on weeknights and weekends to no more than one hour. Being consistent reinforces your body’s sleep-wake cycle.
https://ipsnews.net/business/2021/12/16/harmonium-sleep-support-reviews-new-deep-sleep-support/

ParActin Joint sagt:

One of the most effective home remedies when it comes to fighting joint pain is the application of heat to the painful area. You can use a hot water bottle, an electric mat, or poultices on the joint in question.
https://ipsnews.net/business/2021/12/11/paractin-joint-pills-reviews-supports-healthy-bone-function/

Tinnitus Guard sagt:

When the sound waves hit the eardrum in the middle ear, the eardrum starts to vibrate. When the eardrum vibrates, it moves three tiny bones in your ear. These bones are called the hammer (or malleus), anvil (or incus), and stirrup (or stapes). They help sound move along on its journey into the inner ear.
https://ipsnews.net/business/2021/12/15/tinnitus-guard-customer-reviews-no-more-ringing-or-buzzing/

SlimCrystal Slimming Water Bottle sagt:

Getting enough sleep is unbelievably necessary for weight loss, in addition on forestall future weight gain. Studies have shown that sleep-deprived folks are up to fifty five additional possible to become corpulent, compared to those that get enough sleep. This range is even higher for kids.
https://ipsnews.net/business/2021/12/11/slimcrysta-water-bottle-reviews-its-support-healthy-weight-loss/

NervoLink sagt:

Medications used for epilepsy or depression may be helpful due to their effects on nerve signals, while anti-inflammatory medications such as ibuprofen usually are not. Opioid analgesics, sometimes called narcotics or painkillers, are an option in some cases but are not the first choice. Medication typically does not completely eliminate pain.
https://ipsnews.net/business/2021/12/16/nervolink-ingredients-reviews-simple-way-to-support-your-nerves/

Energy Focus sagt:

Brain blips are frequently linked to situational factors and normal age-related changes. For most people, mental flexibility starts to become a bit less efficient with each passing decade from our late 20s onward, and memory starts to decline in our late 30s, so it is common to notice more memory problems as we age.
https://ipsnews.net/business/2021/12/16/earthnutri-energy-focus-reviewssupports-mental-performance-and-memory/

YEC Keto sagt:

Keto has some traits that are reminiscent of the Atkins diet and other low-carb plans. The keto diet involves dramatically reducing your carbohydrate consumption and substituting fat instead. This dietary alteration will put your body into a metabolic state known as ketosis.
https://ipsnews.net/business/2021/12/27/yec-keto-customer-reviews-attain-faster-weight-loss-with-bhb/

Green Galaxy CBD Isolate Gummies sagt:

CBD is one of the compounds in the cannabis plant, better known as marijuana. Unlike the famous cannabinoid tetrahydrocannabinol (THC), CBD doesn’t cause the psychological effects typical of being “high”. Both CBD and THC act on the body’s natural endocannabinoid system, which plays a role in many processes including appetite, pain and memory.
https://ipsnews.net/business/2021/12/18/green-galaxy-cbd-gummies-reviews-advanced-pain-relief-with-natural-hemp/

Pure Neuro sagt:

Brain healthy habits, including exercise, nutrition, sleep, and stress management, contribute to or detract from the brain’s performance. Training and lifestyle factors can maintain and even strengthen brain health by keeping the brain’s complex circuits highly synchronized with a healthy blood supply. Better brain health means better brain performance.
https://ipsnews.net/business/2021/12/27/pure-neuro-ingredients-reviews-supports-healthy-brain-memory/

Keto Pure Select sagt:

The ketogenic diet is one of the most talked-about, and perhaps trusted diet plans to lose weight. From celebrities who vouch about its success, and the many health benefits it offers apart from weight loss, Keto remains to be one of the top buzzwords when it comes to dieting and nutrition.
https://ipsnews.net/business/2021/12/27/keto-pure-select-pills-reviews-burn-fat-for-energy-not-carbs/

GlucoTrust sagt:

When your blood sugar levels are running high, your body will try to flush excess sugar out of your blood through the urine. As a result, your body will need more fluids to rehydrate itself. Drinking water can help the body with flushing out some of the glucose in the blood. Just a word of caution to be sensible with drinking water; water intoxication (which can result in death) is possible if a number of litres water are drunk in a short space of time. This is rare and quite difficult to manage but it pays to be aware of this.
https://ipsnews.net/business/2021/12/05/glucotrust-customer-reviews-reducing-sugar-gives-deep-sleep/

Insta-ToxTM sagt:

Wrinkles are folds or ridges on our skin. Also known as rhytides, they are creases on the skin and are the result of lack of moisture and elasticity of the skin. Wrinkles generally hit people in the old age but due to modern lifestyles, people at a younger age are also becoming its victim. Exposure to the sun, pollution, stress, smoking, lack of sleep etc. has also been considered as the leading factor for wrinkles on our skin. Everybody develops wrinkles due to the natural ageing process and they mainly occur on the face, neck, hands of our body. At times, genetics plays a major factor while developing wrinkles.
https://ipsnews.net/business/2021/12/27/insta-tox-customer-reviews-erase-aging-signs-with-natural-serum/

Slimming Tea sagt:

Certain foods assist the body in removing excess water. Cucumber is loaded with water and is an excellent source of fiber, potassium, all the B vitamins, phosphorus and magnesium. Salad greens contain vitamin A and C (just don’t buy Iceberg; it has virtually no nutritional value). Spinach is a wonderful source of calcium and it doesn’t cause mucus buildup like dairy milk. And, do yourself a favor by trading in Romaine lettuce for spinach in salads.
https://ipsnews.net/business/2021/12/27/all-day-slimming-tea-reviews-efficient-drink-to-lose-weight/

Kolkata Night Love sagt:

Independent model girl agency Kolkata night love, you will find all kinds of sweet girls, Also Likes Modelling Career and Fashion Show model You can See for Entertainment or fun, Visit our website for details.

http://kolkatanightlove.com/

Schreibe einen Kommentar

Deine E-Mail-Adresse wird nicht veröffentlicht.