2022 ARM/ASR Joint User Facility and PI Meeting – Day One

Impromptu poster session with Monica.

Well, kind of a half day! Woke bright (dark) and early at 4am for the 7am flight from O’Hare to the nation’s capital. After a smooth travel day I arrived at the very familiar Rockville Hilton with a small posse of Argonne scientists.

The isolation (albeit easing) through the pandemic changes one’s brain chemistry. I have not been in a place where so many people know me and I know so many people in a very long time. Furthermore, there are people here I have developed professional relationships with via Zoom during the pandemic, and now I meet them here in glorious, high-def, lag-free, three dimensions!

Team Argonne-ARM selfie!

One such person is Dr Monica Ihli from Oak Ridge National Laboratory. Monica was plugged into the ARM Facility after the start of the pandemic and she has been working closely with Max Grover (an RSE in my group) on data-proximate compute as part of our funded work in workforce development. She is working with Max to build JupyterLab-based cyberinfrastructure right up against the many petabytes of ARM data. We even had an impromptu poster session! That kind of interaction does not happen over Zoom.

Latest results from the TRACER Aerosol team.

This first half day of the meeting had two sessions that necessitated an early morning flight: a session on the TRACER field campaign that just finished and, in a new innovation, a session on emerging technologies. The TRACER session provided an awesome overarching view of the one-and-a-half-year deployment to Houston. Numerous partners, already 38TB of data in the archive and, at this meeting, 32 posters being presented mere days after the conclusion of the deployment! Some notables for me were the different temporal and spatial scales of the aerosol (those tiny particles that have big impacts) measurements and early efforts to classify and tag storms impacting the region.

Finishing the day with hot pot with friends.

The new and emerging technology session was fascinating. So many technologies that, if realized, would be amazing. One technology I have my eyes on is the Snow Pixel by Particle Flux Analytics. It is like a digital camera for measuring snowflakes, sensing when a flake falls on it. And that was one of many; I have a page of notes to follow up on, especially for our plans for the CROCUS measurement deployment.

A great first day, finished up with some hot pot with friends. I am slowly regrowing that Science-Social nexus in my brain that has gone unfed for a long time.

The ARM ASR Science Team Meeting

A note: This represents the view of a DOE funded scientist, not the Department or any of its programs.

Screenshot from a talk I gave at the ARM meeting 13 years ago!

I did not have an iPhone back in 2009 when I attended my very first ARM Science Team meeting in Louisville, Kentucky. So I have been unable to find any pictures of the event. I did find the old presentation I gave! I arrived in Kentucky after a flight from Australia that got delayed and an unexpected stopover in LA. I arrived barely in time to give a talk on vertical motions in storms!

That meeting was in spring (in the USA) and now we are here in fall, 13-odd years later, and next week will be the first in-person ARM-ASR science team meeting since the pandemic started in the USA. I used to work for the Australian Bureau of Meteorology and now I live in Chicago and work for Argonne National Laboratory, and my work is a lot closer to ARM’s mission.

Real work gets done. ARM ASR science team meeting in Potomac, March 2013.

I am excited to be back at this meeting in person (in Rockville, Maryland). This will be my first “Programmatic” meeting since the pandemic began. What is a programmatic meeting, you ask? Well, funding bodies like programs within the Department of Energy’s Office of Science provide support to universities, laboratories, etc., the same way programs in, say, the National Science Foundation do, but they are more mission driven. Programs like the Atmospheric System Research, or ASR, program need those funded by them to work together.

Breakfast to Beer. Science all day. Tyson’s Corner, 2018.

The science ASR seeks to tackle (making our simulations of the planet more accurate and useful for the nation) cannot be achieved by any one investigator. ASR forms working groups and special task forces, and these groups, along with those who manage the programs (imaginatively called… wait for it… Program Managers), meet once a year. These meetings have many purposes but three are: to allow DOE supported and associated scientists to understand the needs of the programs (ASR and the Atmospheric Radiation Measurement, or ARM, program) and closely aligned programs; to allow the program managers to listen and gain a deeper understanding of the breadth of science their programs fund; and to allow everyone to interact, learn and forge new collaborations.

The last in person ARM ASR Science Team meeting. North Bethesda, June of 2019.

It is an exhausting week. In some years I have had days containing 7am breakfast meetings right up to 8pm science, dinner and a beer meetings. It is the one time all those I work with are in one physical place and it presents unique opportunities. I joke with Louise that it is the only domestic meeting I come back from jetlagged.

The ARM ASR meeting coming up next week is special. It is the first in person meeting in three years and it has been an eventful three years. Much has happened in all our lives, professionally and personally. There will be a lot of catching up. Those who read my blog know of the TRACER field campaign. That has been planned and executed all in this time. I will be heading to the meeting with excitement and an open mind, 13 years after that excited young man traveled to Louisville and his world changed.

Curating Weather Simulation Data. EarthCube Workshop in North Dakota.

“Simulation outputs are important but that does not mean we save them forever” – Gretchen Mullendore

This week I have been attending a workshop on data curation (a key part of open science), specifically on developing guidelines for the data produced by weather and climate simulations. Open science is better science! But a blanket “you must save and provide all data” is not only onerous (especially for underserved institutions) but also not what is needed for reproducibility and reusability.

So many great minds focused on open science.

First, this post contains my thoughts, which do not necessarily reflect the views of attendees and organizers. There will be a report. There has been a lot written about measurements, and measurements cannot be recreated. Model data, to a degree, can be regenerated. By sharing workflows, those with the appropriate resources can run the models on provided initialization and configuration data. Furthermore, the sharing of workflows allows the exploring of the robustness of conclusions to assumptions (sensitivity) and the reuse of the workflow to address new science questions.

Gretchen kicking off the meeting

I really enjoyed the discussions and applaud the team’s focus on designing rubrics as it brings the conversation up a level and enables the clear measurement of the efficacy of solutions. It was also great seeing a huge diversity in the career stage and “flavor” of participants. We had data creators, curators, representatives from three publishers (AGU, AMS and PLOS), data scientists and more!

Susan from the University of Michigan on data curation.

Also, fittingly, lots of discussions around equity. Open science is better science. Journals are increasingly requiring data to be made available (even FAIR), which can create a burden on institutions without the physical and/or workforce resources to meet these requirements. There have been discussions of carving out exceptions for underserved communities. My perception is that the community here at the workshop pushed back hard against that idea because, as aforementioned, open science is better science. Rather, we need to equip those institutions to meet the open science requirements.

Lots of discussions on just how much data should be required to be made available to be open and how long it should be curated for. Again, a focus on designing rubrics to guide the process. The focus should be on the goal and be flexible, to aid the scientist in achieving open science and reproducibility and also aid the society-driven journals in meeting the aspirations of their members.

A nice atmosphere and a nice atmosphere!

It was great to be back in Grand Forks. The University of North Dakota is a great institution that, in atmospheric science, punches way above its weight. Two of our recent three hires had a background at UND and I very much enjoy my collaborations with the team there. It was also very nice to be there during a dry cool air outbreak in summer rather than a frigid cold air outbreak in October!

SciPy Thoughts

Subtitle: too busy to blog. Just about finished my time here at SciPy and I am both tired and energized. My excitement has not diminished from my first SciPy back in 2012. Great to meet new people and re-meet people that, due to reasons, many of them pandemic related, I have lost contact with.

Good to be back in the ballroom!

My number one takeaway from SciPy is how much better organized the community is and how they, more so than any government program I have worked with, pull in the same direction and work in concert across many projects. The impact of organizations like Chan Zuckerberg is clear, as is the orchestrating role of NumFocus. Also a thing to watch is the new Scientific Python organization, which is aimed at sustainable growth and enhancement of the ecosystem.

Queso!!!

The increasingly common language of enhancement projects (PEPs, SPECs, ZEPs etc…) and common governance structures is extremely pleasing, and what just blows my mind is how this is completely self organized without any kind of edict from above.

The Scientific Python ecosystem is just that, an evolving ecosystem! It is so pleasing watching it evolve to a sustainable track. As Ben Blaiszik said during his keynote, this software is fundamental science infrastructure, and while it needs (very much) more financial support from the agencies whose science it supports (side eye at DOE) it is now in a place where any funds it (the ecosystem) receives will be used for the good of science.

On a technical note, some great things I took home were: new, exciting 3D visualization tools, Pangeo Forge forges ahead, cool ways to access HRRR as an Xarray-like Zarr store from AWS, James Webb Space Telescope processing runs on SciPy, new ways to manage conda environments for teams and more.

BBQ and storms!

On a professional note, my greatest enjoyment was from seeing the enjoyment of my team, three of whom were at their first in-person SciPy. Joe, Max and Bhupendra seemed to completely immerse themselves in the meeting and made new connections. It was also fantastic seeing our ARM collaborators at Brookhaven Lab, Die Wang and Sid Gupta, there. This turned into a mini-science meeting as well, with new connections made and new work planned. It also is a sign that open science is growing in the programs I love.

On a personal note, it was fun and a little interesting being in Austin during the pandemic. The city’s homeless problem has gotten worse, many businesses are struggling with hiring and some old haunts have gone out of business. I really enjoyed taking advantage of the scooter scheme, clocking up 25 miles of low carbon transport.

Great seeing our DOE EESSD funded open science family grow at SciPy.

The news today of NumFocus taking over from Enthought as the organizing entity for SciPy is great news. Enthought has been spectacular and so supportive but having a genuine not for profit will help in many ways. It also opens the opportunity for SciPy not being in Austin. I am genuinely on the fence about this. Whatever the case I hope NumFocus takes a good look at WHY we have these meetings and comes up with some guiding principles. Define what is trying to be achieved, a north star to guide decisions. Then they and the chairs, committee, etc, can keep coming back to those and be forced to justify decisions. I am excited for the future, be it in Austin or elsewhere (note the contract for Austin in 2023 is signed, this does not mean it has to be in Austin but means there is a cost to not having it in Austin).

I’ll finish this blog post by asserting I need to become more engaged in the community. I need to write folks like NumFocus, Quansight, 2i2c et al into grant proposals as collaborators: not only are they better positioned to implement workflows I love to use, but funding them will give back to the tools I love to use. I also need to make more time to contribute code and continue to support my team in contributing to free open community software, critical international science infrastructure.

SciPy 2022. Kid In a Candy Store.

Short update! I am SciPy bound. My first in-person conference since, well, the world stopped. So the pandemic is by no means over and there is some controversy (which I will not go into but you can GTS yourself) but that has not dampened my excitement.

My first SciPy. Red pill all the way.

One super exciting thing is three members of my team, Bhupendra, Max and Joe, are heading to their first SciPy. I remember my first SciPy. It was like a scene from The Matrix where I took the red pill and my world changed forever. I have been in “science” for two decades plus and I have never found a community like the Scientific Python community. The smartest and kindest people I have ever met. Genuine and passionate.

Great day for traveling.

I am excited to re-meet many I have met (please please forgive my memory for names, the pandemic has frazzled the skills I had, already meagre, in that area), meet new people and just learn a lot! I remember clearly in 2012, attending my first SciPy, about to give a talk and wondering what to use to format code (at SciPy they show a LOT of code, it is amazing). And I heard about this cool tool called an IPython Notebook. Yeah, before Jupyter.

And that is the amazing thing about SciPy. You are, as Hamilton would say, in the room where it happens. At the very least you are in the hallway outside the room and are the first to know about what happened and use the tools of said happening. Bring It On!