• 0 Posts
  • 14 Comments
Joined 1 year ago
cake
Cake day: July 1st, 2023

help-circle
  • kraegartoActually Useful AIBecoming an AI engineer
    link
    fedilink
    English
    arrow-up
    4
    ·
    1 year ago

    I feel this has been the case already for more time than people think. AI/ML has been its own subspecialty of SWE for years. There are some low hanging fruit that using sklearn or copy and pasting from stack overflow will let you do, but for the most part the advanced features require professional specialization.

    One thing that bothers me is that subject matter expertise is often ignored. General AI researchers can be helpful, but often times having SME context AND and AI skillset will be way more valuable. For LLMs it may be fine since they produce a generalized solution to a general problem, but application specific tasks require relevant knowledge and an understanding of pros/cons within the use case.

    It feels like a hot take, but I think that undergraduate degrees should establish a base knowledge in a domain and then AI introduced at the graduate-level. Even if you are not using the undergraduate domain knowledge, it should be transferable to other domains and help you to understand how to solve problems with AI within the context of a professional domain.



  • kraegartoProgramming.dev Metawhat about data scrapers?
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    I think this is the beauty of federation. Everything is open and free to all rather than a company being able to lock in your personally created content.

    For example, I wanted to learn about NLP and am working on building a bot to monitor sentiment and check for hate speech in lemmy content. I am still at the brainstorming/research phase, but the accessibility of lemmy makes it really nice.

    Pythorhead was made for this exact purpose.


  • I normally try and do “fun” work. This largely depends on how autonomous your job is. I was a PhD student doing research for a company and I received very little oversight for 3 years.

    The supervision I did receive was great though. They understood needing to take a break and slow down. At those point I would generally read papers, watch PyData talks (highly recommend them, like inspirational ted talks for data people), or contribute to open source to learn about new tools or design paradigms.





  • This has been my experience too. A junior dev at my last company kept trying to use ChatGPT to generate docket compose files and wondered why they generally didn’t work.

    My research has been on time series forecasting which is tangentially related to NLP. People are shocked when I point out to them that all these models do it predict the next token. Using weather forecasting has been a good analogy for why long AI generated texts are extra bad: weather forecasts get worse as the horizon increases.

    Despite all my gripes about LLMs, I must say that copilot has saved me writing TONS of boilerplate code and unit tests.



  • I started my MSc and part of it involved building a ray-tracing simulation. I built it in MATLAB, but the technical debt quickly became so high that I had to rebuild it all in Python.

    MATLAB does have classes, but it is hot garbage. Distributed computing is also awful (I moved to Python and Pyspark in quick succession and life got a lot better).

    The only industry job I have seen request MATLAB was for legacy companies like Telesat and I wouldn’t be surprised if they were moving towards Python since the license fees are insane.




  • I interviewed for a position that I was comfortably qualified for. As soon as they mentioned a 3 hour whiteboard interview in person I politely hung up the zoom call.

    On the flip side, I had a company give the best interview process of all time. They told you how many people were remaining in the rounds. The programming task was to implement a hugging face model as a FastAPI. There was also a short video interview that took 5 minutes if you had basic ML knowledge. Likely took 1-2 hours tops and it was actually fun.


  • As much as MATLAB is a great tool, I feel the need to point out that it is roughly equivalent to shooting yourself in the foot long-term.

    Unless you plan on working at a huge company with legacy systems you are likely never going to use it outside of academia. My university seems to be transitioning away from it and others I have talked to had similar thoughts. In the cohort of students I went thought grad school with, AFAIK none of us are at companies that use MATLAB.

    I don’t want to be a downer, but going straight to Python is likely a better long term move.