8000
Skip to content
View eburakova's full-sized avatar
🦾
🦾

Block or report eburakova

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
eburakova/README.md

Hello there! I am Ekaterina (Katja), I am a software engineer coming from organic chemistry and protein NMR.

I have a soft spot for interactive charting, so I am learning D3 on the side.

πŸŽ“ In my doctoral years, I focussed on feature engineering in protein structural biology. I adapted of existing models of structure - proterty relationship for a specific case of highly statically disordered samples. I explored and proposed scores for the degree of local disorder. This was a largely interdisciplinary project that combined the domains of quantum physics, molecular biology as well as data science.

🐠 In spring 2024, I graduated from the Data Science Bootcamp @ neue fische, where I practiced my old skills and learn plently of new ones.

πŸ§‘β€πŸ”¬ Since 2024, I am working at AI|ffinity to finally advance academic NMR software to the next level and make automated protein assignments a reality.

πŸ“¨ If you have a technical problem to solve - drop me an email! burakova.ek@gmail.com

πŸ“ Bremen, DE

NAVIGATION

Hackathon Challenges

  • πŸ€– sort GIT out! - AI-Agentic app to chat with your Git repository. For CheffTreff Hackathon 2025, solution for Finanz Informatik challange. πŸš€πŸ’Έ

    • Analyses commit messages and difference logs.
    • Powered by Gemini API (state on April 2025)
    • Simple, sleek UI (made with Streamlit)
    • Developed in <24 hours!
  • πŸ₯ˆ sustAIn - Extracting sustainability-related information from PDFs for Bremen AI-Hackathon in summer 2024. Challenge provided by Encoway / Lenze group.

    • Highly heterogeneous documentation on electric devices is converted into a parseable database.
    • New PDFs can be uploaded and analyzed on the fly.
    • Powered by openAI API
    • Developed in 48 hours by the team of five
  • πŸ₯‡ DataScience for Production Pipelines - Berlin, September 2024. Challenge provided by Bayer. (NON-PUBLIC)

    • Identified and linked patterns of errors on the pilot plant packaging liquid formulations into vials.
    • Presented a Markov chain model to the non-technical stakeholders.

NMR

Data science

  • 🏎️ d-drivers - Data-driven search for traffic drivers. This is the graduation project at the neue fische Data science bootcamp (Apr 2024). Our team of five analyzed the internal content data of EFAHRER.com. We modelled the page impressions in the news feed and built a data app for the editorial managers.

Humble beginnings (bootcamp challenges)

  • πŸ”₯ fraud-detection - Analyzing energy consumption patterns to detect which clients have meddled with the electrical and gas counters.
  • 🏘️ eda-kc-housing - Analysis of price defining factors on the King County housing dataset for a mock client interested in investment into property development. EDA showcase prepared as a part of the DS bootcamp.

Personal projects

  • Built my own data processing software to handle highly heterogeneous and poorly structred transaction histories - from ground up. Used it to fill out my own tax declarations 2022-2025. Feel free to ask me about this, as well as my experience with real-time transaction analysis. _The repositories will remain private. _

Pinned Loading

  1. sortGIT_out sortGIT_out Public

    Explaining GIT history @ CheffTreffHackathon

    Python

  2. d-drivers d-drivers Public

    Data science graduation project on the dataset provided by EFAHRER.com

    Python 1

  3. protein_heterogeneity_ssnmr protein_heterogeneity_ssnmr Public

    Scripts and test data relevant to my PhD thesis

    Jupyter Notebook 3C31

  4. nmr_utilities nmr_utilities Public

    Some useful tools for visualization and handling of NMR data, primarily, proteins

    Jupyter Notebook

0