Hi! I'm rayane 👋

a Bioinformatics student at Paris-Saclay

About

Computational biology student interested in integrative, systems-level representation of biological data.

Interested in building new computational methods and tools merging multi-omics data from different modalities. I like understanding integrative systems behavior, particularly in human diseases and traits. I'm mostly into deep learning and network science approaches, extending my interests into single-cell and spatial omics in oncology and cognition.

Research Experience

April 2025 – June 2025

Bioinformatics Research Engineer Trainee

Bioinformatics and Biostatistics Hub, Institut Pasteur M1 Internship

Performed large scale bulk omics analysis on CCLE Breast Cancer data

  • RNA-seq raw reads preprocessing, exploratory analysis of transcriptomic and proteomic counts, differential experession and abudance analysis and downstream functional gene set enrichment and pathway analysis.
  • With the internship, attended 60+ hours of bioinformatics and statistics courses in unsupervised learning, linear models, single cell and functional analysis
January 2024 – May 2024

Graduation Capstone Project

Lebanese American University, Byblos Undergraduate Research

Exploration of network inference approaches to model gene interactions leading to antimicrobial resistance in bacterial species. The project was later focused on E. coli with phenotypic screenings narrowed down to 11 drugs, public data retrieved from BV-BRC PATRIC db.

  • Pangenome construction, gene-gene statistical synergy inference through statistical methods and machine learning models (SVM ensemble learning), network exploration, robustness and topological analysis was performed.
  • The project comprised pipelines in bash, python and R, running experiments and literature review.
September 2023 – September 2024

Genomics Dry Lab Volunteer

Microbial Genomics Lab, Lebanese American University Undergraduate Research

Bioinformatics contributor to multiple microbial and viral genomics projects, handling WGS data from Illumina and Oxford Nanopore sequencing.

  • Developed documented pipelines and created a lab's github account to maintain them for efficient reuse, mainly aiming to automate usual lab tasks for gene annotation, pangenome construction, phylogenetic analysis, and metagenomic processing (prokka, roary, fasttree, IQ-TREE, Biopython, vegan etc.).
  • Took part in an in silico SARS-CoV-2 study on ORF3a–VPS39 interaction, including mutation tracking, protein modeling, docking, and molecular dynamics simulations; presented as a poster at LAU Research Day 2024.
  • Performed WGS processing and phylogenetic analysis for a genomic characterization study of a clinical Streptococcus oralis strain (Proceedings 2025, 124(1), 10).

Education

MSc in Bioinformatics/Computational Biology

Université Paris-Saclay, 2024 – 2026

M1 grade: 16.63/20

Track: Genomics, Informatics, Mathematics for Health and Environment

Main coursework: functional and Comparative genomics, NGS analysis and statistical methods, Machine Learning and Deep Learning for precision medecine, Big Data and Databases, Advanced programming (OOP, parallel), Algorithms, Metaheuristics, Biological networks and Modeling

BSc in Bioinformatics

Lebanese American University, 2021 – 2024

CGPA: 3.8/4.0

Graduted with High Distinction, Honors Program scholar, merit-based Scholarship Holder

Main Coursework: Network Science, Data mining, Genetics, Molecular Biology, Adv. Human Genetics, Biochemsitry, Algorithms and Data Structures, Linear Algebra, Honors Capstone project course

Projects

RNAr: RNA Structural Library

RNA Structure Bioinformatics Python Library Group Project Course Project

Python library for extraction, processing, visualization, and analysis of RNA structures, developed as a collaborative course project.

Genetic Algorithms for Genome Assembly

Genome Assembly Network Science Genetic Algorithms Python Course Project

Genetic algorithm–based genome assembly formulated as a Hamiltonian path optimization problem, combining evolutionary computation and graph-based assembly.

che3le: deep learning library

Deep Learning Python OOP Library Course Project Group Project

Object-oriented Python implementation of a neural network library from scratch, covering core components like layers, activations, loss functions, optimizers, and training loops.

Pangenomic Epistatic GxG Network

Antimicrobial Resistance Network Science Statistics Python Research Capstone Project

Gene–gene interaction network constructed from pangenome presence–absence data.

COVID-19 Epidemiological Trends

Epidemiology Big Data Python bash R Course Project

Big data analysis of COVID-19 epidemiological data, focusing on preprocessing, statistical exploration, and temporal dynamics.

Tumor Growth and Immune Interaction Modeling

Modeling Cancer NetLogo Agent-Based Modeling Group Project Course Project

Agent-based simulation of tumor, immune, and normal cell interactions in a 2D grid environment, replicating a published modeling study.

DESeq2.jl

Differential Gene Expression Julia Negative Binomial Modeling personal

Julia implementation inspired by the DESeq2 package, modeling differential gene expression using negative binomial distributions and dispersion estimation.

Generalized Linear Models from Scratch

Statistical Modeling Python GLM OOP personal

Python library implementing generalized linear models from scratch, mimicking the R glm interface using object-oriented design principles.

Brain Imaging Deep Learning

Deep Learning Neuroimaging Python CNN personal

Exploration of convolutional neural networks applied to brain imaging data, focusing on representation learning and model behavior.

Dynamic Network Visualizer

Network Science Visualization Python personal

Lightweight Python tool for animating dynamic networks through a graph class that captures node and edge changes over time.

Protein Structure Proximity Network

Network Science Protein Structure Python bash personal

Scripts to build a protein structure proximity network of residue nodes and distance-based edges with their visualizations and contact maps - practicing data processing into networks to further train models on these graphs

Parallel Naive DNA Aligner

Bioinformatics C Parallel Programming personal

Naive motif matcher implemented in C, exploring parallel programming techniques for DNA sequence alignment with maximum Hamming distance constraints.

Graph-Based Genome Assembler

Genome Assembly Graph Theory Python personal

Genome assembly using De Bruijn and k-mer graph approaches, exploring alternative graph-based strategies for sequence reconstruction.

NGS Association and Linkage Analysis

Bioinformatics GWAS Python Group Project Course Project

Association and linkage analysis of multifactorial and Mendelian diseases using multiple NGS-based analytical approaches.

Machine Learning from Scratch

Machine Learning Python OOP personal

Implementation of core machine learning algorithms from scratch in Python, emphasizing visualization and algorithmic understanding.

LSTM for Protein Aggregation

Protein Aggregation Python LSTM personal

Exploration of recurrent neural networks applied to protein sequences to model aggregation-related patterns.

Simple HMM Splice Site Predictor

Bioinformatics Python HMM personal

Hidden Markov Model implemented from scratch for splice site prediction using the Viterbi algorithm.

Genetic Algorithms for Protein Design

Protein Design Genetic Algorithms Python personal

Optimized genetic algorithms for designing protein sequences that fold into target structures, inspired by classical GA formulations.

GNN on WGCNA

Graph Neural Networks Gene Coexpression Python GNN personal

Exploration of graph neural networks applied to weighted gene coexpression networks derived from transcriptomic data.

Network Disintegration through Targeted Attacks

Network Science Graph Theory Python Group Project Course Project

Comparative study of targeted node and edge attacks based on topological properties and their effects on different network types.

Drug Discovery Database Management System

Database Systems SQL Python Course Project

Database management system for storing and querying drug discovery–related data, developed as a first project in database systems.

ROSALIND

Bioinformatics Python personal

Collection of solutions to classic bioinformatics algorithmic problems from the ROSALIND platform.