February 2021

Temporal Ordered Clustering in Dynamic Networks: Unsupervised and Semi-Supervised Learning Algorithms

Publication: IEEE Transactions on Network Science and Engineering

Krzysztof Turowski, Wojciech Szpankowski

In temporal ordered clustering , given a single snapshot of a dynamic network in which nodes arrive at distinct time instants, we aim at partitioning its nodes into K ordered clusters C_1≺⋯≺C_K such that for i<j , nodes in cluster C_i arrived before nodes in cluster C_j , with K being a data-driven parameter and not known upfront. Such a problem is of considerable significance in many applications ranging from tracking the expansion of fake news to mapping the spread of information. We first formulate our problem for a general dynamic graph, and propose an integer programming framework that finds the optimal clustering, represented as a strict partial order set, achieving the best precision (i.e., fraction of successfully ordered node pairs) for a fixed density (i.e., fraction of comparable node pairs). We then develop a sequential importance procedure and design unsupervised and semi-supervised algorithms to find temporal ordered clusters that efficiently approximate the optimal solution. To illustrate the techniques, we apply our methods to the vertex copying (duplication-divergence) model which exhibits some edge-case challenges in inferring the clusters as compared to other network models. Finally, we validate the performance of the proposed algorithms on synthetic and real-world networks.

AI/ML

Wadhwani AI is a Proud Recipient of the Meta Llama Impact Grant

November 7, 2024

Notes from the Second GCA Impact Accelerator in New York City

December 5, 2022

Reflections from the first GCA Impact Accelerator in Stockholm

September 26, 2022

Field Lessons: Human-Centred Design for More Inclusive Tech Solutions

May 19, 2022

ML Engineer

ROLES AND RESPONSIBILITIES

An ML Engineer at Wadhwani AI will be responsible for building robust machine learning solutions to problems of societal importance; usually under the guidance of senior ML scientists, and in collaboration with dedicated software engineers. To our partners, a Wadhwani AI solution is generally a decision making tool that requires some piece of data to engage. It will be your responsibility to ensure that the information provided using that piece of data is sound. This not only requires robust learned models, but pipelines over which those models can be built, tweaked, tested, and monitored. The following subsections provide details from the perspective of solution design:

Early stage of proof of concept (PoC)

Setup and structure code bases that support an interactive ML experimentation process, as well as quick initial deployments
Develop and maintain toolsets and processes for ensuring the reproducibility of results
Code reviews with other technical team members at various stages of the PoC
Develop, extend, adopt a reliable, colab-like environment for ML

Late PoC

This is early to mid-stage of AI product development

Develop ETL pipelines. These can also be shared and/or owned by data engineers
Setup and maintain feature stores, databases, and data catalogs. Ensuring data veracity and lineage of on-demand pulls
Develop and support model health metrics

Post PoC

Responsibilities during production deployment

Develop and support A/B testing. Setup continuous integration and development (CI/CD) processes and pipelines for models
Develop and support continuous model monitoring
Define and publish service-level agreements (SLAs) for model serving. Such agreements include model latency, throughput, and reliability
L1/L2/L3 support for model debugging
Develop and support model serving environments
Model compression and distillation

We realize this list is broad and extensive. While the ideal candidate has some exposure to each of these topics, we also envision great candidates being experts at some subset. If either of those cases happens to be you, please apply.

DESIRED QUALIFICATIONS

Master’s degree or above in a STEM field. Several years of experience getting their hands dirty applying their craft.

Programming

Expert level Python programmer
Hands-on experience with Python libraries
- Popular neural network libraries
- Popular data science libraries (Pandas, numpy)
Knowledge of systems-level programming. Under the hood knowledge of C or C++
Experience and knowledge of various tools that fit into the model building pipeline. There are several – you should be able to speak to the pluses and minuses of a variety of tools given some challenge within the ML development pipeline
Database concepts; SQL
Experience with cloud platforms is a plus

ML Scientist

ROLES AND RESPONSIBILITIES

As an ML Scientist at Wadhwani AI, you will be responsible for building robust machine learning solutions to problems of societal importance, usually under the guidance of senior ML scientists. You will participate in translating a problem in the social sector to a well-defined AI problem, in the development and execution of algorithms and solutions to the problem, in the successful and scaled deployment of the AI solution, and in defining appropriate metrics to evaluate the effectiveness of the deployed solution.

In order to apply machine learning for social good, you will need to understand user challenges and their context, curate and transform data, train and validate models, run simulations, and broadly derive insights from data. In doing so, you will work in cross-functional teams spanning ML modeling, engineering, product, and domain experts. You will also interface with social sector organizations as appropriate.

REQUIREMENTS

Associate ML scientists will have a strong academic background in a quantitative field (see below) at the Bachelor’s or Master’s level, with project experience in applied machine learning. They will possess demonstrable skills in coding, data mining and analysis, and building and implementing ML or statistical models. Where needed, they will have to learn and adapt to the requirements imposed by real-life, scaled deployments.

Candidates should have excellent communication skills and a willingness to adapt to the challenges of doing applied work for social good.

DESIRED QUALIFICATIONS

B.Tech./B.E./B.S./M.Tech./M.E./M.S./M.Sc. or equivalent in Computer Science, Electrical Engineering, Statistics, Applied Mathematics, Physics, Economics, or a relevant quantitative field. Work experience beyond the terminal degree will determine the appropriate seniority level.
Solid software engineering skills across one or multiple languages including Python, C++, Java.
Interest in applying software engineering practices to ML projects.
Track record of project work in applied machine learning. Experience in applying AI models to concrete real-world problems is a plus.
Strong verbal and written communication skills in English.

About Us

Our Work

Knowledge Centre

Careers

Partnerships

Temporal Ordered Clustering in Dynamic Networks: Unsupervised and Semi-Supervised Learning Algorithms

Related Posts

Wadhwani AI is a Proud Recipient of the Meta Llama Impact Grant

Notes from the Second GCA Impact Accelerator in New York City

Reflections from the first GCA Impact Accelerator in Stockholm

Field Lessons: Human-Centred Design for More Inclusive Tech Solutions

Wadhwani AI is a program of the AI Unit of Lords Education and Health Society (LEHS)

About Us

Our Work

Knowledge Centre

Subscribe

Careers

Partnerships

Contact Us

Vision

ML Engineer

ML Scientist