2023 Landing Page – 2024 MSDS

Welcome to the 1st MSDS 2023!

The Centre for Data Sciences (CDS) of the University of Macau warmly welcomes you to the 1st Macau Symposium on Data Science (MSDS 2023), a showcase of progress in the rapidly growing and highly promising field of data science. The symposium will be held at the University of Macau on 4 September 2023.

The symposium aims to foster exchange on advances and achievements in data science across fields including marketing, financial technology, artificial intelligence, law, education, language, precision medicine and smart governance. In particular, the event will bring together speakers from diverse backgrounds to discuss the latest research on data science and data science tools.

The program will feature plenary sessions, keynote speakers and discussion on the following topics:

1) Data mining and data management;
2) Machine learning algorithms and techniques;
3) Natural language processing;
4) Big data analytics and processing;
5) Data visualization and storytelling;
6) Ethical and legal issues in data science;
7) Applications of data science in various industries.

Registration

Schedule

From	To	Session	Speaker	Title
09:00	09:20	Opening Ceremony
09:20	09:30	Group Photo
09:30	10:30	Keynote Presentation	Prof. Reynold CHENG	Data Science for Social Goods: STAR Lab’s Experience
10:30	10:45	Coffee break
10:45	11:30	Industrial Talk	Mr. YIM Kwok Woon, Jack (Huawei)	Leveraging Advanced Intelligent Technology to Power the Next Wave of Digital Transformation
11:30	12:00	Technical Session 1	Prof. Chao HUANG	Robust and Explainable Spatio-Temporal Graph Learning for Smart City
12:00	12:30	Technical Session 1	Prof. May WANG	Revolutionizing Business Innovation: The Power of Creative AI in Data Science
12:30	14:00	Lunch break
14:00	15:00	Keynote Presentation	Prof. Jianliang XU	Large-scale Geospatial Analytics: Challenges and Opportunities
15:00	15:30	Technical Session 2	Prof. Ming Ming CHIU	Applying Artificial Intelligence & Statistics to Big Data: Automatic Analysis of Conversations
15:30	16:00	Technical Session 2	Prof. Minjia TAN	Integrative proteogenomic and pharmacological characterization of disease samples identifies therapeutic opportunities
16:00	16:30	Technical Session 2	Prof. Yun ZHAO	Dualism in data protection: balancing the right to personal data and the data property right
16:30	16:45	Coffee break
16:45	17:15	Technical Session 3	Prof. Yu LU	AI-Driven Research and Practice in Education
17:15	17:45	Technical Session 3	Prof. Tianji CAI	Modelling municipal solid waste generation in Macau: A synthetic approach
17:45	18:15	Technical Session 3	Prof. Terence POON	Opportunities and Pitfalls in Application of High-Throughput Proteomics and Metabolomics Technologies to Disease Marker Discovery
18:15	19:45	Dinner for invited guests

Keynotes

Reynold CHENG 鄭振剛

Professor, The University of Hong Kong

Prof. Cheng received his MSc and PhD degrees from the Department of Computer Science at Purdue University in 2003 and 2005, respectively. He is a Professor in the Department of Computer Science at the University of Hong Kong (HKU). His research interests are in data science, big graph analytics and uncertain data management. He was an Assistant Professor in the Department of Computing at the Hong Kong Polytechnic University (HKPU) from 2005 to 2008.

Prof. Cheng received the SIGMOD Research Highlights Award 2020, HKICT Awards 2021, and HKU Knowledge Exchange Award (Engineering) 2021. He was granted an Outstanding Young Researcher Award 2011-12 by HKU. He received the Universitas 21 Fellowship in 2011, and two Performance Awards from HKPU Computing in 2006 and 2007. He is an academic advisor to the College of Professional and Continuing Education of HKPU. He is a member of IEEE, ACM, ACM SIGMOD, and UPE. He was a PC co-chair of IEEE ICDE 2021, and has served on the program committees and review panels of leading database conferences and journals such as SIGMOD, VLDB, ICDE, KDD, IJCAI, AAAI, and TODS. He is on the editorial boards of IS and DAPD, and is a former editorial board member of TKDE.

Talk Title: Data Science for Social Goods: STAR Lab’s Experience

Talk Abstract

In many metropolitan cities, there is a lack of manpower in social care. In Hong Kong, for example, elderly care homes report a 70% shortage of employees. To alleviate these issues, there has recently been much attention on “data science for social goods”, or the use of technologies to enhance service quality and streamline the administrative work of social workers. In this talk, I will discuss how the HKU STAR (Social Technology And Research) Lab uses data science technologies to support elderly and family care services. I will first introduce HINCare, a software platform that supports volunteering and cultivates a mutual-help culture in the community. HINCare uses the HIN (Heterogeneous Information Network) to recommend helpers to elders or other service recipients, and is now supporting 14 NGOs and 7,000 users. I will also discuss our collaboration with the Hong Kong Jockey Club Charities Trust to develop a novel case management and data analysis system for 40% of the family care centers in Hong Kong. These projects have received an HKICT Award, Asia Smart App Awards, and HKU Faculty Knowledge Exchange Awards.

Jianliang XU 徐建良

Chair Professor, Hong Kong Baptist University

Jianliang Xu is the Head and Chair Professor of the Department of Computer Science at Hong Kong Baptist University, where he leads the Database Research Group. His current research interests include big data management, data security & privacy, and blockchain technology. With an h-index of 56, he has published more than 250 technical papers in these areas, most of which appeared in leading journals and conferences such as SIGMOD, VLDB, ICDE, TKDE, and VLDBJ. He is listed among the World’s Top 2% Scientists by Stanford University. He has served as a PC Chair for a number of major international conferences and as an Associate Editor for several esteemed international journals, including IEEE Transactions on Knowledge & Data Engineering and Proceedings of the VLDB Endowment.

Talk Title: Large-scale Geospatial Analytics: Challenges and Opportunities

Talk Abstract

Geospatial analytics has emerged as a crucial tool in diverse fields such as crime science, transportation science, epidemiology, ecology, and urban planning. However, as big geospatial data continues to grow exponentially, conventional geospatial analytics tools encounter efficiency issues when handling large-scale datasets. This poses significant challenges and concerns for domain experts, underscoring the pressing need for more efficient approaches. In this talk, we will delve into the challenges and problems associated with large-scale geospatial analytics. We will share our recent efforts to develop efficient geospatial analytics tools, including kernel density visualization for hotspot detection and K-function for correlation analysis. We will also highlight future opportunities and potential advancements in this field.

Invited Talks

Ming Ming CHIU 趙明明

Chair Professor, The Education University of Hong Kong

Professor Chiu obtained his bachelor’s degree in Computer Science at Columbia University, his master’s degree in education at Harvard University and his doctorate at the University of California, Berkeley.

Professor Chiu is an analytics expert, integrating statistics, computer science and mathematics to develop theories and test them in diverse fields, including education, psychology, sociology, linguistics, criminology, economics, and management.

Talk Title: Applying Artificial Intelligence & Statistics to Big Data: Automatic Analysis of Conversations

Talk Abstract

As people solve problems together that they cannot solve alone, automatic analysis of conversations can inform and enhance their design to aid learning and teaching. Such analyses must traverse the obstacle course of voice transcription, complex categorization, and statistical analysis. Automated transcription feeds automatic categorization via computational linguistics to create a database (Big Data). Automated statistical analysis integrates statistical discourse analysis (SDA) and artificial intelligence. SDA models the effects of (a) pivotal actions that radically change subsequent processes and (b) explanatory variables at multiple levels (sequences of turns/messages, time periods, individuals, groups, organizations, etc.) on multiple target actions. The artificial intelligence expert system translates my theory into a statistical model, tests it on the data, interprets the results (rewriting itself, if needed, to execute revised analyses), and prints a table of results. I showcase automated SDA on 321,867 words in 1,330 messages by 17 student-teachers in 13 weekly online discussions of lesson designs.

Tianji CAI 蔡天驥

Professor, The University of Macau

Professor Cai received his PhD and Master’s degrees from the University of North Carolina at Chapel Hill in 2010.

His research interests concentrate on new forms of data and new methods of analysis. Reflecting his broad intellectual pursuits, his research topics are diverse, ranging from methodological ones, such as quantitative methods and data mining, to substantive ones, such as gene-environment interplay and adolescent health behaviors.

Talk Title: Modelling municipal solid waste generation in Macau: A synthetic approach

Talk Abstract

With the development of urbanization during the past few decades, Municipal Solid Wastes (MSW) generated in urban areas have posed serious challenges to the local environment in many cities around the world, such as Macau. Residential and touristic activities are the two primary sources of solid waste in Macau; however, it is still unclear how and which indicators of these two could be used to predict solid waste generation.

Taking advantage of the data published by government agencies in Macau from 2010 to 2021, our study explored the possibility of using publicly available data to predict the trend of monthly MSW generation in Macau. In particular, we adopted a machine learning strategy and compared six predictive models that implement both conventional and state-of-the-art machine learning methods.

Our results indicate that the Generalized Additive Model outperformed the rest of the models in predicting MSW generation in Macau and is more suitable when data present strong seasonality and potential domain shift. Consistent with previous literature, both population-level and household-level factors contributed to MSW generation, including population density, income level, household energy consumption, and household size. In addition, our results support that tourism activities, especially tourists’ spending on non-traded goods and services, positively affected MSW generation.

Minjia TAN 譚敏佳

Professor, Shanghai Institute of Materia Medica, Chinese Academy of Sciences

Dr. Minjia Tan is currently a professor at the Shanghai Institute of Materia Medica, Chinese Academy of Sciences. Dr. Tan’s interest is focused on the development of mass spectrometry-based proteomics technologies to characterize the roles of protein post-translational modifications (PTMs) in physiology and diseases. His previous work characterized new roles of several types of PTMs in cancer and metabolic disorders and demonstrated the therapeutic potential of targeting PTM crosstalk in cancer therapies. He has co-authored more than 100 refereed publications (>12,000 citations, h-index 40), including senior-author papers in Cell (2018, 2020), Cell Metabolism (2021), Molecular Cell (2021, 2022), and Nature Chemistry (2023).

Talk Title: Integrative proteogenomic and pharmacological characterization of disease samples identifies therapeutic opportunities

Talk Abstract

Proteins and post-translational modifications (PTMs) play fundamental roles in disease development. Yet current understanding of the cancer proteome and of therapeutic approaches is still limited. We conducted comprehensive whole-exome and transcriptome sequencing, proteomic, and phosphoproteomic analyses of several types of clinical tumor samples and cancer cell lines, including lung adenocarcinoma, KRAS mutant cancer and acute myeloid leukaemia. Integrative multi-omics analysis identified the molecular heterogeneity of these cancers, which is closely related to clinical characteristics. Large-scale ex vivo drug response profiling, together with the integration of publicly available drug sensitivity information, identified a panel of drug combinations that showed therapeutic potential against certain subsets of these cancers.

Yun ZHAO 趙雲

Professor, University of Hong Kong

Dr. Zhao obtained his PhD from Erasmus University Rotterdam, an LLM from Leiden University and an LLM & LLB from China University of Political Science and Law.

Dr. Zhao is currently a Standing Council Member of the Chinese Society of International Law; a Council Member of the Chinese Law Society; a Council Member of the Zhuhai International Court of Arbitration; and a Chair Professor at Xiamen University (2020-2023). He is listed as an arbitrator in several international arbitration commissions. He sits on the editorial teams of several academic journals and has published widely on various topics including, in particular, Dispute Resolution and Space Law.

Talk Title: Dualism in data protection: balancing the right to personal data and the data property right

Talk Abstract

The presentation explores the issues of the right to personal data and data property rights in the context of commercial transactions involving big data. The analysis herein attempts to break through the traditional concept of ‘property’ with the aim of helping to develop a modern information society in China, devise a property law theory suitable for the big data era, and improve the level of protection afforded rights and legitimate interests in data. To date, no comprehensive study has focused on developing a proper understanding of the concept of ‘data property rights’, and hence we lack the solid theoretical support needed for the construction of a proper protective system for such rights. The presentation examines the rules pertaining to data property rights, thereby enriching the theory of data property rights. It also offers a thorough discussion of how to construct a data property protection system.

Yu LU 盧宇

Associate Professor, Beijing Normal University

Professor Lu received his Ph.D. degree in computer engineering from the National University of Singapore, and his B.S. degree from Beijing University of Aeronautics and Astronautics (Beihang University).

His current research interests sit at the intersection of artificial intelligence and educational technology, including learner modeling, educational robotics, intelligent tutoring systems and educational data mining.

Talk Title: AI-Driven Research and Practice in Education

Talk Abstract

Artificial intelligence (AI) is the technology that enables machines to think or act in a more “human” way to perform tasks that normally require human intelligence. Driven by the fast advancement of AI technologies (e.g., machine learning and generative AI), researchers and practitioners in education utilize AI as a powerful tool to support teaching and learning. In this talk, I will talk about how AI influences education from multiple perspectives, including the ongoing AI-driven research and practice for the large number of students and teachers. I will also discuss the key research issues and promising future of the AI in education (AIED) domain.

Terence POON 潘全威

Associate Professor, The University of Macau

Dr. Terence C.W. Poon is an Associate Professor at the University of Macau. He obtained his PhD degree in Pathological Sciences from the Chinese University of Hong Kong and his M.Sc. degree in Bioinformatics from the University of Manchester, UK.

His research focuses on clinical proteomics, mass spectrometry, and disease marker discovery. He has published 82 articles in SCI journals, such as Gastroenterology, Clinical Chemistry, and Clinical and Translational Medicine. Currently, he is the President of the Asia Oceania Human Proteome Organization and the Vice-President of the China Human Proteome Organization. He is the associate editor of the journal Clinical Proteomics.

Talk Title: Opportunities and Pitfalls in Application of High-Throughput Proteomics and Metabolomics Technologies to Disease Marker Discovery

Talk Abstract

Discovery of reliable clinical biomarkers for classifying patients into subgroups for receiving different effective treatments is crucial in precision medicine. Since 2000, various proteomics and metabolomics approaches have been applied to the discovery of biomarkers for risk assessment, prediction, screening, diagnosis, monitoring, or prognosis of human diseases as well as for predicting or monitoring response to therapy. Despite more than 20 years of global research efforts, biomarkers from only one proteomics study have been successfully translated into an FDA-approved test in the United States. To a certain extent, this reflects the difficulty of finding “genuinely useful biomarkers for clinical practice”. Collection of biological specimens from study groups is the first step in biomarker studies. The composition of the study groups and the specimen collection time points depend on the study design. Single-center case-control design and single-center cohort design are the most commonly used study designs for biomarker discovery. However, these two simple study designs could easily result in false biomarkers that are only associated with hidden or known confounding factors. With a proper study design, this pitfall can be avoided, leading to the discovery of reliable biomarkers.

May WANG 王莹

Associate Professor, Beijing Normal University – Hong Kong Baptist University United International College

Professor Wang received her Ph.D. from the School of Business at The University of Hong Kong and her M.Phil. in Systems Engineering and Engineering Management from The Chinese University of Hong Kong.

Her research interests include big data in e-commerce, FinTech, recommendation agents, and human-computer interaction. Her work has been published in Information Systems Research and Management Science, among others.

Talk Title: Revolutionizing Business Innovation: The Power of Creative AI in Data Science

Talk Abstract

Generative AI (GenAI) has been transforming the landscape of business. This talk explores the transformative potential of generative artificial intelligence in revolutionizing business strategies and practices. With specific examples in marketing applications, GenAI drives greater effectiveness in the customer experience through hyper-personalization, growth through the right analytics and automation, and productivity. The talk presents a comprehensive review and analysis of the evolution of marketing technologies in the context of AI advancements, revealing a three-stage trajectory from a human-centric pre-AI era, through a data-driven post-AI era, to the dynamic and creative post-GPT era. It further identifies critical gaps in current marketing practices and research, particularly around GenAI’s impacts across various business strategies and behaviors. It highlights the need for mindful adoption of GenAI in business practice and the development of business research on GenAI. While generative AI tools present promising opportunities for enhancing consumer engagement, personalizing shopping experiences, automating business analytics, and creating dynamic content, their utilization must be ethical, informed, and considerate of demographic-specific implications. This talk provides valuable insights for future investigations aimed at understanding GenAI’s role in the future of business research, while guiding practitioners towards effective and responsible GenAI integration in their business strategies.

Chao HUANG 黃超

Assistant Professor, University of Hong Kong

Professor Huang received his Ph.D. in Computer Science from the University of Notre Dame in the USA. He is an Assistant Professor in the Department of Computer Science at the University of Hong Kong (HKU).

His research focuses on developing novel machine learning frameworks to tackle various challenges in Data Mining, Information Retrieval, Spatial-Temporal Data Analytics, User Behavior Modeling, Recommendation, Graph Mining, and Deep Representation Learning.

Talk Title: Robust and Explainable Spatio-Temporal Graph Learning for Smart City

Talk Abstract

Advancements in remote sensing technologies and large-scale computing infrastructure have led to an unprecedented volume of spatio-temporal data in various fields, including transportation, environmental science, and public security. Effectively harnessing this diverse range of spatio-temporal data calls for the integration of human-centered machine learning techniques with the rich spatio-temporal information available. However, the efficacy of data-driven methods is heavily dependent on the quality of labeled training data, which cannot always be guaranteed in the era of big spatio-temporal data. In addition, in urban computing applications, it is insufficient to merely develop accurate predictive models. Providing human-readable explanations for predictions is equally vital, especially in interdisciplinary domains. To address these challenges, this presentation aims to introduce research that advances the analysis of large-scale spatio-temporal data by developing robust, interpretable, and expressive spatio-temporal graph learning frameworks.

Industrial Talks

Jack YIM

Cloud Solutions Architect, Huawei

He has extensive experience in the field of Computer & Data Engineering, specifically in the Cloud & AI industry. His expertise spans from research and development to the practical application of AI, with a particular focus on Computer Vision (CV) and Natural Language Processing (NLP). Additionally, he has been actively involved in Cloud Computing, specifically in the area of Application Modernization.

Talk Title: Leveraging Advanced Intelligent Technology to Power the Next Wave of Digital Transformation

Talk Abstract

Jack Yim is a Cloud Solutions Architect at Huawei, experienced in the research and development of artificial intelligence, data analysis and cloud computing. He will share trends and modern developments in intelligent technology for AI and big data.

Committee

Chair

Prof. Leong Hou U, FST

余亮豪教授

Members

Prof. Dingqi YANG, FST

楊丁奇教授

Prof. Cheris Wing Chi CHOW, FBA

周詠芝教授

Prof. Henry Chun Kwok LEI, FBA

李振國教授

Prof. Muruga Perumal RAMASWAMY, FLL

Prof. Chun Wai POON, FHS

潘全威教授

Prof. Barry Lee REYNOLDS, FED

雷貝利教授

Prof. Tianji CAI, FSS

蔡天驥教授

Venue

Date: 4 September 2023

Address: Ground floor Function Hall, N1 Building – University of Macau

Map: UM-Map

About UM

About CDS (UM)
