Guanqun Yang 杨冠群Ph.D. Candidate in Computer Science @ Stevens Institute of Technology
I am currently working towards building NLP systems that are well-tested, secure against adversaries, and interpretable for domain specifications. Previously my projects were mainly focused on statistical machine learning and its application in speech signal processing, recommendation system, and text mining. I am also interested in optimization and graph structures in a real-world setting. I also spent plenty of time on both lower-level design and fabrication and higher-level control and pose estimation of mobile robots. Statistical Machine Learning and its ApplicationsStatistical machine learning is a process of incrementally improving the performance of computer systems under specific metrics by employing statistical techniques but without prior explicit programming. The core of (supervised) machine learning is to find patterns of given data and apply this pattern to previously unseen data and thereby giving predictions of them. Because of the universal needs for pattern recognition, machine learning algorithms are utilized to a wide range of areas, including art, humanities, economics, and many others. For example, neural style transfer in arts, KuroNet in ancient Japanese identification, and algorithmic trade. Fairness-Preserving Machine LearningGiven the availability of colossal dataset and ever-increasing computing power, the responsibility of decision-making is gradually shifted to algorithms. From something as small as distributing free admissions to a movie premiere to more consequential ones like granting an automobile loan, these systems make decisions that affect people's life without raising their awareness. However, many recent cases exemplify that algorithmic decision-making systems could inherit and amplify the bias encoded in data. Application Tracking Systems (ATS) used by Amazon show gender bias against female when evaluating job applicants’ resumes, making equally qualified women fail to get the interview opportunity while men could. Credit card approval systems wrongly associate race and gender information with whether or not the applicant would default, creating significant racial and gender disparity in the final approval statistics. Commerical facial recognition systems embody as much as 30% performance degradation when the subject has darker skin, which is often associated with race.
Autonomous Motion Planning by Deep Reinforcement Learning for Fall Prevention in Hospitals
Speech Segment Identification for Person Recognition
Popularity Analysis of Twitter Hashtags for SuperBowl 2015
Graph Structure in Real World ProblemsThe elements of the graph (vertices, edges, and weights associated with edges) make it suitable for representing relations and result in many applications of graph structures in social networks, transportation, and many others. The following are two of my projects that explore the graph structure in cooperative relations in the entertainment industry and general property of public transportation. Cooperative Relation between Movie Actors/ActressesUber Movement in San FranciscoThis project aims to analyze the graph structure of more than 1.6 million traveling data in San Francisco area during December 2017. Major components (vertices, edges, and weights) of the graph are first extracted from metadata, and they are then used to generate the graph, find GCC, and finally converted to a legal undirected simple weighted graph. Multiple graph structures are explored in the resulting graph including MST and maximum flow. What is more, an approximate algorithm is attempted to solve the traveling salesman problem in this graph. The metadata is available here. Convex OptimizationConvex optimization is the backbone for solving problems arising from areas such as machine learning, control, estimation and signal processing, and even finance. Many machine learning algorithms gain their popularity because of the underlying convexity of their formulation, including Linear and Logistic regression, SVM (Support Vector Machine), and others. Even though convex optimization is largely seen as a technology, which shows the maturity of many existing algorithms, in order to better understand the mechanism of this process, a SVM classifier is implemented from stratch with the help of CVX. A Generic Linear Classifier Implementation for Image RecognitionLinear SVM is a popular linear classification algorithm extensively used in many applications, including the classification of images, texts, and even chemical compounds. Depending on number of classes to be classified, the tasks are categorized into binary and multiclass classification problems. The solution of multiclass classification is made possible by numerous strategies, including transforming into binary classification, an extension of binary classification, and hierarchical classification. In this project, we are aiming to classify handwritten digits based on MNIST dataset using the first strategy mentioned above. Suppose we have \(K\) classes of data points to be classified (in our dataset, \(K=10\)), then we have two schemes:
From the description of two different schemes, the OvO scheme should provide a more accurate prediction, which is evident in the figure alongside. Note that the Sampling Ratio means the number of data points used from the original dataset. Previous ProjectsThe following projects are completed during my undergraduate studies at Northeastern University, China. These projects are mainly focused on the design and implementation of robotics control systems. Pose Estimation of Mobile Robots Based on the Integration of IMU and Vision
SVPWM Controller Implementation for 3-Phase Asynchronous MotorThe three-phase asynchronous motor features non-linearity in its mathematical model and tight electromagnet coupling, and this makes this family of motors is hard to control effectively by solely using regular PID controllers. Space vector pulse width modulation (SVPWM) algorithm manages to generates sinusoid waves by simple ON/OFF operations of power electronic devices like IGBT. Incorporating PID controllers into the system, this project manages to implement a double loop SVPWM governing system in C++ on TMS320F2812 DSP by Texas Instruments. The 3-phase asynchronous motor in the SVPWM system could be governed like the AC motor with high precision and low latency. Design and Implementation of a Flapping Propulsion Plant for Underwater RobotSome underwater tasks like seabed exploration require relatively low velocity but high flexibility, and this makes the traditional screw propulsion solution no more applicable. In this project, we design and implement a bionic propulsion system inspired by sea turtles, where flipper could move in four degrees of freedom (elevating/heaving and walking/surging), and therefore make possible low-velocity underwater maneuvering. This propulsion system consists of two cylinders, where each controls two degrees of freedom, and the flipper is governed by AC motor in the outer cylinder, which is in turn controlled by another AC motor fixed in the inner cylinder. Based on the prototype of our design, multiple key parameters were identified experimentally and analytically for the system model. Furthermore, different control algorithms were employed for the rapid response of the mechanical structure of the prototype. Specifically, the deadbeat and ripple-free algorithm showed the best dynamic and static characteristics. The prototype of this propulsion system indicated its flexibility and extensibility for underwater exploration in the low-speed setting.
|