About
At Yahoo, my primary focus has been on enhancing customer experiences through trusted…
Activity
-
FreeWheel is #hiring data engineers! Our platform processes billions of ad events daily, delivers real-time and batch data, and provides various…
FreeWheel is #hiring data engineers! Our platform processes billions of ad events daily, delivers real-time and batch data, and provides various…
Liked by xin zhang
-
We are actively looking for a skilled C++ Software Engineer with a strong background in robotics to join our Core Behavior Simulation team! This role…
We are actively looking for a skilled C++ Software Engineer with a strong background in robotics to join our Core Behavior Simulation team! This role…
Liked by xin zhang
-
🔎 Open to New Opportunities: Senior Software Engineer | Backend | Seattle or Remote With 9+ years of experience in software development (2 years of…
🔎 Open to New Opportunities: Senior Software Engineer | Backend | Seattle or Remote With 9+ years of experience in software development (2 years of…
Posted by xin zhang
Experience
Education
-
Carnegie Mellon University
-
-
My major is Computational Data Science.
Course:
Cloud computing, Storage System, database application, data system seminar, Advanced Database systems, Advanced cloud computing, Multimedia database and data mining, Machine Learning -
-
-
-
-
-
Publications
-
Non-blind deblurring of structured images with geometric deformation
The Visual Computer
-
Multiple Geometry Transform EstimaMultiple Geometry Transform Estimation from Single Camera-captured Text Imagetion from Single Camera-captured Text Image
International Conference on Document Analysis and Recognition
-
Rectification of Optical Characters as Transform Invariant Low-rank Textures
International Conference on Document Analysis and Recognition
-
Blind non-uniform deblur under projection motion path
Journal of Electronic Imaging
-
Fast Low-Rank Subspace Segmentation
Transactions on Knowledge and Data Engineering
-
Rotation-Invariant Features for Multi-Oriented Text Detection in Natural Images
Other authors -
-
Transform invariant text extraction
The Visual Computer
-
Circle Text Expansion as Low-Rank Textures
International Conference on Document Analysis and Recognition
-
Pulse Coupled Neural Network Edge-Based Algorithm for Image Text Locating
Tsinghua Science & Technology
-
A Combined Algorithm for Video Text Extraction
7th Internationa7th International Conference on Fuzzy Systems and Knowledge Discoveryl Conference on Fuzzy Systems and Knowledge Discovery
Patents
-
Rectification of characters and text as transform invariant low-rank textures
Issued US US 20120134588 A1
Other inventors -
Projects
-
Convolutional Neural Networks for handwritten character recognition
Implement lenet version of Convolutional Neural Networks for handwritten character recognition algorithms.
1) Implemented lenet version of CNN algorithm.
2) Tested the algorithm on MINST dataset and achieve 98% recongition rate. -
Latch-free Skip List in the Peloton database management system
Implemented a latch-free Skip List in the Peloton database management system: implement a multi-thread skip list in Peloton database with concurrency control in a modern, order-preserving indexing data structure. This projects add another latch free index for Peloton database management system in addition to BW+ tree. This skip list index support insert, delete, scan, backward scan, garbage collection functions.
1) Implemented latch free index for Peloton database using skip list
2)…Implemented a latch-free Skip List in the Peloton database management system: implement a multi-thread skip list in Peloton database with concurrency control in a modern, order-preserving indexing data structure. This projects add another latch free index for Peloton database management system in addition to BW+ tree. This skip list index support insert, delete, scan, backward scan, garbage collection functions.
1) Implemented latch free index for Peloton database using skip list
2) Implemented garbage collection for skip list which can recycle the logical deleted skip list node.
3) Added test cases for Peloton database. -
Developing CloudFS using the FUSE framework
Implement CloudFS using C as user-level code that uses libfuse to service file
system calls of applications attempting to access the SSD.
1) Implemented the CloudFS system which is a hybrid file system with two different storage components: a local SSD and a cloud storage service similar to Amazon S3.
2) Implemented de-duplicate algorithms to reduce cloud usage.
3) Implemented snapshot to help user to recover certain snapshot. Also use cache algorithm to further reduce the cloud…Implement CloudFS using C as user-level code that uses libfuse to service file
system calls of applications attempting to access the SSD.
1) Implemented the CloudFS system which is a hybrid file system with two different storage components: a local SSD and a cloud storage service similar to Amazon S3.
2) Implemented de-duplicate algorithms to reduce cloud usage.
3) Implemented snapshot to help user to recover certain snapshot. Also use cache algorithm to further reduce the cloud usage. -
Twitter Analysis Web service
· Course project of Cloud Computing
· Developed a web service on AWS using VertX as server, and use MySQL and HBase as database to handle different queries.
· Wrote programs in Hadoop MapReduce to extract, transform, and analyse a 1TB twitter dataset
· Used sharding and in-memory caching to accelerate query processingOther creators -
Flash Translation Layer for SSD
Designed and developed a Flash Translation Layer for SSD Simulator. This Flash Translation Layer can do address translation, garbage collection, wear leveling optimization.
-
On-Line Hockey Game
-
In this project, we implemented an on-line hockey video game using django, javascripts. This on-line hockey game support enter game lobby, select team, room management and hockey. This game support real time updated of both the statistic of lobby information and hockey game and game recovery when one of the user is off-line. I am mainly focus on implement the backend of lobby room part, which include user login, use logout, enter room, leave room, select team, start game, show the real time…
In this project, we implemented an on-line hockey video game using django, javascripts. This on-line hockey game support enter game lobby, select team, room management and hockey. This game support real time updated of both the statistic of lobby information and hockey game and game recovery when one of the user is off-line. I am mainly focus on implement the backend of lobby room part, which include user login, use logout, enter room, leave room, select team, start game, show the real time information of lobby.
Other creators -
Intelligent Scheduling on YARN
-
• Deployed a VM-based YARN cluster. Installed YARN and modified scheduling policy server
to schedule the jobs that were submitted to the cluster. Implemented the FIFO-random, FIFO Heterogeneity awareness, Shortest Job Heterogeneity awareness scheduling policy.
• Proposed job utility awareness scheduling policy which consider preference of GPU job and MPI job and group job utility.
-
Peloton constraint function
-
See projectIn this project, we add the constraint function for memory based Peloton database. In this project, we add constraint check from scratch. The goal of this project is make peloton has the same constraint function as postgres. The constraint function include: primary key, unique, default, not null, check and foreign key constrain check. This constrain check function not only support single column check but also multiple constrain checks. We did following things:
1) Implemented single column…In this project, we add the constraint function for memory based Peloton database. In this project, we add constraint check from scratch. The goal of this project is make peloton has the same constraint function as postgres. The constraint function include: primary key, unique, default, not null, check and foreign key constrain check. This constrain check function not only support single column check but also multiple constrain checks. We did following things:
1) Implemented single column constraint check: including not null, primary, unique, check and foreign key.
2) Implemented multi columns constraint check: including not null, primary, unique, check and foreign key.
3) Added test cases for constraint check for peloton.
4) Added function for cascade foreign key delete function
5) Added Alter table function for peloton. -
D-Cube algorithm to find dense block in high dimension dataset
-
In this project, we implement D-Cube algorithm to detect dense block in dataset with high dimension. Different from java version of original D-Cube algorithm which make use of cloud, we run the experiment on single machine with the assumption that not all inner result can be fit into memory. D-cube algorithm can detect suspicious cluster in dataset and very useful for network attack detection, etc.
1) Implement the D-Cube algorithm in python and postgresql.
2) Do the experiments on five…In this project, we implement D-Cube algorithm to detect dense block in dataset with high dimension. Different from java version of original D-Cube algorithm which make use of cloud, we run the experiment on single machine with the assumption that not all inner result can be fit into memory. D-cube algorithm can detect suspicious cluster in dataset and very useful for network attack detection, etc.
1) Implement the D-Cube algorithm in python and postgresql.
2) Do the experiments on five popular real datasets: Amazon review, Yelp review, english wikipedia revision history, DAPRA dataset, AirForce dataset and show the efficiency of implemented algorithm.Other creators
Honors & Awards
-
Google China Anita Borg Scholarship
Google
-
Google China Anita Borg Scholarship
Google
-
Excellent Graduate in Xidian University
Xidian University
-
National Scholarship
Chinese government
Languages
-
English
Professional working proficiency
-
Chinese
Native or bilingual proficiency
More activity by xin
-
My team Rufus is hiring Applied Scientists! If you are interested, please let me know :) Rufus: https://lnkd.in/gMiXpdvV Job…
My team Rufus is hiring Applied Scientists! If you are interested, please let me know :) Rufus: https://lnkd.in/gMiXpdvV Job…
Liked by xin zhang
-
After one year and a half working at Amazon Web Services (AWS), I decided to embark on a new journey in my career. I will join Google as a Software…
After one year and a half working at Amazon Web Services (AWS), I decided to embark on a new journey in my career. I will join Google as a Software…
Liked by xin zhang
-
Excited to announce that we raised a $17M Series A! We are hiring software engineers in Bay Area. DM me if you want to learn more about opportunities…
Excited to announce that we raised a $17M Series A! We are hiring software engineers in Bay Area. DM me if you want to learn more about opportunities…
Liked by xin zhang
-
Hiring a Senior Data DevOps engineer https://lnkd.in/du5CPn2
Hiring a Senior Data DevOps engineer https://lnkd.in/du5CPn2
Liked by xin zhang
-
FreeWheel Job Spotlight: Senior Product Marketing Manager, Programmatic Marketplaces based in NYC. We are on a mission to shape the future of…
FreeWheel Job Spotlight: Senior Product Marketing Manager, Programmatic Marketplaces based in NYC. We are on a mission to shape the future of…
Liked by xin zhang
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top content