On many occasions we have textual labels in structured data. The case
considered in this paper is the industry designation of companies. While there
are standards governing industry designation, its use is found to be
arbitrary.
[more]
Pimbley (2016) Better measurements for CLO equity performance
A short paper on how to evaluate CLO equity performance. As the equity tranche of a CLO has a maturity date, we can treat it as a bond with non-deterministic coupons. So it is natural to use IRR as a measure of equity performance. IRR is the solution for \(r\) in...
[more]
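The excerpt above breaks off before the equation, so the paper's exact form is not shown here. As a rough sketch only, assuming the textbook definition of IRR (the rate \(r\) at which the net present value of a cash-flow stream is zero), the calculation might look like the following. The function names `npv` and `irr`, the bisection bracket, and the cash flows are all hypothetical illustrations, not taken from Pimbley (2016).

```python
def npv(rate, cashflows):
    """Net present value, where cashflows[t] arrives at the end of period t."""
    return sum(cf / (1.0 + rate) ** t for t, cf in enumerate(cashflows))


def irr(cashflows, lo=-0.99, hi=10.0, tol=1e-10, max_iter=200):
    """Solve npv(r) = 0 by bisection.

    Assumes NPV changes sign exactly once on [lo, hi]; this holds for a
    single outflow followed by inflows, but not for every cash-flow pattern.
    """
    f_lo, f_hi = npv(lo, cashflows), npv(hi, cashflows)
    if f_lo * f_hi > 0:
        raise ValueError("NPV does not change sign on the bracket")
    for _ in range(max_iter):
        mid = (lo + hi) / 2.0
        f_mid = npv(mid, cashflows)
        if abs(f_mid) < tol or (hi - lo) < tol:
            return mid
        if f_lo * f_mid < 0:
            hi = mid            # root lies in [lo, mid]
        else:
            lo, f_lo = mid, f_mid  # root lies in [mid, hi]
    return (lo + hi) / 2.0


# Hypothetical stream: pay 100 today, receive 10 per period, then 110 at maturity.
flows = [-100.0, 10.0, 10.0, 110.0]
rate = irr(flows)
```

For an equity tranche the inflows are uncertain, so in practice the cash flows fed to such a routine would be realized or projected distributions rather than fixed coupons.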
Numbers for machine learning
How much data is enough? This has long been the question for any statistical exercise, such as experiments, simulations, and surveys. Nowadays, it is also the question for machine learning.
[more]
Sholler et al. (2019) Ten simple rules for helping newcomers become contributors to open projects
Community: common goal, mutually engaged, shared resource
[more]
Parsing Cantopop titles by machine learning
If you search for “cantopop” on YouTube, you will find loads of music videos. Each video bears a title. Below are some examples:
[more]