Skip to content

Commit 7016799

Browse files
authored
Introduction to financial applications of DS
A brief introduction to financial introduction to Data Science has been added. More details will be added as more analysis and techniques are learnt upon financial data.
1 parent e3c8066 commit 7016799

File tree

1 file changed

+40
-0
lines changed

1 file changed

+40
-0
lines changed
Lines changed: 40 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,40 @@
1+
# Applications in Financial Services
2+
3+
## Traditional Sources
4+
5+
Traditionally, the following indicators have been used as data for finance:
6+
7+
1. Macro-Economic Indicators: GDP, Turnover etc.
8+
2. Performance of investments over time: How well the stocks did?
9+
3. Records of behavior of clients: If they paid on time or not
10+
11+
Although data science takes this a step further, because new sources can be now involved in the decision making process:
12+
13+
1. Unstructured text: Tweets, natural language records etc.
14+
2. Sequence Data: How things change over time if sequence 1 is replaced with sequence 2.
15+
3. Biometric markers: How
16+
4. Network data: Social relationships and insights into the records
17+
18+
## Decisions that need to be made
19+
20+
1. Acceptable level of risk
21+
2. False positives and false negatives
22+
3. Sources of your data: How reliable are these?
23+
4. Importance of real-time analysis (because it would impact the range of algorithms that we can use)
24+
5. Importance of interpretability
25+
26+
## Common ML Algorithms used with Financial Data
27+
28+
1. KNN
29+
2. Regression
30+
3. Decision Trees and RFs
31+
4. ANNs
32+
5. Deep Learning
33+
34+
## Potential Problems
35+
36+
1. Flash crash: The process of having a large amount of change in the value of a security in very brief period of time is called a Flash Crash. The most recent one was when the value of Ethereum went from $319 to 10 cents in 2017<sup>[1](https://www.cnbc.com/2017/06/22/ethereum-price-crash-10-cents-gdax-exchange-after-multimillion-dollar-trade.html)</sup>.
37+
2. Relevant data missing: The models are certainly not going to work if they have not been provided with all the information that is required to predict a particular value.
38+
3. Overfitting
39+
4. Black Box Models: It is very difficult to explain models that could give very impressive results to a non-technical person.
40+
5. Unintentionally using restricted information: Things like race, gender etc. are often included in the application while training it, and are restricted in many countries from being used for any financial purposes because of racist biases that might creep in wuith them into the algorithm

0 commit comments

Comments
 (0)