Read chapters 6.1 - 6.4 of Chen's thesis and chapters 1 - 4.1 of the foundational work in this field, the deep Galerkin method (DGM), using http://utstat.toronto.edu/~ali/papers/PDEandDeepLearning.pdf as supplementary material.
Corr =
Read Chen's thesis in detail. Read the Longstaff-Schwartz paper. Learned the basics of PyTorch and Keras.
During delta hedging, we need the delta at each time step for the current stock price. We can obtain it either by rerunning the MC simulation or by interpolating the results of a previous MC run.
Implemented the Longstaff-Schwartz method. Read Chen's thesis in great detail.
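The interpolation option can be sketched as follows. This is a minimal illustration, not the implementation used in these notes: the Black-Scholes closed-form delta stands in for deltas estimated by an earlier MC run, and the grid, parameters, and the helper name `interpolated_delta` are all hypothetical.

```python
import math
import numpy as np

def bs_call_delta(s, k, r, sigma, tau):
    """Black-Scholes call delta N(d1); a stand-in for MC-estimated deltas."""
    d1 = (math.log(s / k) + (r + 0.5 * sigma**2) * tau) / (sigma * math.sqrt(tau))
    return 0.5 * (1.0 + math.erf(d1 / math.sqrt(2.0)))

# Hypothetical contract parameters.
K, R, SIGMA, TAU = 100.0, 0.05, 0.2, 0.5

# Deltas stored on a grid of stock prices (assumed to come from a previous run).
s_grid = np.linspace(50.0, 150.0, 101)
delta_grid = np.array([bs_call_delta(s, K, R, SIGMA, TAU) for s in s_grid])

def interpolated_delta(s_now):
    """Look up the delta at the current stock price by linear interpolation."""
    return float(np.interp(s_now, s_grid, delta_grid))

delta_interp = interpolated_delta(100.5)                # off-grid stock price
delta_exact = bs_call_delta(100.5, K, R, SIGMA, TAU)    # reference value
```

In a hedging loop, one such grid per time step would be needed, since delta also depends on the time to maturity.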
The F1 score is a good performance indicator when the class proportions are skewed (for example, when true negatives greatly outnumber the positive cases), because it does not count true negatives.
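A small worked example, with made-up confusion-matrix counts, shows how accuracy can look excellent on skewed data while the F1 score does not:

```python
def f1_score(tp, fp, fn):
    """Harmonic mean of precision and recall; true negatives do not enter."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Hypothetical counts: only 12 positives among 1000 samples.
tp, fp, fn, tn = 8, 2, 4, 986

accuracy = (tp + tn) / (tp + fp + fn + tn)  # dominated by the true negatives
f1 = f1_score(tp, fp, fn)                   # equals 2*tp / (2*tp + fp + fn)
```

Here the accuracy is driven almost entirely by the 986 true negatives, whereas the F1 score reflects that a third of the positives were missed.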
where
There are fewer neural networks in a single pricing model.
Implemented antithetic variates and control variates to reduce variance in MC. Learned PCA in detail. Read [Dimension Reduction for the Black-Scholes Equation](https://www.it.uu.se/edu/course/homepage/projektTDB/vt07/Presentationer/Projekt3/Dimension_Reduction_for_the_Black-Scholes_Equation.pdf) to find a potential method for dealing with high-dimensional PDEs.
We need to guarantee that the number of basis functions is much smaller than the sample size. It is also recommended to add a regularization term:
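As an illustration of antithetic variates, the sketch below prices a European call under GBM, pairing each normal draw Z with -Z. The parameters are hypothetical, and the comparison uses the same total number of payoff evaluations for both estimators.

```python
import numpy as np

S0, K, R, SIGMA, T = 100.0, 100.0, 0.05, 0.2, 1.0  # hypothetical parameters

def discounted_call_payoff(z):
    """Discounted call payoff from a standard normal draw via the exact GBM solution."""
    s_t = S0 * np.exp((R - 0.5 * SIGMA**2) * T + SIGMA * np.sqrt(T) * z)
    return np.exp(-R * T) * np.maximum(s_t - K, 0.0)

rng = np.random.default_rng(0)
n_pairs = 100_000
z = rng.standard_normal(n_pairs)

# Antithetic estimator: average the payoffs of each pair (Z, -Z).
pair_avg = 0.5 * (discounted_call_payoff(z) + discounted_call_payoff(-z))
price_antithetic = pair_avg.mean()

# Plain estimator with the same budget of 2 * n_pairs payoff evaluations.
plain = discounted_call_payoff(rng.standard_normal(2 * n_pairs))
price_plain = plain.mean()

# Estimated variances of the two price estimators.
var_antithetic = pair_avg.var(ddof=1) / n_pairs
var_plain = plain.var(ddof=1) / (2 * n_pairs)
```

The variance reduction relies on the payoff being monotone in Z, which makes the two payoffs in each pair negatively correlated.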
Initially we want
which again can be solved by QR or SVD.
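A minimal sketch of solving the regularized least-squares problem by QR or SVD, assuming the regularization is a standard Tikhonov (ridge) penalty, i.e. minimizing ||Ax - b||^2 + lam * ||x||^2. The penalty form, the sizes, and the synthetic data are assumptions made for illustration. Stacking sqrt(lam) * I under A turns the problem into an ordinary least-squares problem.

```python
import numpy as np

rng = np.random.default_rng(0)
n_samples, n_basis, lam = 500, 5, 1e-2  # hypothetical sizes and penalty weight

# Synthetic design matrix (basis functions evaluated at samples) and targets.
A = rng.standard_normal((n_samples, n_basis))
b = A @ np.arange(1.0, n_basis + 1) + 0.1 * rng.standard_normal(n_samples)

# Augmented system: ||A_aug x - b_aug||^2 = ||A x - b||^2 + lam * ||x||^2.
A_aug = np.vstack([A, np.sqrt(lam) * np.eye(n_basis)])
b_aug = np.concatenate([b, np.zeros(n_basis)])

# Solve by QR: A_aug = Q R, then R x = Q^T b_aug.
Q, R_fac = np.linalg.qr(A_aug)
x_qr = np.linalg.solve(R_fac, Q.T @ b_aug)

# Solve by SVD (np.linalg.lstsq uses an SVD-based LAPACK routine).
x_svd, *_ = np.linalg.lstsq(A_aug, b_aug, rcond=None)

# Reference: regularized normal equations (less stable numerically).
x_normal = np.linalg.solve(A.T @ A + lam * np.eye(n_basis), A.T @ b)
```

Working with the augmented matrix avoids forming A^T A, whose condition number is the square of that of A.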
The analytical form of the original solution is
It is more accurate to simulate the stock price with this analytical solution than with the crude Euler method.
This greatly accelerates European option pricing, since we can sample the price at time T directly.
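A sketch of one-step sampling at maturity, assuming the stock follows GBM under the risk-neutral measure so that S_T = S_0 exp((r - sigma^2/2) T + sigma sqrt(T) Z) with Z standard normal. The parameters are hypothetical, and the Black-Scholes closed form is included only as a reference value.

```python
import math
import numpy as np

S0, K, R, SIGMA, T = 100.0, 100.0, 0.05, 0.2, 1.0  # hypothetical parameters

def bs_call(s0, k, r, sigma, t):
    """Black-Scholes European call price, used here as a reference."""
    d1 = (math.log(s0 / k) + (r + 0.5 * sigma**2) * t) / (sigma * math.sqrt(t))
    d2 = d1 - sigma * math.sqrt(t)
    cdf = lambda x: 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))
    return s0 * cdf(d1) - k * math.exp(-r * t) * cdf(d2)

rng = np.random.default_rng(0)
z = rng.standard_normal(200_000)

# One exact step to maturity: no time discretization, hence no Euler bias.
s_T = S0 * np.exp((R - 0.5 * SIGMA**2) * T + SIGMA * math.sqrt(T) * z)
mc_price = math.exp(-R * T) * np.maximum(s_T - K, 0.0).mean()

bs_price = bs_call(S0, K, R, SIGMA, T)
```

Path-dependent payoffs still need intermediate time points, but each step can use the same exact formula with T replaced by the step size.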
On p. 208 of Glasserman, we decompose
Let the pdf of Z be
So
It follows that
The first term on the right is the variance of an estimate of
In the usual problem, the covariance matrix needs to be estimated. In option pricing, however, we assume that a correlation matrix is given. In this case, we can convert the correlation matrix into a covariance matrix by estimating only the variances of the stock prices (TODO: can these be expressed analytically?). This should give a more accurate price.
Implemented the analytical solution to GBM. Implemented importance sampling for European options. Implemented a 1D parabolic PDE solver. Halfway through the implementation of a 2D parabolic solver. Studied PDE changes of variables and martingales.
It can also be used in numerical linear algebra. Suppose we want to solve
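The conversion itself is a rescaling by the standard deviations, cov_ij = corr_ij * sd_i * sd_j. A minimal sketch with made-up numbers (the correlation matrix and standard deviations below are purely illustrative):

```python
import numpy as np

# Hypothetical given correlation matrix for three assets.
corr = np.array([
    [1.0, 0.3, 0.1],
    [0.3, 1.0, 0.5],
    [0.1, 0.5, 1.0],
])

# Hypothetical standard deviations (estimated or computed separately).
std = np.array([0.20, 0.25, 0.30])

# cov[i, j] = corr[i, j] * std[i] * std[j]
cov = corr * np.outer(std, std)

# Going back: divide by the outer product of the standard deviations.
sd_from_cov = np.sqrt(np.diag(cov))
corr_back = cov / np.outer(sd_from_cov, sd_from_cov)
```

Only the d variances need to be estimated instead of all d(d+1)/2 covariance entries.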
Consider the BS equation
Let
Continued implementing the 2D parabolic PDE solver (not yet debugged). Read chapter 3 (probability & information theory), chapter 5 (machine learning basics), and half of chapter 6 (deep feedforward networks). Studied concepts in information theory in depth. Reread chapters 1 - 4.1 of the foundational work in this field, the deep Galerkin method (DGM), using http://utstat.toronto.edu/~ali/papers/PDEandDeepLearning.pdf as supplementary material.
Learned to use TensorFlow. Implemented the deep Galerkin method for the single-asset BS equation. Came up with several ideas for improving DGM. Studied the method of characteristics. Read "Numerical methods for conservation laws"; learned some of the mathematical foundations of conservation laws and the associated numerical difficulties. Learned some numerical methods for Burgers' equation.
Add penalty terms to the loss (1. make delta monotonic; 2. avoid option prices that are negative or greater than the stock price). Sample more points on the BC and IC at the beginning.
Able to get the Greeks on the whole domain. Able to monitor the loss while computing solutions.
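The two penalty ideas can be sketched framework-independently in NumPy. This is an illustration under stated assumptions, not the loss used in the experiments: the option is assumed to be a call (so delta should be non-decreasing in the stock price), the points are assumed sorted by stock price, and the squared-hinge form and the name `shape_penalty` are hypothetical choices.

```python
import numpy as np

def shape_penalty(s, price, delta):
    """Penalty for violating 0 <= price <= s and monotone (non-decreasing) delta.

    s must be sorted in ascending order; price and delta are model outputs at s.
    """
    relu = lambda x: np.maximum(x, 0.0)
    negative_price = np.mean(relu(-price) ** 2)         # price below zero
    above_stock = np.mean(relu(price - s) ** 2)         # price above the stock price
    non_monotone = np.mean(relu(-np.diff(delta)) ** 2)  # delta decreasing in s
    return negative_price + above_stock + non_monotone

s = np.array([1.0, 2.0, 3.0])

# Admissible outputs incur no penalty.
ok = shape_penalty(s, np.array([0.1, 0.5, 1.2]), np.array([0.2, 0.5, 0.8]))

# One negative price and one price above the stock price are penalized.
bad = shape_penalty(s, np.array([-1.0, 0.5, 4.0]), np.array([0.2, 0.5, 0.8]))
```

In a TensorFlow or PyTorch loss, the same terms would be written with the framework's own `relu` and mean operations and added to the PDE residual loss with a weight.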
Made some helper functions for deep learning experiments. Attempts: 1. add another penalty term that requires the first derivative to be always non-positive or always non-negative; 2. include the boundary conditions in the loss function. Learned the properties of the heat equation. Proved the error bound for the deep Galerkin method given the loss on the whole domain.
Explored the maximum principle in depth. Computed the error bound for the n-dimensional inhomogeneous Black-Scholes equation. Learned the basics of the discrete Fourier transform.
Learned the continuous and discrete Fourier transforms and the FFT. Learned about characteristic functions of probability densities. Read "Option valuation using the fast Fourier transform" (Carr & Madan), using "Efficient Options Pricing Using the Fast Fourier Transform" as supplementary material. Implemented the Carr & Madan method for 1D European options.
This gives a constant-coefficient PDE, which is the target of the Fourier time-stepping method. Theoretically, the target function should be defined on all of R so that it is Fourier transformable.
The Fourier transform and the inverse Fourier transform are essentially the same operation. If we apply the inverse transform first, the resulting function can likewise be regarded as living in the frequency domain.
If the characteristic function of the density is known, we can write the Fourier-transformed option price in analytical form. We can then apply the FFT to transform it back to the actual price.
Mathematically, there should be no imaginary part, since the Fourier transform followed by its inverse returns the original option price, which is real. Truncation error may nevertheless produce a small imaginary part, which we simply discard.
Let the dimension be d. Then the complexity is
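For the discrete transform, this symmetry can be checked directly: the inverse DFT is the forward DFT applied to the complex conjugate, conjugated again and divided by N, and the two transforms undo each other in either order. A short check with NumPy on random data:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal(8) + 1j * rng.standard_normal(8)

# Inverse DFT expressed through the forward DFT.
ifft_via_fft = np.conj(np.fft.fft(np.conj(x))) / x.size

# The transforms cancel regardless of which one is applied first.
roundtrip_forward_first = np.fft.ifft(np.fft.fft(x))
roundtrip_inverse_first = np.fft.fft(np.fft.ifft(x))
```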
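The procedure can be sketched for a European call under GBM, following the damped-call formulation of Carr & Madan with Simpson weights. This is a minimal sketch rather than the implementation from these notes; the parameters, the damping factor `ALPHA`, and the grid sizes `N` and `ETA` are illustrative assumptions. Only the real part of the FFT output is kept, as described above.

```python
import math
import numpy as np

S0, K, R, SIGMA, T = 100.0, 100.0, 0.05, 0.2, 1.0  # hypothetical parameters
ALPHA, N, ETA = 1.5, 4096, 0.25                     # damping, grid size, frequency step

def char_fn_log_price(u):
    """Characteristic function of ln(S_T) under GBM (normal distribution)."""
    mean = math.log(S0) + (R - 0.5 * SIGMA**2) * T
    return np.exp(1j * u * mean - 0.5 * SIGMA**2 * T * u**2)

lam = 2.0 * math.pi / (N * ETA)   # log-strike spacing, from lam * ETA = 2*pi/N
b = 0.5 * N * lam                 # log-strike grid covers [-b, b)
v = ETA * np.arange(N)            # frequency grid
k = -b + lam * np.arange(N)       # log-strike grid

# Fourier transform of the damped call price exp(ALPHA * k) * C(k).
psi = math.exp(-R * T) * char_fn_log_price(v - (ALPHA + 1.0) * 1j) / (
    ALPHA**2 + ALPHA - v**2 + 1j * (2.0 * ALPHA + 1.0) * v
)

# Simpson weights (eta/3) * (1, 4, 2, 4, ...).
j = np.arange(N)
weights = (ETA / 3.0) * (3.0 + (-1.0) ** (j + 1) - (j == 0))

# Invert with one FFT; discard the (numerically small) imaginary part.
fft_out = np.fft.fft(np.exp(1j * b * v) * psi * weights)
call_grid = np.exp(-ALPHA * k) / math.pi * fft_out.real

# Read off the price at the desired strike by interpolating in log-strike.
fft_price = float(np.interp(math.log(K), k, call_grid))
```

A single FFT yields prices on the whole log-strike grid, so the result can be checked against the Black-Scholes closed form at several strikes at once.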
Read "Multi-asset option pricing using a parallel Fourier-based technique" (C. C. W. Leentvaar, C. W. Oosterlee) multiple times and tried to implement it, without success.
Implemented the Conv method for the 1D case based on "A fast and accurate FFT-based method for pricing early-exercise options under Levy processes" (R. Lord et al). Derived the characteristic function for n-d GBM. In the process of extending the 1D case to the n-d case.
Same idea as the Monte-Carlo method for deep out-of-the-money options.
Continued writing the thesis. Finished writing up the n-d case. Found a bug related to the FFT and the characteristic function in the thesis on Monday.
Identified and fixed the characteristic function bug and the change-of-variable bug in the thesis. Successfully implemented the n-d Conv method and made it 100x faster by vectorization.
Added a medium-dimensional option pricing benchmark (geometric average payoff). Added an experiment module to simplify numerical experiments. Investigated why the Conv method only gives solutions on the part of the domain close to the spot price: circular convolution. Derived the analytical form of the Greeks using the Conv method.
Investigated circular convolution more closely. Gained a deeper understanding of TensorFlow and implemented the deep Galerkin method for the N-d European case. Wrote the first two chapters of the thesis: 1. Introduction and 2. Two Frameworks for Option Pricing. Experimented with the N-d MC method using Sobol sequences. Noticed that the N-d antithetic method may also be useful.
Thesis: Described three types of low-discrepancy methods and their implementations. Filled the gap in "the error bound of the DGM" by proving that the BS equation is a parabolic PDE. Added the statement of the maximum principle. Briefly introduced the architectures of DGM and BSDE and compared the two methods.
Implemented antithetic variates for n-d MC and added a discussion of it to the paper. Figured out why DGM using the Hessian is extremely slow: the batch size was too large. Took the error from the boundary conditions of the PDE into consideration.
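The wrap-around effect of circular convolution can be seen on a toy example. Multiplying two FFTs of the same length computes a circular convolution, so contributions that fall off one end of the grid reappear at the other end; zero-padding to the full linear-convolution length removes this. The arrays below are arbitrary illustrative data, not option values.

```python
import numpy as np

a = np.array([1.0, 2.0, 3.0, 4.0])
b = np.array([1.0, 1.0, 0.0, 0.0])

# FFT product without padding: circular convolution of length 4.
circular = np.fft.ifft(np.fft.fft(a) * np.fft.fft(b)).real

# Zero-pad both inputs to length len(a) + len(b) - 1: linear convolution.
n_lin = a.size + b.size - 1
linear = np.fft.ifft(np.fft.fft(a, n_lin) * np.fft.fft(b, n_lin)).real

# The circular result is the linear one with its tail wrapped onto the start.
wrapped = linear[:4].copy()
wrapped[: n_lin - 4] += linear[4:]
```

Here the first entry of `circular` is contaminated by the tail of the linear result, while entries away from the wrap-around are unaffected.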
Conducted numerical experiments and finished the thesis.