Research Works
This page contains the abstracts of some of the research projects I’ve worked on recently.
On the Continued Fraction Expansion of Almost All Real Numbers
It is well-known that every irrational number has a unique expansion as a continued fraction , where and the numbers , , are positive integers, called the continued fraction digits of . By a classical result of Gauss and Kuzmin, the continued fraction expansion of a random real number contains, with probability , each digit with asymptotic frequency ; that is, almost all real numbers satisfy
In this paper we consider two related questions: First, for certain infinite subsets , we establish simple closed formulas for the frequency with which the continued fraction digits of almost all real numbers belong to the set . For example, we show that for almost all real numbers , a proportion of the continued fraction digits of is of the form , where is a prime.
Second, we determine explicitly the frequency with which a string of consecutive digits appears in the continued fraction expansion of almost all real numbers. For example, we show that for almost all real numbers a string of consecutive digits appears in the continued fraction expansion of with frequency , where is the th Fibonacci number.
Finally, we compare the frequencies predicted by these results with actual frequencies found among the first 300 million continued fraction digits of . Our results show that the latter frequencies are statistically indistinguishable from those of a random real number.
Predicting Flight Delays Caused By Weather
This project is an attempt to predict the amount by which a given flight is delayed due to weather-related causes. We used a dataset on flight delays and their causes, and another about weather conditions in the US, and determined, for each plane, if its takeoff and/or landing time is during a severe weather event. If so, we used the time between the end of the event and the departure/arrival, as well as the type of weather event at each airport as features to try to predict the actual delay time due to weather. We tested two different models, KNN and SVM, to see which gave the better accuracy. Neither model was very successful due to a large number of confounding variables.
Link to our code:
https://gist.github.com/shreyas-s125/f42d9c435786aaa472e73b81c7d12eb8