Research Works
This page contains the abstracts of some of the research projects I’ve worked on recently.
On the Continued Fraction Expansion of Almost All Real Numbers
It is well-known that every irrational number has a unique expansion as a continued fraction
, where
and the numbers
,
, are positive integers, called the continued fraction digits of
. By a classical result of Gauss and Kuzmin, the continued fraction expansion of a random real number
contains, with probability
, each digit
with asymptotic frequency
; that is, almost all real numbers
satisfy
In this paper we consider two related questions: First, for certain infinite subsets , we establish simple closed formulas for the frequency with which the continued fraction digits of almost all real numbers belong to the set
. For example, we show that for almost all real numbers
, a proportion
of the continued fraction digits of
is of the form
, where
is a prime.
Second, we determine explicitly the frequency with which a string of consecutive digits
appears in the continued fraction expansion of almost all real numbers. For example, we show that for almost all real numbers
a string of
consecutive digits
appears in the continued fraction expansion of
with frequency
, where
is the
th Fibonacci number.
Finally, we compare the frequencies predicted by these results with actual frequencies found among the first 300 million continued fraction digits of . Our results show that the latter frequencies are statistically indistinguishable from those of a random real number.
Predicting Flight Delays Caused By Weather
This project is an attempt to predict the amount by which a given flight is delayed due to weather-related causes. We used a dataset on flight delays and their causes, and another about weather conditions in the US, and determined, for each plane, if its takeoff and/or landing time is during a severe weather event. If so, we used the time between the end of the event and the departure/arrival, as well as the type of weather event at each airport as features to try to predict the actual delay time due to weather. We tested two different models, KNN and SVM, to see which gave the better accuracy. Neither model was very successful due to a large number of confounding variables.
Link to our code:
https://gist.github.com/shreyas-s125/f42d9c435786aaa472e73b81c7d12eb8