About this page

This is a page to host personal projects that I have created using R. The page itself is built using the shiny package with a bootstrap layout and is written entirely in R.

Platform

This page is hosted on a DigitalOcean server running Shiny Server Open Source with an Rstudio backend. The web applications are built on the Shiny framework. PDF rendering is accomplished using Rmarkdown with custom LaTeX formatting.

Objective

This Shiny app was designed to provide the predicted home price and recent comparable sales for homes in Spokane County, WA. Predicted property values are calculated using both a standard regression model and a random forest model. The property location is displayed using a leaflet widget and the sales price is predicted and graphed for a +/- 6 month window. The data table contains comparable recent sales (if any) for the property and can be modified using the additional input widgets to specify distance, price difference, etc. The report style and generate report button will create a PDF flyer for the property of interest with a list of recent relevant sales (dynamically generated) and the corresponding list of additional comparable properties (these are not based on the user input widgets, but are instead determined programatically).

Methodology

Sales and property information were obtained and compiled from the publicly available records from the Spokane County Assessor's office. Geospatial data were obtained by reverse geocoding property addresses using the US Census Bureau API to obtain the longitude and latitude of each property. The data for recent sales were fitted using both linear regression and random forest models and the values for all properties were predicted using the current date. All data and predicted prices are stored using MySQL and calculated daily using updated models. PDF property reports are generated from Rmarkdown templates using custom LaTeX formatting.

Generate report



Objective

This Shiny app was designed to provide marketing mail for real estate prospects using dynamic content for each property. The input widgets allow for selection of a specific neighborhood (county designation) along with filtering by the previous sales date predicted sales price of the property. The Listings or Buyers widget allows for the user to subset mailing address based on likely home owner (Listings), renter (First-time Buyers), or rental property owner (Property Owners). The report style provides different mailer styles and custom banners for each agent. The specific information for recent home sales is dynamically generated for each property based on a optimization of predicted sales price, geographical location, and property details to enhance engagement and elicit a "Wow, you will never believe what the house right down the street that is exactly like ours just sold for" reaction.

Methodology

All data were obtained and modeled as described under the section for "Predictive Modeling for Home Prices". User input for neighborhood, sales date, etc. are passed as parameters to the template selected. Templates were constructed using Rmarkdown with custom LaTeX formatting to produce the dimensional output (flyer vs. postcard) for PDF printable mailing media. The "Recent Home sales in your area" content is dynamically generated for each property based on a combination of the predicted sales price of the property and the geographical proximity of the mailer property and recent sales locations.

Generate report



Objective

To perform a time series analysis to identify stocks that are currently displaying a cup and handle price pattern. Daily price rates were obtained for all NASDAQ listed stocks for a 1-year period (2018-11-08 to 2019-11-08) and analyzed for the detection of stocks displaying the cup and handle pattern in the past 30 days (prior to 2019-11-08). The results of the analysis are displayed below and contain price charts for each stock that was detected to contain a cup and handle with the the period of interest. The cup top (blue), cup bottom (yellow), handle top (green), and handle bottom (red) are labeled along with the fitted line (yellow) used to calculate the points

Methodology

Daily stock prices were obtained for all NASDAQ listed stocks using the BatchGetSymbols package. The daily price data for each stock were processed using a smoothing spline. The cup top, cup bottom, handle top, and handle bottom were identified using peak and valley detection for the fitted values of the time series. Stock prices and results were stored in SQLite database and HTML reports were generated using RMarkdown for results meeting the parameters of interest (i.e. a handle occurring in the previous 30 days).

Fred Franken


Education

Oregon Health & Science University, Portland, OR

Ph.D. in Behavioral Neuroscience

Truman State University, Kirksville, MO

B.S. in Psychology, Minor in Biology


R Programming Experience

Data Analysis
  • Data preparation and analysis, including use of packages: dplyr, stringr, quantmod, zoo, stats, randomForest
  • Data acquisition and storage, including use of packages: RMySQL, SQLite, fst, RODBC, and publicly available APIs
  • Predictive modeling, including use of standard regression models and decision trees
Data Visualization and Automated Reporting
  • Interactive data visualization, including use of packages: shiny, leaflet, dplyr, and DT
  • Automated reporting outputs via rmarkdown and knitr
  • Production ready print output using rmarkdown with custom LaTeX formatting

Professional Experience

Scientist, Jubilant HollisterStier, Spokane, WA (02/2012 - current)
  • Direct the review and implementation of changes to regulatory requirements to ensure CFR and GMP compliance
  • Perform method development, protocol executions, and author acceptance reports for new analytical methods
  • Design and maintain data systems for analysis of facility metrics and inventory optimization
Graduate Student, Oregon Health & Science University, Portland, OR (09/2005 - 06/2011)
  • Refined skills in experimental design, laboratory techniques, and data analysis
  • Performed cell culture techniques and assay procedures, including radioligand binding and uptake, ELISA, and HPLC
Research Fellow, National Institutes of Health, Baltimore, MD (09/2003 - 09/2005)
  • Assisted in experimental design, protocol development, and data analysis for clinical trials
  • Managed study personnel, participant scheduling, and database integrity

Freelance Projects

Founder/Developer, ieMetrics (05/2017 - current)
  • Developed web bases services for Customer Relationship Management and record storage
  • Data systems included UI developed in PHP, MySQL, and dashboard metrics provided by analysis in R with data visualization in Shiny apps
Presenter, Inland Northwest R User Group