About This Project
Data analysts often struggle to determine the optimal amount of historical data required for solving specific business problems. This challenge stems from the difficulty in identifying temporal inflection points—periods when relationships among variables significantly change (e.g., the COVID-19 pandemic disrupted many pre-2020 data behaviors). This research aims to develop statistical methods to detect such inflection points to improve the accuracy of business insights.
Ask the Scientists
Join The DiscussionWhat is the context of this research?
Businesses increasingly depend on historical data to drive decision-making and predict future outcomes. However, determining the optimal timeframe for relevant data remains a complex challenge, particularly when significant external events disrupt established patterns.
These disruptions underscore a critical issue in analytics: the inability to pinpoint temporal inflection points where data behaviors and relationships undergo fundamental shifts.
This project aims to address this challenge by developing a statistical framework to detect such inflection points. By identifying when key changes occur in data patterns, the research will enable analysts to focus on the most relevant and recent data, improving the accuracy of insights and the reliability of predictions.
What is the significance of this project?
The research has several key implications:
Enhanced Decision-Making: By identifying the most relevant timeframe for analysis, businesses can derive insights that reflect current realities, improving strategic and operational decisions.
Improved Predictive Models: Focusing on relevant data improves the accuracy of predictive analytics, enabling organizations to anticipate trends and mitigate risks effectively.
Broader Applicability: The methodology developed through this project is adaptable across industries, from finance and retail to healthcare and logistics, wherever data-driven insights drive outcomes.
Efficient Resource Utilization: By narrowing the scope of data analysis to the most relevant periods, organizations can reduce computational costs and streamline analytical processes.
What are the goals of the project?
Goals of the Project
Design and validate a robust methodology to identify the points in time where significant changes in data patterns or variable relationships occur.
Establish techniques to isolate the most pertinent segments of historical data, ensuring that analyses reflect current trends and patterns.
Create tools or guidelines to improve the accuracy of business insights and predictive models by focusing on data with contemporary relevance.
Test and refine the methodology across multiple industries (e.g., retail, finance, healthcare) to ensure versatility and applicability in different business contexts.
Publish findings and share methodologies to advance the understanding of how temporal shifts impact data analysis, supporting both academic research and practical implementation.
Budget
This budget allocation will support the hiring of a student who will assist in managing the substantial workload of this research project under my direct supervision.
• Python Libraries-Advanced packages Like Scikit-Learn, TensorFlow, PyTorch, and XGBoost for Machine Learning And Deep Learning Based Tasks.
• Data Visualization- Software such as Tableau for creating insightful visualization.
• Data Pre-Processing: Access to platforms like MATLAB.
• Statistical Software: Tools like SPSS or SAS for robust statistical analysis.
Endorsed by
Project Timeline
This research will begin in January 2025 with a comprehensive literature review, which will be followed by the preliminary research frameworks. From February to May 2025, the team will be focusing on the development of statistical models and algorithms. During the publication and wrap-up phase, from June to August 2025, the team will finalize conclusions and insights. The team will also develop a prototype for the businesses to implement.
Dec 16, 2024
Project Launched
Jan 31, 2025
Project Initiation
May 31, 2025
Framework Development
Aug 31, 2025
Publication & Wrap-up
Meet the Team
Abhimanyu Gupta
Abhimanyu Gupta has been an Instructor of Data Science at the Richard A. Chaifetz School of Business, Saint Louis University, since 2017. He earned a Master of Statistics (M.Stat.) from the Indian Statistical Institute and a Doctor of Business Economics (Ph.D.) in Information Systems from Ghent University, Belgium. Prior to entering academia, Abhi worked for 17 years as a Project Manager in the Information Technology industry.
Abhi’s teaching excellence has been recognized with several prestigious awards, including the Emerson Excellence in Teaching Award in 2021. He was also named Graduate Faculty of the Year in 2020, 2021, and 2024, and received the Beta Gamma Sigma Outstanding Teacher Award in 2023.
Shubh Patel
Shubh Vivek Patel is a master's student in artificial intelligence at Saint Louis University with six years of experience in data science, machine learning, and deep learning. He holds a bachelor's degree in artificial intelligence, which laid the foundation for his strong analytical skills and passion for AI-driven solutions.
Proficient in PyTorch, TensorFlow, Python, and SQL, Shubh is also an active open-source contributor. He is passionate about leveraging AI to solve real-world problems and is committed to continuous learning and innovation.
Lab Notes
Nothing posted yet.
Additional Information
[1] Cang, S., & Yu, H. (2014). A combination selection algorithm on forecasting of stock inflection points. Journal of Computational and Applied Mathematics, 259, 676-685. https://doi.org/10.1016/j.cam....
[2] Preis, T., Moat, H. S., & Stanley, H. E. (2013). Quantifying trading behavior in financial markets using Google Trends. Scientific Reports, 3, 1684. https://doi.org/10.1038/srep01...
[3] Moat, H. S., Curme, C., Avakian, A., Kenett, D. Y., Stanley, H. E., & Preis, T. (2013). Quantifying Wikipedia usage patterns before stock market moves. Scientific Reports, 3, 1801. https://doi.org/10.1038/srep01...
[4] Benson, C. L., & Magee, C. L. (2018). Data-driven investment decision-making: Applying Moore's Law and S-Curves to business strategies. *arXiv preprint arXiv:1805.06339. https://arxiv.org/abs/1805.06339
[5] Arslan, M., & Cruz, C. (2023). Unlocking insights into business trajectories with transformer-based spatio-temporal data analysis. *arXiv preprint arXiv:2306.10034. https://arxiv.org/abs/2306.10034
Project Backers
- 0Backers
- 0%Funded
- $0Total Donations
- $0Average Donation