-
A Neural Topic Method Using a Large-Language-Model-in-the-Loop for Business Research
Authors:
Stephan Ludwig,
Peter J. Danaher,
Xiaohao Yang
Abstract:
The growing use of unstructured text in business research makes topic modeling a central tool for constructing explanatory variables from reviews, social media, and open-ended survey responses, yet existing approaches function poorly as measurement instruments. Prior work shows that textual content predicts outcomes such as sales, satisfaction, and firm performance, but probabilistic models often generate conceptually diffuse topics, neural topic models are difficult to interpret in theory-driven settings, and large language model approaches lack standardization, stability, and alignment with document-level representations. We introduce LX Topic, a neural topic method that conceptualizes topics as latent linguistic constructs and produces calibrated document-level topic proportions for empirical analysis. LX Topic builds on FASTopic to ensure strong document representativeness and integrates large language model refinement at the topic-word level using alignment and confidence-weighting mechanisms that enhance semantic coherence without distorting document-topic distributions. Evaluations on large-scale Amazon and Yelp review datasets demonstrate that LX Topic achieves the highest overall topic quality relative to leading models while preserving clustering and classification performance. By unifying topic discovery, refinement, and standardized output in a web-based system, LX Topic establishes topic modeling as a reproducible, interpretable, and measurement-oriented instrument for marketing research and practice.
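The abstract does not specify the exact form of the confidence-weighting mechanism, but the core idea — refining topic-word distributions while leaving document-topic proportions untouched — can be sketched as follows. The function name and the linear blending scheme are illustrative assumptions, not the authors' implementation:

```python
import numpy as np

def refine_topic_words(model_dist, llm_dist, confidence):
    """Blend a topic's model-estimated word distribution with an
    LLM-suggested one, weighted by a confidence score in [0, 1].

    Operating only at the topic-word level leaves the learned
    document-topic proportions untouched.
    """
    blended = (1.0 - confidence) * model_dist + confidence * llm_dist
    return blended / blended.sum()  # renormalise to a proper distribution

# With zero confidence the model distribution is returned unchanged.
model = np.array([0.5, 0.3, 0.2])
llm = np.array([0.2, 0.5, 0.3])
print(refine_topic_words(model, llm, 0.0))  # [0.5 0.3 0.2]
```

Because the refinement acts only on the topic-word matrix, any downstream regression built on the document-topic proportions is unaffected by the LLM step.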
Submitted 3 March, 2026;
originally announced March 2026.
-
Extracting Consumer Insight from Text: A Large Language Model Approach to Emotion and Evaluation Measurement
Authors:
Stephan Ludwig,
Peter J. Danaher,
Xiaohao Yang,
Yu-Ting Lin,
Ehsan Abedin,
Dhruv Grewal,
Lan Du
Abstract:
Accurately measuring consumer emotions and evaluations from unstructured text remains a core challenge for marketing research and practice. This study introduces the Linguistic eXtractor (LX), a fine-tuned large language model trained on consumer-authored text labeled with consumers' self-reported ratings of 16 consumption-related emotions and four evaluation constructs: trust, commitment, recommendation, and sentiment. LX consistently outperforms leading models, including GPT-4 Turbo, RoBERTa, and DeepSeek, achieving an 81% macro-F1 score on open-ended survey responses and greater than 95% accuracy on third-party-annotated Amazon and Yelp reviews. An application of LX to online retail data, using seemingly unrelated regression, affirms that review-expressed emotions predict product ratings, which in turn predict purchase behavior. Most emotional effects are mediated by product ratings, though some emotions, such as discontent and peacefulness, influence purchase directly, indicating that emotional tone provides meaningful signals beyond star ratings. To support its use, a no-code, cost-free LX web application is available, enabling scalable analyses of consumer-authored text. In establishing a new methodological foundation for consumer perception measurement, this research demonstrates how large language models can be leveraged to advance marketing research and practice through validated detection of marketing constructs in consumer data.
Submitted 16 February, 2026;
originally announced February 2026.
-
EEG-D3: A Solution to the Hidden Overfitting Problem of Deep Learning Models
Authors:
Siegfried Ludwig,
Stylianos Bakas,
Konstantinos Barmpas,
Georgios Zoumpourlis,
Dimitrios A. Adamos,
Nikolaos Laskaris,
Yannis Panagakis,
Stefanos Zafeiriou
Abstract:
Deep learning for decoding EEG signals has gained traction, with many claims to state-of-the-art accuracy. However, despite the convincing benchmark performance, successful translation to real applications is limited. The frequent disconnect between strong performance on controlled BCI benchmarks and poor generalisation to practical settings indicates hidden overfitting problems. We introduce Disentangled Decoding Decomposition (D3), a weakly supervised method for training deep learning models across EEG datasets. By predicting the place in the respective trial sequence from which the input window was sampled, EEG-D3 separates latent components of brain activity, akin to non-linear ICA. We utilise a novel model architecture with fully independent sub-networks for strict interpretability. We outline a feature interpretation paradigm to contrast the component activation profiles on different datasets and inspect the associated temporal and spatial filters. The proposed method reliably separates latent components of brain activity on motor imagery data. Training downstream classifiers on an appropriate subset of these components prevents hidden overfitting caused by task-correlated artefacts, which severely affects end-to-end classifiers. We further exploit the linearly separable latent space for effective few-shot learning on sleep stage classification. The ability to distinguish genuine components of brain activity from spurious features results in models that avoid the hidden overfitting problem and generalise well to real-world applications, while requiring only minimal labelled data. Of particular interest to the neuroscience community, the proposed method gives researchers a tool to separate individual brain processes and potentially even uncover heretofore unknown dynamics.
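The weak supervision signal described above — predicting where in the trial sequence an input window was sampled from — can be sketched minimally as a labeling routine. The shapes, bin count, and helper name are illustrative assumptions, not the authors' pipeline:

```python
import numpy as np

def make_position_labels(trial, window_len, n_bins=4):
    """Cut one trial into windows and label each window by its relative
    place (early ... late) in the trial, discretised into n_bins classes.

    The position label is free weak supervision: no task labels needed.
    trial: array of shape (n_channels, n_samples)
    """
    n = trial.shape[-1]
    last_start = n - window_len
    windows, labels = [], []
    for start in range(0, last_start + 1, window_len):
        windows.append(trial[..., start:start + window_len])
        labels.append(int(n_bins * start / (last_start + 1)))
    return np.stack(windows), np.array(labels)

# Example: a 1000-sample, 8-channel trial cut into four 250-sample windows.
windows, labels = make_position_labels(np.zeros((8, 1000)), window_len=250)
print(windows.shape, labels)  # (4, 8, 250) [0 1 2 3]
```

Training a classifier on these position labels then forces the network to learn components of brain activity that evolve systematically over the course of a trial.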
Submitted 15 December, 2025;
originally announced December 2025.
-
Stochastic Resonance Improves the Detection of Low Contrast Images in Deep Learning Models
Authors:
Siegfried Ludwig
Abstract:
Stochastic resonance describes the utility of noise in improving the detectability of weak signals in certain types of systems. It has been observed widely in natural and engineered settings, but its utility in image classification with rate-based neural networks has not been studied extensively. In this analysis, a simple LSTM recurrent neural network is trained for digit recognition and classification. During the test phase, image contrast is reduced to the point where the model fails to recognize the presence of a stimulus. Adding controlled noise then partially recovers classification performance. The results indicate the presence of stochastic resonance in rate-based recurrent neural networks.
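The effect described above — a sub-threshold stimulus whose detection improves when noise is added — can be illustrated with a minimal threshold-detector sketch. The signal, threshold, and detection criterion are simplified assumptions, not the LSTM experiment itself:

```python
import numpy as np

rng = np.random.default_rng(0)

# A weak periodic stimulus that never crosses the detection threshold alone.
threshold = 1.0
t = np.linspace(0, 4 * np.pi, 1000)
signal = 0.8 * np.sin(t)  # sub-threshold: peak amplitude 0.8 < 1.0

def detection_rate(noise_std, n_trials=200):
    """Fraction of trials in which the noisy stimulus crosses the
    threshold somewhere near the signal peaks."""
    hits = 0
    for _ in range(n_trials):
        noisy = signal + rng.normal(0.0, noise_std, size=signal.shape)
        if np.any(noisy[signal > 0.7] > threshold):
            hits += 1
    return hits / n_trials

# No noise: the weak signal is never detected. Moderate noise pushes it
# over threshold on most trials -- the stochastic resonance effect.
print(detection_rate(0.0))  # 0.0
print(detection_rate(0.3))  # close to 1.0
```

Sweeping `noise_std` over a range typically reveals the characteristic inverted-U curve: too little noise leaves the signal undetected, too much noise drowns it out.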
Submitted 20 February, 2025;
originally announced February 2025.
-
Latent Alignment with Deep Set EEG Decoders
Authors:
Stylianos Bakas,
Siegfried Ludwig,
Dimitrios A. Adamos,
Nikolaos Laskaris,
Yannis Panagakis,
Stefanos Zafeiriou
Abstract:
The variability in EEG signals between different individuals poses a significant challenge when implementing brain-computer interfaces (BCI). Commonly proposed solutions to this problem include deep learning models, due to their increased capacity and generalization, as well as explicit domain adaptation techniques. Here, we introduce the Latent Alignment method that won the Benchmarks for EEG Transfer Learning (BEETL) competition and present its formulation as a deep set applied on the set of trials from a given subject. Its performance is compared to recent statistical domain adaptation techniques under various conditions. The experimental paradigms include motor imagery (MI), oddball event-related potentials (ERP) and sleep stage classification, where different well-established deep learning models are applied on each task. Our experimental results show that performing statistical distribution alignment at later stages in a deep learning model is beneficial to the classification accuracy, yielding the highest performance for our proposed method. We further investigate practical considerations that arise in the context of using deep learning and statistical alignment for EEG decoding. In this regard, we study class-discriminative artifacts that can spuriously improve results for deep learning models, as well as the impact of class-imbalance on alignment. We delineate a trade-off relationship between increased classification accuracy when alignment is performed at later modeling stages, and susceptibility to class-imbalance in the set of trials that the statistics are computed on.
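A minimal sketch of statistical alignment over a subject's set of trials — z-scoring latent features with statistics computed on the whole trial set, which is the deep-set view described above. The function is an illustrative assumption, not the competition-winning implementation:

```python
import numpy as np

def align_latents(latents, eps=1e-6):
    """Align one subject's set of trial latents to zero mean and unit
    variance, with the statistics computed over the whole set of trials
    (the deep-set view: the alignment is a function of the trial set).

    latents: array of shape (n_trials, n_features)
    """
    mu = latents.mean(axis=0, keepdims=True)
    sigma = latents.std(axis=0, keepdims=True)
    return (latents - mu) / (sigma + eps)

# Two subjects whose latent features sit at very different offsets/scales.
rng = np.random.default_rng(1)
subj_a = rng.normal(5.0, 2.0, size=(64, 8))
subj_b = rng.normal(-3.0, 0.5, size=(64, 8))

# After alignment both subjects occupy a comparable latent distribution.
print(np.allclose(align_latents(subj_a).mean(axis=0), 0.0, atol=1e-6))  # True
```

The trade-off noted in the abstract shows up directly here: if the trial set is class-imbalanced, the set-level mean and variance absorb class information, so the alignment can shift class-discriminative structure.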
Submitted 29 November, 2023;
originally announced November 2023.
-
Reference Network and Localization Architecture for Smart Manufacturing based on 5G
Authors:
Stephan Ludwig,
Doris Aschenbrenner,
Marvin Scharle,
Henrik Klessig,
Michael Karrenbauer,
Huanzhuo Wu,
Maroua Taghouti,
Pedro Lozano,
Hans D. Schotten,
Frank H. P. Fitzek
Abstract:
5G promises to shift Industry 4.0 to the next level by allowing flexible production. However, many different communication standards are in use throughout a production site, and this will remain the case for the foreseeable future. Furthermore, localization of assets will be equally valuable in order to reach a higher level of automation. This paper proposes a reference architecture for a convergent localization and communication network for smart manufacturing that combines 5G with other existing technologies and focuses on high-mix low-volume applications, in particular at small and medium-sized enterprises. The architecture is derived from a set of functional requirements, and we describe different views on this architecture to show how the requirements can be fulfilled. It connects private and public mobile networks with local networking technologies to achieve a flexible setup addressing many industrial use cases.
Submitted 4 April, 2022; v1 submitted 1 April, 2022;
originally announced April 2022.
-
2021 BEETL Competition: Advancing Transfer Learning for Subject Independence & Heterogenous EEG Data Sets
Authors:
Xiaoxi Wei,
A. Aldo Faisal,
Moritz Grosse-Wentrup,
Alexandre Gramfort,
Sylvain Chevallier,
Vinay Jayaram,
Camille Jeunet,
Stylianos Bakas,
Siegfried Ludwig,
Konstantinos Barmpas,
Mehdi Bahri,
Yannis Panagakis,
Nikolaos Laskaris,
Dimitrios A. Adamos,
Stefanos Zafeiriou,
William C. Duong,
Stephen M. Gordon,
Vernon J. Lawhern,
Maciej Śliwowski,
Vincent Rouanne,
Piotr Tempczyk
Abstract:
Transfer learning and meta-learning offer some of the most promising avenues to unlock the scalability of healthcare and consumer technologies driven by biosignal data. This is because current methods cannot generalise well across human subjects' data or handle learning from heterogeneously collected data sets, thus limiting the scale of training data. On the other hand, developments in transfer learning would benefit significantly from a real-world benchmark with immediate practical application. Therefore, we pick electroencephalography (EEG) as an exemplar of what makes biosignal machine learning hard. We design two transfer learning challenges around diagnostics and brain-computer interfacing (BCI) that have to be solved in the face of low signal-to-noise ratios, major variability among subjects, differences in the data recording sessions and techniques, and even differences between the specific BCI tasks recorded in the dataset. Task 1 is centred on medical diagnostics, addressing automatic sleep stage annotation across subjects. Task 2 is centred on BCI, addressing motor imagery decoding across both subjects and data sets. The BEETL competition, with over 30 competing teams and 3 winning entries, brought attention to the potential of deep transfer learning and of combining set theory with conventional machine learning techniques to overcome these challenges. The results set a new state-of-the-art for the real-world BEETL benchmark.
Submitted 14 February, 2022;
originally announced February 2022.
-
Team Cogitat at NeurIPS 2021: Benchmarks for EEG Transfer Learning Competition
Authors:
Stylianos Bakas,
Siegfried Ludwig,
Konstantinos Barmpas,
Mehdi Bahri,
Yannis Panagakis,
Nikolaos Laskaris,
Dimitrios A. Adamos,
Stefanos Zafeiriou
Abstract:
Building subject-independent deep learning models for EEG decoding faces the challenge of strong covariate shift across different datasets, subjects and recording sessions. Our approach to address this difficulty is to explicitly align feature distributions at various layers of the deep learning model, using both simple statistical techniques and trainable methods with more representational capacity. This follows in a similar vein as covariance-based alignment methods, often used in a Riemannian manifold context. The methodology proposed herein won first place in the 2021 Benchmarks in EEG Transfer Learning (BEETL) competition, hosted at the NeurIPS conference. The first task of the competition consisted of sleep stage classification, which required models trained on younger subjects to perform inference on multiple subjects of older age groups without personalized calibration data, thus demanding subject-independent models. The second task required transferring models trained on one or more source motor imagery datasets to perform inference on two target datasets, with a small set of personalized calibration data provided for multiple test subjects.
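A common covariance-based baseline in the vein mentioned above is Euclidean alignment, which whitens each subject's trials by the inverse square root of their mean spatial covariance. This is a minimal sketch of that baseline, not the trainable alignment layers the team used:

```python
import numpy as np

def euclidean_align(trials):
    """Whiten a subject's trials by the inverse square root of their mean
    spatial covariance, so the average covariance becomes the identity.

    trials: array of shape (n_trials, n_channels, n_samples)
    """
    covs = np.array([x @ x.T / x.shape[1] for x in trials])
    mean_cov = covs.mean(axis=0)
    # Inverse matrix square root via eigendecomposition.
    w, v = np.linalg.eigh(mean_cov)
    inv_sqrt = v @ np.diag(1.0 / np.sqrt(w)) @ v.T
    return np.array([inv_sqrt @ x for x in trials])

# After alignment the mean spatial covariance is (numerically) identity.
rng = np.random.default_rng(0)
aligned = euclidean_align(rng.normal(size=(20, 4, 256)))
mean_cov = np.mean([x @ x.T / x.shape[1] for x in aligned], axis=0)
print(np.allclose(mean_cov, np.eye(4), atol=1e-8))  # True
```

Applying this per subject removes subject-specific covariance structure before the data reaches a shared decoder, which is the same goal the deep-model alignment layers pursue at later stages.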
Submitted 1 February, 2022;
originally announced February 2022.
-
EEGminer: Discovering Interpretable Features of Brain Activity with Learnable Filters
Authors:
Siegfried Ludwig,
Stylianos Bakas,
Dimitrios A. Adamos,
Nikolaos Laskaris,
Yannis Panagakis,
Stefanos Zafeiriou
Abstract:
Patterns of brain activity are associated with different brain processes and can be used to identify different brain states and make behavioral predictions. However, the relevant features are not readily apparent and accessible. To mine informative latent representations from multichannel recordings of ongoing EEG activity, we propose a novel differentiable decoding pipeline consisting of learnable filters and a pre-determined feature extraction module. Specifically, we introduce filters parameterized by generalized Gaussian functions that offer a smooth derivative for stable end-to-end model training and allow for learning interpretable features. For the feature module, we use signal magnitude and functional connectivity estimates. We demonstrate the utility of our model towards emotion recognition from EEG signals on the SEED dataset, as well as on a new EEG dataset of unprecedented size (i.e., 761 subjects), where we identify consistent trends of music perception and related individual differences. The discovered features align with previous neuroscience studies and offer new insights, such as marked differences in the functional connectivity profile between left and right temporal areas during music listening. This agrees with the respective specialisation of the temporal lobes regarding music perception proposed in the literature.
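A generalized Gaussian frequency response of the kind described — smooth, differentiable, and reducing to an ordinary Gaussian for a shape parameter of 2 — can be sketched as a spectral mask. The parameter names and the FFT-masking approach are illustrative assumptions, not the published parameterization:

```python
import numpy as np

def generalized_gaussian_filter(freqs, center, width, shape):
    """Smooth band-pass magnitude response. shape = 2 gives an ordinary
    Gaussian; larger shape values approach a rectangular band-pass while
    keeping a smooth derivative for end-to-end training."""
    return np.exp(-np.abs((freqs - center) / width) ** shape)

def apply_filter(x, fs, center, width, shape):
    """Filter a 1-D signal by masking its spectrum with the response."""
    spectrum = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    mask = generalized_gaussian_filter(freqs, center, width, shape)
    return np.fft.irfft(spectrum * mask, n=len(x))

# Example: keep a 10 Hz component and suppress a 40 Hz component.
fs = 250
t = np.arange(fs) / fs  # one second of signal
x = np.sin(2 * np.pi * 10 * t) + np.sin(2 * np.pi * 40 * t)
y = apply_filter(x, fs, center=10.0, width=3.0, shape=4.0)
```

Because `center`, `width`, and `shape` enter the response smoothly, all three can be learned by gradient descent, which is what makes the extracted frequency bands interpretable after training.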
Submitted 2 February, 2022; v1 submitted 19 October, 2021;
originally announced October 2021.
-
Automated Essay Scoring Using Transformer Models
Authors:
Sabrina Ludwig,
Christian Mayer,
Christopher Hansen,
Kerstin Eilers,
Steffen Brandt
Abstract:
Automated essay scoring (AES) is gaining increasing attention in the education sector as it significantly reduces the burden of manual scoring and allows ad hoc feedback for learners. Natural language processing based on machine learning has been shown to be particularly suitable for text classification and AES. While many machine-learning approaches for AES still rely on a bag-of-words (BOW) approach, we consider a transformer-based approach in this paper, compare its performance to a logistic regression model based on the BOW approach, and discuss their differences. The analysis is based on 2,088 email responses to a problem-solving task that were manually labeled for politeness. Both transformer models considered in this analysis outperformed the regression-based model without any hyperparameter tuning. We argue that for AES tasks such as politeness classification, the transformer-based approach has significant advantages, while a BOW approach suffers from ignoring word order and reducing words to their stems. Further, we show how such models can help increase the accuracy of human raters, and we provide detailed instructions on how to implement transformer-based models for one's own purposes.
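The BOW baseline described above — a logistic regression over word counts — can be sketched with a toy corpus. The texts and labels are invented stand-ins for the actual labeled e-mail data:

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression

# Invented toy corpus standing in for the labeled e-mail responses
# (labels: 1 = polite, 0 = impolite).
texts = [
    "could you please send the report, thank you",
    "send the report now",
    "would you kindly review this, many thanks",
    "review this immediately",
    "thanks a lot, please check when you have time",
    "check this now, it is late",
]
labels = [1, 0, 1, 0, 1, 0]

# Bag-of-words representation: word order is discarded entirely,
# which is exactly the limitation the transformer models address.
vectorizer = CountVectorizer()
X = vectorizer.fit_transform(texts)
clf = LogisticRegression().fit(X, labels)

print(clf.predict(vectorizer.transform(["please send it, thank you"])))
```

The weakness the paper highlights is visible in this representation: "you could please send it" and "send it, could you please" map to the same count vector, so any order-dependent politeness cue is lost to the model.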
Submitted 13 October, 2021;
originally announced October 2021.
-
A 5G Architecture for The Factory of the Future
Authors:
Stephan Ludwig,
Michael Karrenbauer,
Amina Fellan,
Hans D. Schotten,
Henning Buhr,
Savita Seetaraman,
Norbert Niebert,
Anne Bernardy,
Vasco Seelmann,
Volker Stich,
Andreas Hoell,
Christian Stimming,
Huanzhuo Wu,
Simon Wunderlich,
Maroua Taghouti,
Frank Fitzek,
Christoph Pallasch,
Nicolai Hoffmann,
Werner Herfs,
Elena Eberhardt,
Thomas Schildknecht
Abstract:
Factory automation and production are currently undergoing massive changes, and 5G is considered a key enabler. In this paper, we present use cases for 5G in the factory of the future, motivated by the actual needs of the industry partners of the "5Gang" consortium. Based on these use cases and those defined by 3GPP, a 5G system architecture for the factory of the future is proposed. It is set in relation to existing architectural frameworks.
Submitted 25 September, 2018;
originally announced September 2018.
-
Towards a Flexible Architecture for Industrial Networking
Authors:
Michael Karrenbauer,
Amina Fellan,
Hans D. Schotten,
Henning Buhr,
Savita Seetaraman,
Norbert Niebert,
Stephan Ludwig,
Anne Bernardy,
Vasco Seelmann,
Volker Stich,
Andreas Hoell,
Christian Stimming,
Huanzhuo Wu,
Simon Wunderlich,
Maroua Taghouti,
Frank Fitzek,
Christoph Pallasch,
Nicolai Hoffmann,
Werner Herfs,
Elena Eberhardt,
Thomas Schildknecht
Abstract:
The digitalization of manufacturing processes is expected to lead to a growing interconnection of production sites, as well as of machines, tools and work pieces. In the course of this development, new use cases arise that have challenging requirements from a communication technology point of view. In this paper we propose a communication network architecture for Industry 4.0 applications, which combines new 5G and non-cellular wireless network technologies with existing (wired) fieldbus technologies on the shop floor. This architecture includes the possibility to use private and public mobile networks together with local networking technologies to achieve a flexible setup that addresses many different industrial use cases. It is embedded into the Industrial Internet Reference Architecture and the RAMI4.0 reference architecture. The paper shows how the advancements introduced with the new 5G mobile technology can fulfill a wide range of industry requirements and thus enable new Industry 4.0 applications. Since 5G standardization is still ongoing, the proposed architecture focuses, in a first step, mainly on new advanced features in the core network and will be developed further at a later stage.
Submitted 18 April, 2018; v1 submitted 12 April, 2018;
originally announced April 2018.