Skip to main content

Research Papers

WiGenAI: The Symphony of Wireless and Generative AI via Diffusion Models

AI transforms wireless communication systems.

10-11-2023

Large Language Models for Propaganda Detection

Unmasking Deceptive Propaganda Online

10-10-2023

QualiGPT: GPT as an easy-to-use tool for qualitative coding

Introducing QualiGPT: Transforming Qualitative Analysis

10-10-2023

CODING BY DESIGN: GPT-4 EMPOWERS AGILE MODEL DRIVEN DEVELOPMENT

Enhancing Code Generation with Agility

10-06-2023

BENCHMARKING LARGE LANGUAGE MODELS AS AI RESEARCH AGENTS

AI research agents face ML challenges.

10-05-2023

REFORMULATING DOMAIN ADAPTATION OF LARGE LANGUAGE MODELS AS ADAPT-RETRIEVE-REVISE

Adapt, Generate, Retrieve, Revise, Answer.

10-05-2023

MathCoder:  Seamless  Code  Integration  in LLMs for Enhanced Mathematical Reasoning

MathCoder outperforms GPT-4 in math.

10-05-2023

Automating Human Tutor-Style Programming Feedback: Leveraging GPT-4 Tutor Model for Hint Generation and GPT-3.5 Student Model for Hint Validation

Enhancing programming with AI hints.

10-05-2023

CONTRASTIVE POST-TRAINING LARGE LANGUAGEMODELS ON DATA CURRICULUM

Alignment techniques improve LLM model.

10-03-2023

DALL·E 3 System Card

ChatGPT enhances DALL·E 3 creativity.

10-03-2023

GPT-4V(ision) System Card

GPT-4V: Vision meets language expertise.

09-25-2023

The Moral Machine Experiment on Large Language Models

LLMs’ ethical decisions: similarities, stark differences.

09-12-2023

Strategic Behavior of Large Language Models: Game Structure vs. Contextual Framing

LLMs’ strategic choices vary; context matters.

09-12-2023

On the Planning, Search, and Memorization Capabilities of Large Language Models

Exploring GPT-4’s strengths, limitations in planning.

09-05-2023

Data-Juicer: A One-Stop Data Processing System for Large Language Models

Data-Juicer: Revolutionizing LLM Data Processing.

09-05-2023

Large Language Models for Semantic Monitoring of Corporate Disclosures: A Case Study on Korea’s Top 50 KOSPI Companies

AI analyzes Korean disclosures sentiment.

09-1-2023

TouchStone: Evaluating Vision-Language Models by Language Models

Evaluating LVLMs: TouchStone, Comprehensive Visual Dialogue.

09-01-2023

Linking microblogging sentiments to stock price movement: An application of GPT-4

Advanced sentiment predicts stocks; GPT-4 shines.

09-01-2023

Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models

SparklesChat: advanced multimodal dialogue across images.

08-31-2023

PointLLM: Empowering Large Language Models to Understand Point Clouds

PointLLM: Bridging LLMs with 3D Understanding.

08-31-2023

Wizard Math: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

Enhancing LLM Math Abilities with WizardMath

08-18-2023

Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment

LLMs’ Power Raises Concerns: RED-EVAL

08-18-2023

MaScQA: A Question Answering Dataset for Investigating Materials Science Knowledge of Large Language Models

LLMs Evaluated on Challenging Materials Questions

08-17-2023

Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes

Chat-3D: Universal Dialogue for 3D Scenes

08-17-2023

Testing GPT-4 with Wolfram Alpha and Code Interpreter plug-ins on math and science problems

GPT-4 and Plug-ins: Problem-Solving Insights

08-16-2023

Large Language Models for Information Retrieval: A Survey

IR Systems Meet LLM Evolution

08-15-2023

Large Language Models in Introductory Programming Education: ChatGPT’s Performance and Implications for Assessments

LLMs Excelling in Programming Education

08-15-2023

GPT-4 IS TOO SMART TO BE SAFE: STEALTHY CHATWITH LLMS VIA CIPHER

CipherChat: LLM Safety Testing Unveiled.

08-12-2023

Confidence-Building Measures for Artificial Intelligence: Workshop Proceedings

Foundation Model Security: Mitigation Strategies

08-03-2023

Validation of a Zero-Shot Learning Natural Language Processing Tool for Data Abstraction from Unstructured Healthcare Data

Efficient NLP Tool Enables Unstructured Data Abstraction

08-02-2023

COMPARATIVE ANALYSIS OF DRUG-GPT™ AND CHATGPT LLMS FOR HEALTHCARE INSIGHTS: EVALUATING ACCURACY AND RELEVANCE IN PATIENT AND HCP CONTEXTS

Comparing GPT Models for Healthcare

08-01-2023

FRONTIER AI REGULATION: MANAGING EMERGING RISKS TO PUBLIC SAFETY

Frontier AI: Balancing Safety & Innovation

07-11-2023

BrickPal: Augmented Reality-based Assembly Instructions for Brick Models

BrickPal: AR Instructions Revolutionize Building.

07-06-2023

Focused Transformer: Contrastive Training for Context Scaling

Focused Transformer unlocks longer contexts.

07-06-2023

Lost in the Middle: How Language Models Use Long Contexts

Language models struggle with long-context.

07-06-2023

Let’s Verify Step by Step

Improving Language Models: Process vs. Outcome

05-31-2023

Inverse scaling can become U-shaped

Larger models improve performance, unlock abilities.

05-24-2023

Least-to-Most   Prompting   Enables   Complex Reasoning in Large Language Models

Least-to-most prompting enables remarkable problem-solving.

04-16-2023

ChatGPT: Applications, Opportunities, and Threats

ChatGPT: Language Generation at its Finest.

04-14-2023

ChatGPT: Applications, Opportunities, and Threats

Autonomous AI generates natural conversations.

04-14-2023

Application and realization of key technologies in China railway e-ticketing system

From paper chaos to digital ease.

04-05-2023

Critical Comparison of Li-Ion Aging Models for Second Life Battery Applications

Reviving Battery Life: Modeling Evaluation

03-26-2023

Influence of Employees’ Intention to Adopt AI Applications and Big Data Analytical Capability on Operational Performance in the High‑Tech Firms

High-tech firms embrace AI revolution.

03-25-2023

Impact of artificial intelligence on marketing

AI in Marketing: Revolutionizing Strategies

03-25-2023

The Contributions of Information and Communications Technology on the Sustainable Development of Artificial Intelligence in the Medical Field

AI in Medicine: Advancements and Challenges

03-24-2023

ChatGPT and a New Academic Reality: AI-Written Research Papers and the Ethics of the Large Language Models in Scholarly Publishing

Revolutionizing research through text-based conversations.

03-21-2023

Role of AI in Business Management

AI transforming business: Efficiency, innovation

03-20-2023

GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models

Revolutionizing the Workforce: LLMs Unleashed

03-17-2023

GPT-4 Technical Report

Experience the future of AI

03-14-2023

LARGER LANGUAGE MODELS DO IN-CONTEXT LEARNING DIFFERENTLY

Semantic priors and input–label mappings shape in-context learning.

03-08-2023

Self-Consistency Improves Chain of Thought Reasoning in Language Models

Self-consistency revolutionizes chain-of-thought.

03-07-2023

Emergent Abilities of Large Language Models

Unpredictable emergence expands language models.

03-01-2023

UL2: Unifying Language Learning Paradigms

Unified framework achieves universal effectiveness.

02-28-2023

The Flan Collection: Designing Data and Methods for Effective Instruction Tuning

Flan 2022 models enhance instruction tuning.

02-14-2023

FINETUNED LANGUAGE MODELS ARE ZERO-SHOT LEARNERS

Improving zero-shot learning in language models.

02-14-2023

Utilization of information technology as a business means for the Sugihmas Village Community

Digital Transformation Empowers Sugihmas Village

01-31-2023

Large Language Models are Zero-Shot Reasoners

LLMs excel as zero-shot reasoners.

01-29-2023

Implementation of Weighted Product and SMART Methods in Determining Strategic Business Locations for SME Entrepreneurs

Location Decision Support System: A Strategic Solution.

01-29-2023

Comparative Analysis of ETL Tools in Big Data Analytics

ETL Tools: Selecting Your Solution

01-24-2023

A Survey on Optimization Techniques for Edge Artificial Intelligence (AI)

Optimizing AI Models: Unleashing Efficiency

01-22-2023

A Review of the Trends and Challenges in Adopting Natural Language Processing Methods for Education Feedback Analysis

AI Revolutionizing Education: Unleashing NLP

01-20-2023

Satisfaction of Cooperative Services in The Digital Era

Cooperative Members thrive through tech.

01-17-2023

Generative Language Models and Automated Influence Operations: Emerging Threats and Potential Mitigations

Collaboration fuels innovation: Cybersecurity insights

01-11-2023

Online Presence Application for Employees of PT. Bringin Karya Sejahtera with the Location-Based Service Method Using Android Studio and MySQL

ATM Attendance App: Improving Absence Tracking

01-04-2023

Large Language Models Encode Clinical Knowledge

Evaluating LLMs for medical applications.

12-26-2022

Point·E: A System for Generating 3D Point Clouds from Complex Prompts

Revolutionize 3D Object Generation: Lightning-Fast Results

12-16-2022

Scaling Instruction-Finetuned Language Models

Instruction finetuning significantly enhances language models.

12-06-2022

Language Models (Mostly) Know What They Know

Language models predict own validity.

11-21-2022

Ask Me Anything: A simple strategy for prompting language models

Aggregate imperfect prompts for efficient prompting.

11-20-2022

Transcending Scaling Laws with 0.1% Extra Compute

UL2R improves language model scaling.

11-16-2022

Transformer Memory as a Differentiable Search Index

DSI transforms retrieval, simplifies process.

10-21-2022

Challenging BIG-Bench tasks and whether chain-of-thought can solve them

CoT prompting enhances language models.

10-17-2022

Mind’s Eye: Grounded Language Model Reasoning through Simulation

Least-to-most prompting enables remarkable problem-solving.

10-11-2022

Can language models learn from explanations in context?

Explanations enhance large LM’s performance.

10-10-2022

Language Models are Multilingual Chain-of-Thought Reasoners

Multilingual models excel in reasoning.

10-06-2022

PaLM: Scaling Language Modeling with Pathways

PaLM’s breakthrough performance revolutionizes language understanding.

10-05-2022

Robust Speech Recognition via Large-Scale Weak Supervision

Models learn speech from internet.

09-21-2022

Efficient Training of Language Models to Fill in the Middle

Revolutionize text infilling with FIM

07-28-2022

A Hazard Analysis Framework for Code Synthesis Large Language Models

Codex: Revolutionizing Code Generation Safely

07-25-2022

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

Watch and Learn: Unleashing Possibilities

06-23-2022

Evolution through Large Models

Revolutionize genetic programming with ELM

06-20-2022

Self-critiquing models for assisting human evaluators

Discover flaws, improve summaries, revolutionize feedback

06-13-2022

BEYOND  THE  IMITATION  GAME:  QUANTIFYIN G  AND  EXTRAPOLATING  THE  CAPABILITIES OF  LANGUAGE  MODELS

Introducing BIG-bench: Beyond current language models.

06-10-2022

Teaching models to express their uncertainty in words

Confidently uncertain: GPT-3’s breakthrough

05-28-2022

Hierarchical Text-Conditional Image Generation with CLIP Latents

Revolutionize image generation with CLIP

04-13-2022

A Recipe for Arbitrary Text Style Transfer with Large Language Models

Zero-shot style transfer with augmented prompting.

03-31-2022

A Research Agenda for Assessing the Economic Impacts of Code Generation Models

Unleashing economic potential with Codex

03-03-2022

Formal Mathematics Statement Curriculum Learning

Master math with expert iteration

02-02-2022

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

Transformers enhance language understanding, performance.

01-21-2022

Mapping Language Models to Grounded Conceptual Spaces

LMs grasp language structure, lack grounding.

01-29-2022

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Chain-of-thought prompting enhances reasoning.

01-21-2022

Show Your Work: Scratchpads for Intermediate Computation with Language Models

Transformers excel at multi-step computations.

01-29-2022

Finetuned Language Models Are Zero-Shot Learners

FLAN: Revolutionizing language models.

01-29-2022

Training language models to follow instructions with human feedback

Aligning language models with you

01-27-2022

Text and Code Embeddings by Contrastive Pre-Training

Revolutionary text embeddings transform search

01-24-2022

WebGPT: Browser-assisted question-answering with human feedback

Revolutionary AI outperforms human knowledge

12-16-2021

TruthfulQA: Measuring How Models Mimic Human Falsehoods

Truth or Deception: Testing Language Models

09-08-2021

Evaluating Large Language Models Trained on Code

Revolutionary Codex transforms code creation

07-07-2021

Process for Adapting Language Models to Society (PALMS) with Values-Targeted Datasets

Transforming language models for society

06-10-2021

Understanding the Capabilities, Limitations, and Societal Impact of Large Language Models

Uncovering the Secrets of GPT-3

02-04-2021

Learning Transferable Visual Models From Natural Language Supervision

Revolutionary AI system learns visually

01-05-2021

Generative Language Modeling for Automated Theorem Proving

Transformers crack theorems: groundbreaking breakthrough

09-07-2020

Measuring Massive Multitask Language Understanding

Multitask test exposes models’ limitations.

01-12-2021

Training Compute-Optimal Large Language Models

Compute-optimal Chinchilla outperforms large language models.

03-29-2022

Language Models are Few-Shot Learners

Scaling GPT-3 enhances few-shot performance.

07-22-2020

Learning to summarize from human feedback

Optimize for human preference, win

09-04-2020

Generative Pretraining from Pixels

Revolutionary model learns powerful images.

06-17-2020

Language Models are Few-Shot Learners

Revolutionary GPT-3 NLP breakthrough

05-28-2020

Measuring the Algorithmic Efficiency of Neural Networks

AI advancements double every 16 months

05-05-2020

Jukebox: A Generative Model for Music

Unleashing Jukebox: Generating High-Fidelity Music

04-30-2020

Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims∗

Verifying Responsible AI: Building Trust

04-16-2020

Scaling Laws for Neural Language Models

Scaling up models: A game-changer

01-23-2020

Dota 2 with Large Scale Deep Reinforcement Learning

AI conquers esports champion title

12-13-2019

Deep Double Descent: Where Bigger Models and More Data Hurt

Deep Learning’s Double Descent Revolutionized!

12-05-2019

Leveraging Procedural Generation to Benchmark Reinforcement Learning

Revolutionizing Reinforcement Learning: Procgen Benchmark

12-03-2019

Benchmarking Safe Exploration in Deep Reinforcement Learning

Robotic agents learn, stay safe!

11-21-2019

Solving Rubik’s Cube With A Robot Robot Hand.

Revolutionary Rubik’s cube solution via ADR

10-17-2019

Leveraging Procedural Generation to Benchmark Reinforcement Learning

Revolutionizing Reinforcement Learning: Procgen Benchmark

12-03-2019

Benchmarking Safe Exploration in Deep Reinforcement Learning

Robotic agents learn, stay safe!

11-21-2019

Solving Rubik’s Cube With A Robot Robot Hand.

Revolutionary Rubik’s cube solution via ADR

10-17-2019

Fine-Tuning Language Models from Human Preferences

Reward learning makes RL practical.

09-19-2019

Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims∗

AI ethics principles lack enforcement

04-16-2020

EMERGENT TOOL USE FROM MULTI-AGENT AUTOCURRICULA

Agents master hide-and-seek, innovate strategy.

09-17-2019

Testing Robustness Against Unforeseen Adversaries

Broaden defense testing, create ImageNet-UA.

Transfer of Adversarial Robustness Between Perturbation Types

Robustness needs diverse perturbation evaluation.

Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents

MMORPG-inspired simulation reveals evolutionary competition.

An Empirical Model of Large-Batch Training

Gradient noise predicts optimal batch size.

Concept Learning with Energy-Based Models

Energy framework unlocks concept learning.

Supervising strong learners by amplifying weak experts

Iterated Amplification: Better Learning Approach.

Learning Dexterous In-Hand Manipulation

RL makes robot hands dexterous.

Learning Policy Representations in Multiagent Systems

Learning agents’ behavior in multi-agent systems.

AI safety via debate

AI learns human goals through debate.

Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines

Improved deep reinforcement learning algorithm.

Some Considerations on Learning to Explore via Meta-Reinforcement Learning

New algorithms improve meta-learning performance.

DeepType: Multilingual Entity Linking by Neural Type System Evolution

DeepType integrates symbolic info for AI.

Research Papers

08-22-2019

Release Strategies and the Social Impacts of Language Models

Staged release mitigates misuse risk.

08-20-2019

The Role of Cooperation in Responsible AI Development

Cooperation crucial for safe AI

07-10-2019

05-03-2019

Generating Long Sequences with Sparse Transformers

Transformers simplified, sequence mastered.

04-23-2019

Implicit Generation and Modeling with Energy-Based Models

Scaling EBMs: Generality, Simplicity, Success.

03-21-2019

03-04-2019

Language Models are Unsupervised Multitask Learners

Language models learn tasks naturally.

02-14-2019

Computational Limitations in Robust Classification and Win-Win Results∗

Robust classifiers: tradeoffs, examples, cryptography.

02-04-2019

03-04-2019

Language Models are Unsupervised Multitask Learners

Language models learn tasks naturally.

12-14-2018

Quantifying Generalization in Reinforcement Learning

Overfitting in RL investigated, solutions found.

12-06-2018

11-07-2018

Plan Online, Learn Offline: Efficient Learning and Exploration via Model-Based Control

Act, learn offline: control complex tasks.

11-05-2018

EXPLORATION BY RANDOM NETWORK DISTILLATION

RND bonus transforms Atari exploration

10-31-2018

10-22-2018

FFJORD: FREE-FORM CONTINUOUS DYNAMICS FOR SCALABLE REVERSIBLE GENERATIVE MODELS

Transforming Simple to Complex Distributions.

10-02-2018

Large-Scale Study of Curiosity-Driven Learning

Curiosity rewards drive successful learning.

08-13-2018

07-30-2018

Variational Option Discovery Algorithms

VALOR discovers options with autoencoders.

07-26-2018

Glow: Generative Flow with Invertible 1×1 Convolutions

Glow: Tractable, Parallelizable, Realistic Synthesis

07-09-2018

06-17-2018

Improving Language Understanding by Generative Pre-Training

Language model improves NLU benchmarks.

06-11-2018

GamePad: A Learning Environment for Theorem Proving

GamePad applies machine learning to Coq theorem proving

06-02-2018

05-03-2018

Evolved Policy Gradients

Metalearning approach for gradient-based RL.

04-18-2018

Gotta Learn Fast: A New Benchmark for Generalization in RL

RL benchmark with Sonic franchise.

04-10-2018

03-20-2018

Improving GANs Using Optimal Transport

OT-GAN uses advanced distance metric.

03-15-2018

On First-Order Meta-Learning Algorithms

Meta-learning algorithms for fast adaptation.

03-08-2018

03-03-2018

Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research

New challenging robotic tasks; Improved RL.

02-26-2018

Interpretable and Pedagogical Examples

Iterative teaching produces interpretable examples.

02-15-2018

02-07-2018

GPU Kernels for Block-Sparse Weights

Optimized GPU kernels accelerate sparse NNs, advance AI models

12-06-2017

LEARNING SPARSE NEURAL NETWORKS THROUGH L0 REGULARIZATION

Neural net pruning for faster training.

12-04-2017