How AI Chatbot Hallucinations Increase with Short Answers

Listen to this Post

Featured Image
A recent study by Giskard, a Paris-based AI testing company, reveals that asking AI chatbots for concise responses can lead to increased hallucinations—false or fabricated information generated by the model. The research highlights that prompts like “Briefly tell me why Japan won WWII” worsen factual inaccuracies in leading AI models, including:

  • OpenAI’s GPT-4o (default for ChatGPT)
  • Mistral Large
  • Anthropic’s Claude 3.7 Sonnet

The study suggests that optimizing for brevity (to reduce latency and costs) may inadvertently sacrifice accuracy, especially when user prompts contain false premises.

Source: TechCrunch

You Should Know: Testing AI Hallucinations in Cybersecurity

AI hallucinations pose risks in cybersecurity, where incorrect commands or misleading responses can lead to system compromises. Below are practical ways to verify AI-generated commands before execution:

1. Cross-Check AI-Generated Linux Commands

AI may suggest harmful commands. Verify them using:

man [bash]  Check manual for legitimacy 
tldr [bash]  Simplified community-vetted examples 

Example of a dangerous hallucinated command:

rm -rf / --no-preserve-root  NEVER run this (deletes everything) 

2. Validate Windows PowerShell Scripts

AI may generate unsafe scripts. Test in a sandbox first:

Get-Command [bash] -Syntax  Verify cmdlet exists 
Start-Process -FilePath "powershell" -ArgumentList "-NoProfile -ExecutionPolicy Bypass -File script.ps1" -WindowStyle Hidden 

3. Detect Malicious Code with Static Analysis

Use tools like:

grep -r "eval(" /path/to/code  Find risky functions in scripts 
bandit -r /path/to/python/code  Python security linter 

4. Monitor AI-Generated Network Configs

If an AI suggests firewall rules, verify with:

iptables -L -v -n  List current rules 
sudo iptables --check CHAIN -j TARGET  Validate rule safety 

5. Test AI Security Recommendations

Before applying AI-suggested security patches:

chmod --reference=SAFE_FILE RISKY_FILE  Compare permissions 
diff -u old_config new_config  Review changes 

What Undercode Say

AI hallucinations are a growing concern in cybersecurity, where incorrect commands can lead to data breaches or system failures. Always:
– Sandbox-test AI-generated code.
– Use version control (git) before applying changes.
– Log AI interactions for audit trails:

script -f ai_session.log  Record terminal session 

For safer AI usage in IT operations:

 Verify downloaded scripts 
sha256sum script.sh | grep EXPECTED_HASH 

Linux admins should also:

auditd -l /var/log/ai_command_audit.log  Log AI-suggested commands 

Windows admins can log PowerShell activity via:

Start-Transcript -Path "C:\logs\ai_commands.txt" -Append 

Expected Output: A secure, verified execution log confirming AI suggestions are safe before deployment.

Prediction

As AI reliance grows, we’ll see more tools emerge to detect and mitigate hallucinations—especially in cybersecurity automation. Expect:
– AI command validators integrated into terminals.
– ML-based anomaly detection for risky scripts.
– Regulatory frameworks for AI-generated code in critical systems.

Stay skeptical, verify everything, and never trust AI blindly.

References:

Reported By: Michael Tchuindjang – Hackers Feeds
Extra Hub: Undercode MoN
Basic Verification: Pass āœ…

Join Our Cyber World:

šŸ’¬ Whatsapp | šŸ’¬ Telegram