Writing

What I think is the next pocket computing device, after the smartphone

The era of the app is ending. The next leap in personal computing isn't a better phone; it's a new category of device.

2025-07-01

Technology
Generative AI
Product
Future Tech

It’s 7:00 PM on a Tuesday. I’m wrapping up a final work call from my Bengaluru apartment, mind buzzing with product specs and deadlines. The old version of my evening would begin now: a wave of decision fatigue about dinner, chores, and tomorrow’s workout at Cult.Fit.

Tonight, that chaos is gone.

As my call ends, a soft tone plays in my earbud. It’s My Aura. “Roshan,” it says quietly, “your last meeting ran late. Based on your smart fridge’s inventory, the ‘Orchestrator Agent’ has planned a quick paneer stir-fry and pre-filled a BlinkIt cart with the missing ginger-garlic paste. It estimates a 10-minute delivery. Shall I place the order?”

“Confirm,” I say. That entire chain of mental effort has simply vanished.

This is the future the tech world is racing towards. It’s the vision that likely drives Sam Altman and Jony Ive’s new AI hardware venture and why they called the first wave of AI gadgets “very poor products.” They didn’t solve this fundamental problem: the immense, invisible cognitive load of managing modern life.

This is my attempt to think through the device that will. It’s a product vision for a Cognitive Co-Processor, a “context sponge”, which I will call call “The Aura”.

1. The Three Challenges Holding AI Back Today

The software is ready. AI “agents” can now make phone calls, spin up marketing campaigns, and accomplish complex tasks with breathtaking speed. With nearly “unlimited context windows,” the bottleneck is no longer the AI’s memory. The bottleneck is the physical world.

For AI to move from a novelty to a truly integrated part of our lives, we must solve three fundamental challenges that our current hardware, especially the smartphone, is completely unequipped to handle. This is my attempt to think through the device that will solve them.

Challenge 1: The Context Funnel is Broken

Today, feeding real-world context to an AI is a point of immense friction. To get help with a physical object, you have to unlock your phone, open an app, grant camera access, and point it at the object. This multi-step, high-attention workflow is a barrier for all but the most dedicated tech enthusiasts. We need to radically oversimplify the process of context capture.

Challenge 2: The Speed of Light is Too Slow

The second challenge is the speed of processing. Most powerful LLMs are cloud-based. The round-trip time for data to travel to a server and back introduces latency that makes real-time interaction feel clunky and unnatural. This is precisely why early AI devices like the Humane AI Pin and Rabbit R1 failed to deliver on their promise. For an AI to be a seamless part of our reality, it needs to think on-device. We need compact hardware that can house a personalized AI, always on and instantly responsive.

Challenge 3: The Physical World Needs an API

The third challenge is making AI agents useful outside of the digital world. Currently, an agent can operate software that has a “Model-Context Protocol” (MCP)—an interface for AI. But the physical world lacks this. We have “smart devices,” but they are not “agent-friendly.” We need a future where a smart fridge doesn’t just have Wi-Fi, but an MCP that allows an agent to look inside with a camera, remotely change the temperature, so that it can plan to have ice cubes ready before a house party. Solving this will enable a local, personal AI to orchestrate our environment and automate our lives.

This is a product vision for a Cognitive Co-Processor designed to solve these three challenges. A device I call My Aura.

2. Can We Actually Build This Today?

So, is this “Aura” device just a futuristic dream? Or could a dedicated team, like the one Sam Altman and Jony Ive are building at IO, actually engineer this with technology that exists right now?

Let’s do the math. Let’s get nerdy and build a bill of materials for our Cognitive Co-Processor based on the best components available in 2025.

2.1. The On-Device Brain: The Chip

We need a mobile processor (SoC) with a powerful Neural Processing Unit (NPU) designed for sustained AI workloads.

The Component: The Qualcomm Snapdragon 8 Gen 4. Its NPU is the current market leader, capable of roughly 75-80 TOPS (Trillion Operations Per Second) of peak AI performance.
The Verdict: The raw power is there.

2.2. The On-Device Mind: The Model

We need a lightweight, efficient Large Language Model (LLM) to handle real-time reasoning without needing the cloud.

The Component: Google’s Gemma 2B, a 2-billion parameter model. It’s small enough to run locally but powerful enough for summarization, transcription, and function calling.
The Verdict: The software exists and is open-source.

2.3. The Reality Check: The Sustained Workload Calculation

This is where it gets interesting. The chip’s NPU doesn’t just run the LLM. It has to do three demanding jobs simultaneously in its “Active AI” state:

The Perception Tax: Fusing and analyzing real-time data from cameras, microphones, and sensors is computationally expensive. Estimated Load: ~40 TOPS.
The Reasoning Job: Running the Gemma 2B model for on-device thinking. Estimated Load: ~4 TOPS.
The Action Job: Running smaller, specialized models for the active “agent” (like the Social Mirror’s sentiment analysis). Estimated Load: ~10 TOPS. Total Sustained Load: 40 + 4 + 10 = 54 TOPS

This means we’d be running the best mobile chip on the market at ~70% of its peak capacity, continuously. This is a massive but theoretically manageable load. The real challenges it creates are heat and power.

2.4. The Power & Heat Solution

The Challenge: Running the chip at this intensity would generate about 6 Watts of heat. To keep it from overheating and throttling, we need a serious cooling solution. To power it for a demanding workday (e.g., 4 hours of active use), we need a massive battery.
The Specification:
- Cooling: A sophisticated Thermal Stack combining a Graphene heat spreader and a custom-designed Vapor Chamber.
- Battery: A 30 Wh (~8,000 mAh) Silicon-Carbon (Si-C) Anode Battery. This is about 50% more capacity than a flagship smartphone.
The Verdict: The thermal and battery technology exists. The engineering challenge is one of integration and miniaturization.

2.5. The Final Size Estimation

So, what does a device with these high-end components, including a full sensor suite and connectivity, look like?

(Table from Notion — see original for full data.)

Conclusion: It’s Possible.

For context, a standard deck of playing cards is also about 100 cm³.

The math shows that building Aura isn’t a matter of waiting for a magical invention. All the core components exist today. The challenge is one of extreme engineering: integrating a powerful processor, a full suite of sensors, and a massive battery into a compact, pocketable form factor, all while designing a thermal system that can handle the immense heat generated by a constantly working AI brain.

It is an incredibly difficult task, but it is not science fiction. It is a solvable engineering problem. And the team that solves it first will define the next era of personal computing.

Here’s how it “could look”

3. The Great Unbundling of the Smartphone

For fifteen years, the smartphone has been bundling features. It’s our camera, navigator, communicator, and computer. It’s a jack of all trades, but a master of none. The future isn’t a better bundle; it’s a “Great Unbundling.” We need specialized devices that work in harmony.

The Smartphone: For active, high-attention tasks (reading, watching, typing).
The Smartwatch: For glanceable notifications and biometrics.
The Smartglasses: For heads-up, contextual overlays that merge digital information with your direct field of view.
My Aura: For passive, low-friction context capture and environmental orchestration. This new device shouldn’t aim to replace the phone. Its job is to do the one thing the phone can’t: seamlessly bridge the gap between our digital AI and our physical reality. We need it to “do stuff” outside the phone screen and not just “talk”.

(Table from Notion — see original for full data.)

3.1. The Great Integration: Making Your Ecosystem Smarter

The purpose of the Cognitive Co-Processor is not to replace your phone or watch. Its purpose is to be the central, context-aware brain that makes your entire personal ecosystem of devices exponentially more powerful. It’s the hub for a new kind of distributed computing, where each device does what it does best.

The Smartphone: Your Command Center & Visual Canvas Aura supercharges your phone. It passively gathers context throughout your day. When you pick up your phone, that context is instantly available to your apps. The phone becomes the primary screen for viewing the insights Aura has gathered. Instead of being a device for manual data entry, it becomes a powerful dashboard for your life. For example, after a meeting, a full transcript and summary will be waiting in your Notion app, ready for review.
The Headset (Earbuds): Your Private Audio Layer Your earbuds become the most intimate output channel for Aura. This connection enables a new class of real-time, private coaching. The “Social Mirror” agent analyzes a room’s engagement and can whisper in your earbud: “Your pace is a bit fast. Take a breath. You’re losing the attention of the person on the left.”
The Smartwatch: Your Haptic & Biometric Link The connection becomes a two-way street. Aura uses the watch’s haptic engine for silent, contextual alerts (e.g., a specific buzz for “you left your keys”). More importantly, the watch constantly feeds your biometric data (heart rate, HRV) to Aura. This allows the AI to correlate external events with your internal state, providing objective data on what truly causes you stress or excitement.
AR/VR Glasses: Your Visual Augmentation Layer This is the ultimate evolution. Aura does the heavy lifting of identifying objects and people, and the AR glasses become a lightweight display that overlays the AI’s insights onto your field of view. For example, as you look at a new acquaintance, their name, company, and a key phrase from your last conversation appears subtly in your peripheral vision.

4. The Rise of “Agents”, not Apps

In this new ecosystem, you don’t use “apps” on Aura; you enable “agents”—specialized, autonomous processes that run in the background to proactively assist you.

The Scribe Agent: The evolution of note-taking, handling transcription and summarization for meetings, lectures, or doctor’s visits.
The Focus Agent: Protects your attention by automatically silencing non-critical notifications during “Deep Work” blocks in your calendar.
The Orchestrator Agent: Your universal remote for the real world, interacting with “Agent-Ready” devices to order groceries or find your keys.
The Guardian Agent: A safety-focused agent that can detect distress signals and automatically send your location to an emergency contact.

5. A New OS: The Digital Nervous System

Why can’t this just be built on Android? Because an OS designed for apps is the wrong tool for a world of agents. An app is a tool you open; an agent is a process you enable. This requires a fundamentally different architecture.

A smartphone OS is a reactive system designed to manage siloed apps on a screen. The OS for Aura must be a proactive, Real-Time Operating System (RTOS) designed for one primary purpose: to manage a continuous, high-volume flood of real-world data and make it available to a community of agents.

This “Digital Nervous System” has three core design principles that make it different:

Shared Context, Not Siloed Data: In Android, every app has its own data. On Aura’s OS, there is one central, unified stream of context—the “Hippocampus.” All authorized agents tap into this same stream. The “Scribe Agent” and the “Social Mirror” are looking at the exact same meeting data, but using it for different functions. This allows for incredibly powerful, cross-functional insights.
Real-Time, Guaranteed Performance: A smartphone OS is “fair,” but it makes no guarantees about when a task will run. An RTOS provides a deterministic guarantee. This means it can ensure that the “Guardian Agent” will process a distress signal within a specific 10-millisecond window, without fail. This reliability is non-negotiable for a device that is deeply integrated into your life.
Granular Power Management: The OS is co-designed with the hardware to manage power at a component-by-component level. It’s not just “screen on” or “screen off.” It can keep the microphone’s sensor hub active while the main processor is in a deep sleep state, enabling “always ready” functionality while consuming mere milliwatts of power.

7. Life After the Sponge: A Story from 2040

My morning starts not with an alarm, but with a gentle nudge from my AI. It knows my sleep cycle. As I make coffee, I ask, “What’s the most important thing I need to focus on today?” It replies, “The product review at 10 AM. Based on your conversations this week, the key friction point is the database migration. I’ve compiled the three most relevant data points from our past projects to support your argument. They are on your tablet.”

I walk into the review with focus, not anxiety. I can engage fully with the people because I’ve offloaded the burden of perfect recall to my cognitive co-processor.

But one afternoon, I leave my Sponge at work. The feeling is immediate and visceral. It’s not like forgetting my phone; that’s an inconvenience. This is a feeling of cognitive amputation. A low-level anxiety I haven’t felt in years creeps back in. I realize the new divide isn’t about who has information. It’s about who has offloaded the burden of information. We haven’t just outsourced memory; we’ve outsourced cognitive load. Humans haven’t become dumber; we’ve just reallocated our mental resources to higher-level thinking.

The question is no longer if we will have a second brain, but what we will choose to do with our first one when it’s finally free. What will you think about when you no longer have to remember?

← All writing