Excel File is Unusually Large and Performance is Poor
A 200-row spreadsheet shouldn’t be 50 MB. When Excel files balloon to many times the size their content suggests, the cause is almost always invisible bloat: a used range that extends far beyond your data, accumulated styles that aren’t actually used, conditional formatting rules that were never cleaned up, or embedded images that didn’t get compressed. The diagnosis is straightforward and the fixes work in place — no third-party tools required for most cases.
This is not corruption. The file is structurally fine. It’s just carrying weight it doesn’t need.
Quick fix
Find your real used range and delete everything beyond it.
- Open the workbook.
- Press Ctrl+End. Excel jumps to what it considers the last used cell.
- If the last used cell is far past your actual data — for example, your data ends at C500 but Ctrl+End takes you to BZ50000 — that’s bloat.
- Click the row header for the first empty row past your data (row 501 in the example).
- Press Ctrl+Shift+Down to extend the selection to the bottom of the sheet.
- Right-click the selection and choose Delete. Confirm if prompted.
- Repeat for excess columns: click the column header for the first empty column past your data, Ctrl+Shift+Right, Delete.
- Press Ctrl+S to save.
- Close the workbook completely, then reopen it.
The close-and-reopen step is essential. Excel doesn’t update the used range until the workbook is closed and reopened — until then, Ctrl+End still takes you to the old position. After the reopen, press Ctrl+End again to confirm the used range now ends at your real data. The file size on disk will have dropped, often dramatically.
If the file size didn’t drop, the bloat is elsewhere. Continue with the diagnostic steps below.
If that didn’t work
Inspect the XLSX as a ZIP archive
XLSX is a ZIP container. Looking inside tells you which part of the workbook is actually large and points directly to the cause.
- Make a copy of the file with a
.zipextension. - Open the copy with 7-Zip, WinRAR, or another archive tool.
- Look at the file sizes inside, especially:
xl/sharedStrings.xml— large if you have huge amounts of unique textxl/styles.xml— large if you have thousands of style definitions (often unused)xl/worksheets/sheet*.xml— large if a single sheet is bloatedxl/media/— embedded imagesxl/pivotCache/— pivot table cached data, can be very largexl/charts/— chart definitions
Whatever’s largest is your bloat source. The remediation depends on which:
- Large styles.xml. Accumulated formatting from copy-pasting from web pages, other workbooks, or old Excel files. Fix by clearing all formatting on cells you don’t need formatted: select cells, Home > Editing > Clear > Clear Formats.
- Large sheet XML. Excessive used range or thousands of named ranges. Used range fix is above. For named ranges, Formulas > Name Manager lists every defined name; delete the ones you don’t recognise.
- Large media folder. Embedded images in their full uncompressed form. See “Compress embedded images” below.
- Large pivot cache. Pivot tables cache their source data inside the workbook, which can double the workbook size if your source range is large. See “Reduce pivot cache” below.
Convert to XLSB binary format
The XLSB format stores Excel data as binary records rather than XML. For workbooks with heavy formula calculation or large data ranges, XLSB files are typically 50–70% smaller than the equivalent XLSX and load substantially faster.
- Open the workbook.
- File > Save As.
- From the file type dropdown, choose Excel Binary Workbook (*.xlsb).
- Save with a new name to keep the XLSX original until you’ve verified the XLSB version works.
XLSB has trade-offs. It’s not as widely supported by third-party tools as XLSX (anything that reads XLSX as a ZIP will fail on XLSB). Macros run identically. Cell content and formulas are preserved exactly. For internal use on workbooks that are too large in XLSX form, the conversion is usually worth it.
Compress embedded images
Images embedded in Excel are stored at full resolution and original format by default. A handful of high-resolution screenshots can add tens of megabytes to a workbook.
- Click any embedded image in the workbook.
- The Picture Format tab appears in the ribbon.
- Click Compress Pictures.
- Untick Apply only to this picture to compress all images at once.
- Choose a resolution — Email (96 ppi) is small; Screen (150 ppi) is the usual sensible default; HD (220 ppi) preserves quality at higher cost.
- Click OK and save.
This is lossy — images are downsampled and re-encoded. Make a backup before compressing if the originals matter.
Remove unused styles and names
Workbooks accumulate style definitions over their lifetime, especially when content is copied between files. Most of those styles are never used but stay in the file.
Excel doesn’t have a built-in “remove unused styles” command, but a copy-paste-special operation effectively does it:
- Create a new blank workbook.
- In the bloated workbook, press Ctrl+A to select all cells on the active sheet, then Ctrl+C.
- In the new workbook, click cell A1.
- Ctrl+Alt+V to open Paste Special. Choose Values to paste only the data without formatting, or Values and number formats if you need to keep number formats.
- Repeat for each sheet you need.
- Save the new workbook.
This rebuilds the file from values, dropping all accumulated style definitions. You lose all formatting in the process, so it’s only appropriate when you’re going to restore formatting deliberately or don’t need it.
For named-range cleanup specifically: Formulas > Name Manager lists every named range. Delete any with #REF! errors (broken references), and any you don’t recognise. A workbook with thousands of named ranges — common in workbooks that have been copied and modified for years — can shed significant size by removing the unused ones.
Reduce pivot cache
If xl/pivotCache/ was the largest folder in your ZIP inspection, your pivot tables are the bloat source.
- Click any cell inside a pivot table.
- Go to PivotTable Analyze > Options > Options.
- On the Data tab, untick Save source data with file.
- Tick Refresh data when opening the file instead.
- Click OK and save.
The pivot cache is dropped from the file. The pivot table refreshes from the source data on open, so the source must remain accessible. If the source is in the same workbook, that’s automatic. If the source is an external file or database, the connection details must remain valid.
This trades file size for open speed (refreshing on open takes time) and source-availability dependency, but for workbooks bloated by pivot caches it dramatically reduces size.
Advanced recovery
If none of the above shrinks the file meaningfully, the workbook may have structural inefficiencies that are harder to address — typically thousands of merged cells, conditional formatting rules referencing entire columns, or accumulated worksheet objects (text boxes, comments, controls) that aren’t visible in normal view.
The Inquire add-in, available in Microsoft 365 and Office Professional Plus, analyses workbooks for these issues:
- File > Options > Add-ins.
- Set Manage to COM Add-ins and click Go.
- Tick Inquire if it’s listed and click OK.
- The Inquire tab appears in the ribbon.
- Click Workbook Analysis for a full report.
Inquire flags problematic structures, broken references, and inefficient patterns. Its Clean Excess Cell Formatting button is particularly useful — it removes formatting from cells beyond the used range that wasn’t picked up by the Ctrl+End reset.
Inquire isn’t available in standard Microsoft 365 Family or Office Home & Business — only in Microsoft 365 Apps for enterprise, Microsoft 365 Apps for business, or Office Professional Plus. Check File > Options > Add-ins > Manage > COM Add-ins to see if it’s installed; if it’s not listed at all, your edition doesn’t include it.
For workbooks where Inquire isn’t available and the file remains stubbornly large after standard remediation, opening in LibreOffice Calc and re-saving as XLSX often produces a smaller file. LibreOffice rewrites the entire workbook structure from scratch on save, dropping unused parts that Excel preserves.
Why this happens
Excel workbooks accumulate bloat over their lifetime through several specific mechanisms.
Used range tracking. Excel maintains a record of the largest range that has ever contained data or formatting on each sheet. If you typed something in cell BZ50000 once, deleted it, then never thought about it again, Excel still considers BZ50000 part of the used range. Every save persists empty cells from A1 all the way to BZ50000 with their formatting metadata. Hundreds of thousands of empty-but-tracked cells add up fast. The fix is the close-and-reopen reset described above — Excel only updates the actual used range when the workbook is closed.
Style accumulation. Each time you copy formatted content from another workbook, web page, or email into Excel, a new style definition is added. Excel doesn’t deduplicate styles aggressively. A workbook that’s been edited heavily for years can have tens of thousands of style definitions, only a handful of which are actually used by visible cells.
Pivot caches. Pivot tables cache their source data inside the workbook for fast refresh. A pivot table with a 100,000-row source effectively doubles the workbook size, since the source data is stored both in its sheet and in the pivot cache. The cache can be disabled, traded for slower opens.
Embedded media at full resolution. Excel embeds images in their original format and resolution. A 4K screenshot saved as PNG is several megabytes; pasted into Excel, it stays several megabytes. The Compress Pictures feature exists specifically because of this default behaviour.
Named ranges referencing whole columns. A named range like Data referring to Sheet1!$A:$A covers a million-plus cells. If hundreds of formulas reference such ranges, calculation slows and file size grows.
Hidden objects. Text boxes, form controls, comments (now called notes), and embedded objects accumulate across edits. Hidden objects are still saved and still consume space. Inquire’s Workbook Analysis surfaces these; without it, they’re effectively invisible.
Preventing this in future
Two habits prevent most workbook bloat.
Be deliberate about formatting. Apply formatting to the cells you need formatted, not to entire columns. Conditional formatting rules in particular should reference specific ranges, not whole columns — a CF rule on A:A rather than A1:A1000 quietly tracks formatting state for a million cells.
Use copy-paste-values for content from external sources. When pasting from web pages or other workbooks, Ctrl+Alt+V > Values drops the formatting baggage that comes with the source. If you need the formatting, paste-special as values first and then apply formatting deliberately afterward.
For workbooks that will live a long time and accumulate edits, run a periodic cleanup. Once a quarter, run the Ctrl+End check, look at the file size, and apply Inquire’s Clean Excess Cell Formatting if available. A few minutes of maintenance prevents the eventual 50 MB workbook.
Related issues
If the workbook isn’t just slow but actually crashes Excel during operations, see Excel crashes when opening a file — large workbooks can trigger memory-related crashes that aren’t strictly the file’s fault. If the bloat is specifically in chart objects or pivot caches, the post-recovery state of those features is covered in Excel charts are missing or broken and Excel pivot tables are broken after recovery.
Last verified: April 2026