How to convert HTML to text in cells in Excel?
As the screenshot below shows, if cells in your worksheet contain many HTML tags, how can you convert them to plain text in Excel? This article shows two methods for removing all HTML tags from cells in Excel.
Convert HTML to text in selected cells with the Find and Replace function
Convert HTML to text in the whole worksheet with VBA
Convert HTML to text in cells with the Find and Replace function
You can remove all HTML tags from cells with the Find and Replace function in Excel. Please do as follows.
1. Select the cells whose HTML you want to convert to text, and press Ctrl + F to open the Find and Replace dialog box.
2. In the Find and Replace dialog box, go to the Replace tab, enter <*> in the Find what box, leave the Replace with box empty, and click the Replace All button. See screenshot:
3. A Microsoft Excel dialog box then pops up to tell you how many replacements were made. Click the OK button and close the Find and Replace dialog box.
All HTML tags are now removed from the selected cells, as the screenshot below shows.
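The same wildcard replacement can also be scripted. The following is a minimal sketch, assuming the cells to clean are currently selected. The macro name RemoveTagsInSelection is hypothetical and is not part of the steps above. It passes the same <*> pattern used in the Find what box to the Range.Replace method.

```vba
Sub RemoveTagsInSelection()
    'Hypothetical helper: mirrors the Find and Replace steps on the current selection
    'Do nothing if the selection is a chart, shape, or other non-cell object
    If TypeName(Selection) <> "Range" Then Exit Sub
    'LookAt:=xlPart lets the pattern match inside a cell's contents;
    '"<*>" is Excel's wildcard form for "<", any characters, then ">"
    Selection.Replace What:="<*>", Replacement:="", LookAt:=xlPart, MatchCase:=False
End Sub
```

To try it, paste the macro into a module (Alt + F11, then Insert > Module), select the cells in the worksheet, and run it with F5.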
Convert HTML to text in the whole worksheet with VBA
Alternatively, you can convert all HTML to text across the whole worksheet at once with the VBA code below.
1. Open the worksheet that contains the HTML you want to convert to text, then press Alt + F11 to open the Microsoft Visual Basic for Applications window.
2. In the Microsoft Visual Basic for Applications window, click Insert > Module, then copy the VBA code below into the Module window.
VBA code: Convert HTML to text in the whole worksheet
Sub RemoveHTMLTags()
'Update by Extendoffice 20180703
    'Requires the regular expressions reference enabled in step 3
    Dim xRg As Range
    Dim xCell As Range
    Dim xRegEx As RegExp
    Set xRegEx = New RegExp
    With xRegEx
        .Global = True
        'Match one tag; quoted attribute values may contain ">"
        .Pattern = "<(""[^""]*""|'[^']*'|[^'"">])*>"
    End With
    'SpecialCells raises an error when no matching cells exist,
    'so trap it and exit quietly in that case
    On Error Resume Next
    Set xRg = Cells.SpecialCells(xlCellTypeConstants, xlTextValues)
    On Error GoTo 0
    If xRg Is Nothing Then Exit Sub
    Application.EnableEvents = False
    For Each xCell In xRg
        'Rewrite only the text cells that actually contain a tag
        If xRegEx.Test(xCell.Value) Then
            xCell.Value = xRegEx.Replace(xCell.Value, "")
        End If
    Next
    Application.EnableEvents = True
End Sub
3. Still in the Microsoft Visual Basic for Applications window, click Tools > References, check the Microsoft VBScript Regular Expressions 5.5 option in the References - VBAProject dialog box, and then click the OK button.
4. Press the F5 key or click the Run button to run the code.
All HTML tags are then removed from the whole worksheet.
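If you would rather keep the original cells and show the cleaned text in another column, the same pattern can be wrapped in a worksheet function. This is a minimal sketch, assuming Excel on Windows, where the VBScript.RegExp object is available. The function name StripHtmlTags is hypothetical. It creates the object with CreateObject (late binding), so the Tools > References step is not needed for this function.

```vba
Function StripHtmlTags(ByVal xText As String) As String
    'Hypothetical helper: returns xText with HTML tags removed
    'Late binding: the RegExp object is created at run time, no reference required
    Dim xRegEx As Object
    Set xRegEx = CreateObject("VBScript.RegExp")
    With xRegEx
        .Global = True
        'Same tag pattern as the RemoveHTMLTags macro above
        .Pattern = "<(""[^""]*""|'[^']*'|[^'"">])*>"
    End With
    StripHtmlTags = xRegEx.Replace(xText, "")
End Function
```

After pasting it into a module, enter =StripHtmlTags(A1) in an empty cell. If A1 contains <p>Hello <b>world</b></p>, the formula returns Hello world.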