Skip to main content

Removing HTML Tags from Text Using SQL Server User-Defined Function

 

Introduction:

In this blog post, we'll explore how to create and use a SQL Server User-Defined Function (UDF) to remove HTML tags from a text string. This function can be handy when you need to extract plain text from HTML content stored in your database.

Creating the Function:

First, let's create the SQL Server UDF named udf_StripHTML. This function takes a VARCHAR(MAX) parameter @HTMLText, which represents the HTML content from which we want to remove the tags. It returns a VARCHAR(MAX) value, representing the text stripped of HTML tags.

sql
SET QUOTED_IDENTIFIER ON GO 
CREATE FUNCTION [dbo].[udf_StripHTML] (@HTMLText VARCHAR(MAX)) RETURNS VARCHAR(MAX) AS BEGIN DECLARE @Start INT 
DECLARE @End INT 
DECLARE @Length INT 
SET @Start = CHARINDEX('<', @HTMLText
SET @End = CHARINDEX('>', @HTMLText, CHARINDEX('<', @HTMLText)) 
SET @Length = (@End - @Start) + 1 
WHILE @Start > 0 AND @End > 0 AND @Length > 0 
BEGIN 
SET @HTMLText = STUFF(@HTMLText, @Start, @Length, ''
SET @Start = CHARINDEX('<', @HTMLText) SET @End = CHARINDEX('>', @HTMLText, CHARINDEX('<', @HTMLText)) 
SET @Length = (@End - @Start) + 1 
END 
RETURN LTRIM(RTRIM(@HTMLText)) 
END 
GO

Using the Function:

Now that we have created the udf_StripHTML function, let's see how we can use it to remove HTML tags from a text string.

sql
-- Example usage of udf_StripHTML function 
DECLARE @HTMLText VARCHAR(MAX) SET @HTMLText = '<p>This is <b>some</b> <i>HTML</i> <u>text</u>.</p>' 
SELECT dbo.udf_StripHTML(@HTMLText) AS PlainText

Conclusion:

In this blog post, we've learned how to create a SQL Server User-Defined Function to remove HTML tags from a text string. This function can be useful in various scenarios where you need to extract plain text from HTML content stored in your database.

Feel free to incorporate this function into your SQL Server environment to simplify text processing tasks involving HTML content.

Comments

Popular posts from this blog

Using SSRS web services to render a report as a PDF

I have been looking around the net for some decent code which would explain how I could render a report, using SSRS 2008 web services as a PDF.   The need was to extract reports sitting on a SSRS 2008 server sitting on a NT domain on a trusted network, whereas my web server was sitting in a DMZ. Where the only communication allowed by the network admin was port 80. To do this you will need to use the SSRS2008   ReportExecution2005.asmx web service. This could be accesses using the following URL assuming your SSRS server was installed using the default settings. http://YourServerIP/reportserver/reportexecution2005.asmx?wsdl 1.        Create a user on your AD domain with the least amount of privileges (say ReportUser) 2.        Give this account browse access on the reporting server for the desired reports. 3.        To get this working in visual studio 2010 (I am using t...

How to Automatically Create SQL Server Views from MySQL Tables Using OPENQUERY (An alternative to ETL)

If you have a linked server from SQL Server to MySQL, you can automate importing data and creating views using dynamic SQL. This is useful when integrating external MySQL data into a Microsoft SQL Server reporting or analytics environment. 🔗 Setup: Linked Server to MySQL Make sure you have already set up your MySQL linked server in SQL Server (for example, named SB ), and that you can run queries like the following: SELECT * FROM OPENQUERY(SB, 'SELECT * FROM your_table'); ⚙️ Goal We want to dynamically create SQL Server views for all base tables in a MySQL database, using a format like: CREATE VIEW [dbo].[lnk_table_name] AS SELECT * FROM OPENQUERY(SB, 'SELECT * FROM table_name WHERE deleted_at IS NULL'); But not all MySQL tables have a deleted_at column. So, we will check whether the column exists before appending the WHERE clause. 🧠 Full SQL Script This SQL Server script loops through all MySQL tables and generates the appropriate view creation stat...