Skip to main content

Removing HTML Tags from Text Using SQL Server User-Defined Function

 

Introduction:

In this blog post, we'll explore how to create and use a SQL Server User-Defined Function (UDF) to remove HTML tags from a text string. This function can be handy when you need to extract plain text from HTML content stored in your database.

Creating the Function:

First, let's create the SQL Server UDF named udf_StripHTML. This function takes a VARCHAR(MAX) parameter @HTMLText, which represents the HTML content from which we want to remove the tags. It returns a VARCHAR(MAX) value, representing the text stripped of HTML tags.

sql
SET QUOTED_IDENTIFIER ON GO 
CREATE FUNCTION [dbo].[udf_StripHTML] (@HTMLText VARCHAR(MAX)) RETURNS VARCHAR(MAX) AS BEGIN DECLARE @Start INT 
DECLARE @End INT 
DECLARE @Length INT 
SET @Start = CHARINDEX('<', @HTMLText
SET @End = CHARINDEX('>', @HTMLText, CHARINDEX('<', @HTMLText)) 
SET @Length = (@End - @Start) + 1 
WHILE @Start > 0 AND @End > 0 AND @Length > 0 
BEGIN 
SET @HTMLText = STUFF(@HTMLText, @Start, @Length, ''
SET @Start = CHARINDEX('<', @HTMLText) SET @End = CHARINDEX('>', @HTMLText, CHARINDEX('<', @HTMLText)) 
SET @Length = (@End - @Start) + 1 
END 
RETURN LTRIM(RTRIM(@HTMLText)) 
END 
GO

Using the Function:

Now that we have created the udf_StripHTML function, let's see how we can use it to remove HTML tags from a text string.

sql
-- Example usage of udf_StripHTML function 
DECLARE @HTMLText VARCHAR(MAX) SET @HTMLText = '<p>This is <b>some</b> <i>HTML</i> <u>text</u>.</p>' 
SELECT dbo.udf_StripHTML(@HTMLText) AS PlainText

Conclusion:

In this blog post, we've learned how to create a SQL Server User-Defined Function to remove HTML tags from a text string. This function can be useful in various scenarios where you need to extract plain text from HTML content stored in your database.

Feel free to incorporate this function into your SQL Server environment to simplify text processing tasks involving HTML content.

Comments

Popular posts from this blog

Using SSRS web services to render a report as a PDF

I have been looking around the net for some decent code which would explain how I could render a report, using SSRS 2008 web services as a PDF.   The need was to extract reports sitting on a SSRS 2008 server sitting on a NT domain on a trusted network, whereas my web server was sitting in a DMZ. Where the only communication allowed by the network admin was port 80. To do this you will need to use the SSRS2008   ReportExecution2005.asmx web service. This could be accesses using the following URL assuming your SSRS server was installed using the default settings. http://YourServerIP/reportserver/reportexecution2005.asmx?wsdl 1.        Create a user on your AD domain with the least amount of privileges (say ReportUser) 2.        Give this account browse access on the reporting server for the desired reports. 3.        To get this working in visual studio 2010 (I am using t...

Creating a Link Server to MYSQL from MSSQL

There might be situations where you need to apply CRUD  operations ( Create,   Read,   Update,   Delete ) on data residing in a MYSQL server database from SQL server. In order to do this you will need to create a link service from SQL server to MYSQL. The Steps are outlined below: Steps: 1. Install the MYSQL ODBC driver on the server with the MSSQL server installation. 2. Enter details for the MYSQL ODBC settings. 3. Create a link service to MYSQL using SSMS ( SQL Server Management Studio) 4. You will need an user account on the mysql server which can perform CRUD operations  along with enable remote connections to the MYSQL installation. (See Troubleshooting steps below) 4. Test the connection. Install MYSQL ODBC Driver  Download   (http://dev.mysql.com/downloads/connector/odbc/) and install the latest MYSQL ODBC driver from the MYSQL server site. Select the version which corresponds to your installation of MSSQL usually the 64bit version. Note This...