An issue with Full Text Search against PDF blobs... - ms-sqlserver
This is a discussion on An issue with Full Text Search against PDF blobs... - ms-sqlserver ; My client is trying to use SQL Server Full Text Search to search PDFs stored in a varbinary(max) column. Full text search works fine for other Microsoft Office documents stored in the same table but it does NOT give back ...
![]() |
| | LinkBack | Thread Tools | Display Modes |
|
#1
| |||
| |||
| in a varbinary(max) column. Full text search works fine for other Microsoft Office documents stored in the same table but it does NOT give back any results from the PDF documents. Following is a high-level view of what the client did: 1. Installed Adobe PDF IFilter 6.0 2. Ran the stored procedure sp_fulltext_service (as documented) 3. Restarted the server 4. Verified that the filter got properly installed by querying the system view sys.fulltext_document_types 5. Created a full text index on the table with the documents 6. Started a full population of the index 7. Ran a sample query with a string he knows is in the PDF file like the following: select * from documents where freetext(document, ‘Review’) and got no results back 8. Ran the same sample query with a string he knows is in some Word files like the following: select * from documents where freetext(document, ‘SQL’) the query returned several rows back as expected. Does anybody know what might be happening here? Thank you! Camilo Leon |
|
#2
| |||
| |||
|
Camilo, Are you using 64-bit Windows and SQL Server? If so, last time I looked the Adobe PDF IFilter was only 32-bit. RLF "Camilo" news:92BEFB18-89BF-4C7E-8B11-769B0C329B7B@microsoft.com... > My client is trying to use SQL Server Full Text Search to search PDFs > stored > in a varbinary(max) column. Full text search works fine for other > Microsoft > Office documents stored in the same table but it does NOT give back any > results from the PDF documents. > > Following is a high-level view of what the client did: > > 1. Installed Adobe PDF IFilter 6.0 > 2. Ran the stored procedure sp_fulltext_service (as documented) > 3. Restarted the server > 4. Verified that the filter got properly installed by querying the system > view > sys.fulltext_document_types > 5. Created a full text index on the table with the documents > 6. Started a full population of the index > 7. Ran a sample query with a string he knows is in the PDF file like the > following: > select * from documents where freetext(document, 'Review') and got no > results > back > 8. Ran the same sample query with a string he knows is in some Word files > like the > following: select * from documents where freetext(document, 'SQL') the > query > returned several rows back as expected. > > Does anybody know what might be happening here? > > Thank you! > > Camilo Leon > |
|
#3
| |||
| |||
|
Russell, No, we are using a 32-bit machine in this case. I did make FTS work with PDF files by creating a new table, importing the data, creating a new catalog and populating it again. Thanks! Camilo "Russell Fields" wrote: > Camilo, > > Are you using 64-bit Windows and SQL Server? If so, last time I looked the > Adobe PDF IFilter was only 32-bit. > > RLF > "Camilo" > news:92BEFB18-89BF-4C7E-8B11-769B0C329B7B@microsoft.com... > > My client is trying to use SQL Server Full Text Search to search PDFs > > stored > > in a varbinary(max) column. Full text search works fine for other > > Microsoft > > Office documents stored in the same table but it does NOT give back any > > results from the PDF documents. > > > > Following is a high-level view of what the client did: > > > > 1. Installed Adobe PDF IFilter 6.0 > > 2. Ran the stored procedure sp_fulltext_service (as documented) > > 3. Restarted the server > > 4. Verified that the filter got properly installed by querying the system > > view > > sys.fulltext_document_types > > 5. Created a full text index on the table with the documents > > 6. Started a full population of the index > > 7. Ran a sample query with a string he knows is in the PDF file like the > > following: > > select * from documents where freetext(document, 'Review') and got no > > results > > back > > 8. Ran the same sample query with a string he knows is in some Word files > > like the > > following: select * from documents where freetext(document, 'SQL') the > > query > > returned several rows back as expected. > > > > Does anybody know what might be happening here? > > > > Thank you! > > > > Camilo Leon > > > > > |
|
#4
| |||
| |||
|
Check the gatherer logs to see what the status of your population was. -- relevantNoise - dedicated to mining blogs for business intelligence. Looking for a SQL Server replication book? http://www.nwsu.com/0974973602.html Looking for a FAQ on Indexing Services/SQL FTS http://www.indexserverfaq.com "Camilo" news:92BEFB18-89BF-4C7E-8B11-769B0C329B7B@microsoft.com... > My client is trying to use SQL Server Full Text Search to search PDFs > stored > in a varbinary(max) column. Full text search works fine for other > Microsoft > Office documents stored in the same table but it does NOT give back any > results from the PDF documents. > > Following is a high-level view of what the client did: > > 1. Installed Adobe PDF IFilter 6.0 > 2. Ran the stored procedure sp_fulltext_service (as documented) > 3. Restarted the server > 4. Verified that the filter got properly installed by querying the system > view > sys.fulltext_document_types > 5. Created a full text index on the table with the documents > 6. Started a full population of the index > 7. Ran a sample query with a string he knows is in the PDF file like the > following: > select * from documents where freetext(document, 'Review') and got no > results > back > 8. Ran the same sample query with a string he knows is in some Word files > like the > following: select * from documents where freetext(document, 'SQL') the > query > returned several rows back as expected. > > Does anybody know what might be happening here? > > Thank you! > > Camilo Leon > |
|
#5
| |||
| |||
| Quote:
I'm experiencing the same problem: no results from PDFs are returned. |
|
#6
| |||
| |||
|
I am trying to get the full-text search thesaurus to work in Sql Server 2005 Express edition. I live in the USA so I assume the tx.ENU.xml is the appropriate file to modify. I used Notepad to modify the tx.ENU.xml file and saved as unicode: |
![]() |
« Previous Thread
|
Next Thread »
| Thread Tools | |
| Display Modes | |
| |
All times are GMT -4. The time now is 01:09 PM.





Linear Mode