![]() ![]() The new Excel file appears at the top of the list because of the alphabetical sorting by filename. Refreshing the query shows the newly added filed at the SOURCE step. Remember how the Microsoft Word document appeared at the bottom of the metadata list? We see that we have combined 351 rows of fragmented data into a single table.Įven though we applied a filter to eliminate the non-PDF files from the folder examination, the process only worked because we got a bit lucky. Set the Scan for the Start Delimiter option to “ From the end of the input”.Ĭlick OK and rename the column to “ Store”.Ĭlick Home (tab) -> Close (group) -> Close & Load to send the results to an Excel table on a new sheet. In the Text Between Delimiters dialog box, select an underscore as the Start Delimiter and a period as the End Delimiter.īecause there are multiple underscores in the filenames, we need to search for the underscore character starting from the end of the filename. We will use a transformation tool to extract the text immediately following the last underscore character and before the period just before the file’s extension.Ĭlick Transform (tab) -> Text Column (group) -> Extract -> Text Between Delimiters. The file’s name contains the region name at the end of the filename. ![]() Keeping Track of the Data’s Place of Originīefore we load the results into Excel, we want to ensure that we can properly identify the region from which each transaction originated. ![]() This iterative process will extract and combine the data from each PDF table into a single table. Power Query will automatically create a series of queries, parameters, custom functions, and transformation steps to iterate through each PDF in the folder. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |