User Tools

Site Tools


regex:file_path:windows

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
regex:file_path:windows [2021/05/20 23:12] peterregex:file_path:windows [2021/05/26 15:26] (current) peter
Line 2: Line 2:
  
 <code> <code>
-^(\w:\\$)|^([A-Za-z_\-\s0-9\.\$]+)$|^(?:[\w]\:|\\)(\\(?!\.)(?!\s[a-z_0-9-])[A-Za-z_\-\s0-9\.\$]+)+$ +^(?:[a-zA-Z]:\\$)|^(?:[a-zA-Z]\:|\\)(\\(?!\.)(?!\s[a-zA-Z_0-9-\.])[a-zA-Z_\-\s0-9\.\$]+)+$ 
-^(\w:\\$)|^(?:[\w]\:|\\)(\\(?!\.)(?!\s[a-z_0-9-])[A-Za-z_\-\s0-9\.\$]+)+$+ 
 +or  
 + 
 +^(\w:\\$)|^(?:[\w]\:|\\)(\\(?!\.)(?!\s[a-zA-Z_0-9-])[a-zA-Z_\-\s0-9\.\$]+)+$
 </code> </code>
  
 <WRAP info> <WRAP info>
 **NOTE:** This is made up of: **NOTE:** This is made up of:
 +
 +  * **<nowiki>^(?:[a-zA-Z]:\\$)</nowiki>**:  Caters for <nowiki>C:\</nowiki>.
 +  * **<nowiki>^(?:[a-zA-Z]\:|\\)(\\(?!\.)(?!\s[a-zA-Z_0-9-\.])[a-zA-Z_\-\s0-9\.\$]+)+$</nowiki>**:  Cater for UNC and standard files with a directory name.
  
   * **<nowiki>^(\w:\\$)</nowiki>**:  Caters for <nowiki>C:\</nowiki>.   * **<nowiki>^(\w:\\$)</nowiki>**:  Caters for <nowiki>C:\</nowiki>.
-  * **<nowiki>^([A-Za-z_\-\s0-9\.\$]+)$</nowiki>**:  Caters for plain filenames, such as file.txt or file. +  * **<nowiki>^(?:[\w]\:|\\)(\\(?!\.)(?!\s[a-zA-Z_0-9-])[A-Za-z_\-\s0-9\.\$]+)+$</nowiki>**:  Cater for UNC and standard files with a directory name.
-  * **<nowiki>^(?:[\w]\:|\\)(\\(?!\.)(?!\s[a-z_0-9-])[A-Za-z_\-\s0-9\.\$]+)+$</nowiki>**:  Cater for UNC and standard files with a directory name.+
 </WRAP> </WRAP>
  
Line 59: Line 64:
 </code> </code>
  
-</WRAP>+To allow the top two entries to also be accepted as valid entries change the regex to include:
  
----- +  * **<nowiki>^([A-Za-z_\-\s0-9\.\$]+)$</nowiki>**:  Caters for plain filenames, such as file.txt or file.
- +
- +
-----+
  
 <code> <code>
-^(?:[\w]\:|\\)(\\(?!\.+)[A-Za-z_\-\s0-9\.\$]+)+$+^(\w:\\$)|^([A-Za-z_\-\s0-9\.\$]+)$|^(?:[\w]\:|\\)(\\(?!\.)(?!\s[a-z_0-9-])[A-Za-z_\-\s0-9\.\$]+)+$
 </code> </code>
  
- +To allow the very bottom entry here:  **<nowiki>\\server\filename\{token}\file</nowiki>**:
-^\\{2}[\w-.]+(\\{1}(([\w-][\w-\s]*[\w-]+[$$]?)|([\w-][$$]?$)))+$ +
- +
-Works.... +
-^(?:[\w]\:|\\)(\\[a-z_\-\s0-9\.]+)+\.?(.*)?$ +
- +
-^(?:[\w]\:|\\)(\\[a-z_\-\s0-9\.]+)+\.*\$? +
- +
- +
- +
-([a-z_\-\s\0-9\.\\]+)+([a-z_\-\s\0-9\.]+)(\\)([a-z_\-\s\0-9]+)$+
  
 <code> <code>
-^(?:[\w]\:|\\)(\\[a-z_\-\s0-9\.]+)+\.(txt|gif|pdf|doc|docx|xls|xlsx)$ +^(?:[a-zA-Z]:\\$)|^(?:[a-zA-Z]\:|\\)(\\(?!\.)(?!\s[a-zA-Z_0-9-\.])\{?[a-zA-Z_\-\s0-9\.\$]+)+(\}\\([a-zA-Z_\-\s0-9\.\$])+?)?$
- +
-^(?:[\w]\:|\\)(\\[a-z_\-\s\0-9\.]+)+(?:\\)([a-z_\-\s\0-9]+){1}$+
 </code> </code>
  
-<code> 
-(?:(?:[a-z]:|\\\\[a-z0-9_.$\●-]+\\[a-z0-9_.$\●-]+)\\|  # Drive 
-   \\?[^\\/:*?"<>|\r\n]+\\?                          # Relative path 
-(?:[^\\/:*?"<>|\r\n]+\\)*                              # Folder 
-[^\\/:*?"<>|\r\n]*                                     # File 
- 
- 
-^(?:[\w]\:\\|\\\\)([a-z0-9_.$\s-]+\\[a-z0-9_.$\.-]+\\|\\?[^\\/:*?"<>|\r\n]+\\?)(?:[^\\/:*?"<>|\r\n]+\\)*[^\\/:*?"<>|\r\n]*$ 
-</code> 
- 
- 
-===== Windows ===== 
- 
-<code bash> 
-(\\\\([a-z|A-Z|0-9|-|_|\s]{2,15}){1}(\.[a-z|A-Z|0-9|-|_|\s]{1,64}){0,3}){1}(\\[^\\|\/|\:|\*|\?|"|\<|\>|\|]{1,64}){1,}(\\){0,} 
-</code> 
- 
-<WRAP info> 
-**NOTE:**  Disallows a few characters: **<nowiki>\/:*?"<>|</nowiki>**. 
-</WRAP> 
- 
----- 
- 
-<code bash> 
-^((?:[a-z]:\\$|(?:[a-z]:|\\\\[a-z]+\\(?!\.)[^\r\n$<>]+\$?)))((\\|(\\(?!\.)[^\r\n<>\\]+)*)(?<!\\)$)  
-</code> 
- 
----- 
- 
-<code bash> 
-^[a-zA-Z]:\\(((?![<>:"/\\|?*]).)+((?<![ .])\\)?)*$ 
-</code> 
- 
-<WRAP info> 
-**NOTE:**  It makes the path conform to the NTFS standard (see the MSDN spec). 
- 
-  * **<nowiki>^[a-zA-Z]:\\</nowiki>** matches single drive letter, with colon and backslash 
- 
-  * **<nowiki>(?![<>:"/\\|?*])</nowiki>** is a negative lookahead to ensure the next character is not invalid 
- 
-  * **<nowiki>((?![<>:"/\\|?*]).)+</nowiki>** wraps that lookahead, followed by the next character, any number of times 
- 
-  * **<nowiki>(?<![ .])\\</nowiki>** is a negative lookbehind to ensure the file/directory doesn't end with a space or period. Please note: Lookbehinds are not fully implemented everywhere just yet. 
- 
-All of that is is repeated 0 to many times, with the last backslash optional. 
- 
-For many use cases it may be best to restrict the path length to 256 characters. 
- 
-  * To do so, replace <nowiki>*with {0,256}</nowiki>. 
 </WRAP> </WRAP>
  
regex/file_path/windows.1621552361.txt.gz · Last modified: 2021/05/20 23:12 by peter

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki