sitescripts/stats/common.py - Issue 5843385483001856: Stats processing: don`t create file names that are too long

Keyboard Shortcuts

	File
u :	up to issue
m :	publish + mail comments
M :	edit review message
j / k :	jump to file after / before current file
J / K :	jump to next file with a comment after / before current file
	Side-by-side diff
i :	toggle intra-line diffs
e :	expand all comments
c :	collapse all comments
s :	toggle showing all comments
n / p :	next / previous diff chunk or comment
N / P :	next / previous comment
<Up> / <Down> :	next / previous line
<Enter> :	respond to / edit current comment
d :	mark current comment as done

	Issue
u :	up to list of issues
m :	publish + mail comments
j / k :	jump to patch after / before current patch
o / <Enter> :	open current patch in side-by-side view
i :	open current patch in unified diff view

	Issue List
j / k :	jump to issue after / before current issue
o / <Enter> :	open current issue
# :	close issue

	Comment/message editing
<Ctrl> + s or <Ctrl> + Enter :	save comment
<Esc> :	cancel edit

Unified Diff: sitescripts/stats/common.py

Issue 5843385483001856: Stats processing: don`t create file names that are too long (Closed)

Patch Set: Created Dec. 27, 2013, 7:43 a.m.

Use n/p to move between diff chunks; N/P to move between comments.

Jump to:

View side-by-side diff with in-line comments

Download patch

Index: sitescripts/stats/common.py

===================================================================

--- a/sitescripts/stats/common.py

+++ b/sitescripts/stats/common.py

@@ -10,32 +10,41 @@

# Adblock Plus is distributed in the hope that it will be useful,

# but WITHOUT ANY WARRANTY; without even the implied warranty of

# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the

# GNU General Public License for more details.

# You should have received a copy of the GNU General Public License

# along with Adblock Plus. If not, see <http://www.gnu.org/licenses/>.

-import re

+import re, hashlib

def filename_encode(name):

"""

This encodes any string to a valid file name while ensuring that the

original string can still be reconstructed. All characters except 0-9, A-Z,

the period and underscore are encoded as "-12cd" where "12cd" stands for the

- hexadecimal representation of the character's ordinal.

+ hexadecimal representation of the character's ordinal. File names longer

+ than 200 characters will be still be unique but no longer reversible due to

+ file system limitations.

"""

- return re.sub(r"[^\w\.]", lambda match: "-%04x" % ord(match.group(0)), name)

+ result = re.sub(r"[^\w\.]", lambda match: "-%04x" % ord(match.group(0)), name)

+ if len(result) > 200:

+ hash = hashlib.md5()

+ hash.update(result[200:])

+ result = result[:200] + "--%s" % hash.hexdigest()

+ return result

def filename_decode(path):

"""

This reconstructs a string encoded with filename_encode().

"""

- return re.sub(r"-([0-9a-f]{4})", lambda match: unichr(int(match.group(1), 16)), path)

+ path = re.sub(r"--[0-9A-Fa-f]{32}", u"\u2026", path)

+ path = re.sub(r"-([0-9a-f]{4})", lambda match: unichr(int(match.group(1), 16)), path)

+ return path

basic_fields = [

{

"name": "day",

"title": "Days of month",

"coltitle": "Day",

"showaverage": True,

"sort": lambda obj: sorted(obj.items(), key=lambda (k,v): int(k)),

« no previous file with comments | « no previous file | sitescripts/stats/test/common.py » ('j') | no next file with comments »