
Detecting duplicate URLs? SEOmoz's API will do that for you...

URL cruft is much more of an aesthetic thing. See http://www.mattcutts.com/blog/clean-up-extra-url-parameters-...



There is an RFC section dedicated to URL normalization, so this doesn't need an API:

http://tools.ietf.org/html/rfc3986#section-6

I am currently working on a urllib for Python that is RFC 3986 compliant.


Anywhere I can follow progress on that? I'd love a Python-based URL normalization tool for http://SharedCount.com


Email me and I'll ping you once it is up (email in profile).



