Next Steps
Some of my to-do items include …
Things to Fix
- http://localhost:1313/snapshots/edu/www.njcu.edu/
- http://localhost:1313/snapshots/edu/www.ollusa.edu/
- http://localhost:1313/snapshots/edu/www.shawuniversity.edu/
- http://localhost:1313/snapshots/edu/www.nyu.edu/
- http://localhost:1313/snapshots/edu/www.snhu.edu/
Things to Link to
Vocabs
- Validation (Valid|Invalid|Unknown)
- Technology Taxonomy?
- HTTP headers
- x-generator
- x-powered-by
- server
- via
- set-cookie
- metatags
- generator
- twitter:card
- viewport
- json
- domain->ipv6
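As a starting point for these vocabularies, here is a minimal sketch of pulling the listed HTTP headers and meta tags out of an already-fetched response. The names `extract_signals` and `MetaTagCollector` are hypothetical, and the sketch assumes the headers arrive as a plain dict and the page as an HTML string; it uses only the Python standard library.

```python
from html.parser import HTMLParser

# Header and meta-tag names taken from the Vocabs list above.
HEADER_NAMES = ("x-generator", "x-powered-by", "server", "via", "set-cookie")
META_NAMES = ("generator", "twitter:card", "viewport")


class MetaTagCollector(HTMLParser):
    """Collects <meta name=... content=...> pairs for the names of interest."""

    def __init__(self):
        super().__init__()
        self.found = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values can be None (e.g. <meta name>), so default to "".
        attr_map = {key.lower(): (value or "") for key, value in attrs}
        name = attr_map.get("name", "").lower()
        if name in META_NAMES:
            self.found[name] = attr_map.get("content", "")


def extract_signals(headers, html):
    """Return the headers and meta tags of interest as a plain dict."""
    # Header names are case-insensitive, so compare them in lowercase.
    lowered = {key.lower(): value for key, value in headers.items()}
    collector = MetaTagCollector()
    collector.feed(html)
    return {
        "headers": {n: lowered[n] for n in HEADER_NAMES if n in lowered},
        "metatags": collector.found,
    }
```

The result could then be mapped onto a technology taxonomy, for example by treating a `generator` value as the CMS name; that mapping is not attempted here.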
Related Links (via Alexa)
- Add ‘em in
Content Analysis
- Word Count
- Paragraph Count
- Flesch-Kincaid Reading Ease
- Gunning Fog Score
- Coleman-Liau Index
- SMOG Readability Index
- Automated Readability Index
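The metrics above can be sketched with the standard published formulas. This is a rough illustration, not a tuned implementation: the function names are hypothetical, sentences are split on `.`, `!`, and `?`, only alphabetic words are counted, and the syllable counter is a crude vowel-run heuristic. The reading-ease value uses the Flesch Reading Ease formula, and the Automated Readability Index here counts letters only rather than letters and digits.

```python
import math
import re


def count_syllables(word):
    """Rough heuristic: count runs of vowels, dropping a trailing silent 'e'."""
    word = word.lower()
    if word.endswith("e") and not word.endswith("le"):
        word = word[:-1]
    return max(1, len(re.findall(r"[aeiouy]+", word)))


def readability(text):
    """Return word/paragraph counts and five readability scores for text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    paragraphs = [p for p in re.split(r"\n\s*\n", text) if p.strip()]
    if not sentences or not words:
        raise ValueError("text needs at least one word and one sentence")

    n_sent, n_words = len(sentences), len(words)
    letters = sum(len(re.sub(r"[^A-Za-z]", "", w)) for w in words)
    syllables = [count_syllables(w) for w in words]
    # "Complex" / polysyllabic words: three or more syllables.
    complex_words = sum(1 for s in syllables if s >= 3)
    words_per_sentence = n_words / n_sent

    return {
        "word_count": n_words,
        "paragraph_count": len(paragraphs),
        "flesch_reading_ease": 206.835
        - 1.015 * words_per_sentence
        - 84.6 * sum(syllables) / n_words,
        "gunning_fog": 0.4 * (words_per_sentence + 100 * complex_words / n_words),
        "coleman_liau": 0.0588 * (100 * letters / n_words)
        - 0.296 * (100 * n_sent / n_words)
        - 15.8,
        "smog": 1.0430 * math.sqrt(complex_words * 30 / n_sent) + 3.1291,
        "automated_readability": 4.71 * letters / n_words
        + 0.5 * words_per_sentence
        - 21.43,
    }
```

Scores on very short texts are not meaningful (SMOG in particular was designed for samples of 30 sentences), so treat the output as indicative only.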
Link Analysis
- Add data from my sifted index
Alexa Data
- Alexa Speed: 891
- Alexa Links In: 60 429
- Alexa Reach: 1 897
- Alexa Popularity: 1 673
Fine Tuning Hugo
Fine tuning this site by learning more about Hugo
- Front Matter
- weight
- summary - http://gohugo.io/content/summaries/
- summary divider
- Sections = Types (but can be overridden)
- List templates - can be overridden
- By default, content is ordered by weight, then by date
- http://gohugo.io/templates/list/
- http://gohugo.io/templates/views/
- Taxonomies
- Partials - http://gohugo.io/templates/partials/
- Menus - http://gohugo.io/extras/menus/
- Investigate Later
- Pagination - http://gohugo.io/extras/pagination/
- Scoping & Scratch - http://gohugo.io/extras/scratch/
- Shortcodes - http://gohugo.io/extras/shortcodes/
Data to Add
json
- basic
- content->rss[] -
- https://github.com/sdepold/jquery-rss
- http://stackoverflow.com/questions/22273907/use-jquery-to-read-a-json-google-news-feed
content->analysis->html->page_size
content->analysis->text->counts->word
content->analysis->text->counts->sentence
content->analysis->text->counts->avg_sentance_words
content->analysis->html->dom->body->hash
content->analysis->html->dom->counts->tags
image
content->analysis->html->links->rel->image_src
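A minimal sketch of how several of the `content->analysis` values listed above might be computed from a page's HTML. Everything not in the list is an assumption: the names `PageStats` and `analyze` are hypothetical, `page_size` is taken to be the UTF-8 byte length of the markup, the body hash is a SHA-1 of the whitespace-normalized body text, and `tags` is the total number of start tags. The key `avg_sentance_words` is kept exactly as spelled in the list.

```python
import hashlib
import re
from collections import Counter
from html.parser import HTMLParser


class PageStats(HTMLParser):
    """Counts start tags and collects the text found inside <body>."""

    def __init__(self):
        super().__init__()
        self.tags = Counter()
        self.in_body = False
        self.skip_depth = 0  # > 0 while inside <script> or <style>
        self.text_parts = []

    def handle_starttag(self, tag, attrs):
        self.tags[tag] += 1
        if tag == "body":
            self.in_body = True
        elif tag in ("script", "style"):
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "body":
            self.in_body = False
        elif tag in ("script", "style") and self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        if self.in_body and not self.skip_depth:
            self.text_parts.append(data)


def analyze(html):
    """Build the nested content->analysis structure for one HTML page."""
    stats = PageStats()
    stats.feed(html)
    stats.close()
    # Collapse all runs of whitespace so the hash ignores formatting changes.
    text = " ".join(" ".join(stats.text_parts).split())
    words = re.findall(r"\w+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    avg = len(words) / len(sentences) if sentences else 0.0
    return {"content": {"analysis": {
        "html": {
            "page_size": len(html.encode("utf-8")),
            "dom": {
                "body": {"hash": hashlib.sha1(text.encode("utf-8")).hexdigest()},
                "counts": {"tags": sum(stats.tags.values())},
            },
        },
        "text": {"counts": {
            "word": len(words),
            "sentence": len(sentences),
            "avg_sentance_words": avg,
        }},
    }}}
```

The returned dict can be serialized with `json.dumps` and dropped into a data file alongside each snapshot.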
Long Term ToDos and Wish List
As I get time, here are some of the things I hope to investigate next.
- Google Custom Search - Integrate Search
- Color Palette Extraction - Think about various techniques for extracting color palettes from screenshots
- Apache Tika - I got started with processing pages before Tika existed, but it could work a heckuva lot better for lots of things (for Open Graph & Twitter Card, etc. in particular)
- DFP - even if I don’t use ads, I might use DFP for Small Business to support geospatial targeting of relevant snapshots
- Before/After Comparisons - Once I start processing long term, provide before/after comparisons of screenshots
- Elasticsearch - Once I get things up and running, think about a faceted search supported by a hosted Elasticsearch instance
- SenderBase Processing - Grab Snapshots from SenderBase for additional info