<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>James&#039; World &#187; Conferences</title>
	<atom:link href="http://www.jebriggs.com/blog/category/conferences/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.jebriggs.com/blog</link>
	<description>Observations by a Programmer of Silicon Valley and Beyond</description>
	<lastBuildDate>Mon, 06 Feb 2012 11:03:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Silicon Valley Codecamp 2011</title>
		<link>http://www.jebriggs.com/blog/2011/10/silicon-valley-codecamp-2011/</link>
		<comments>http://www.jebriggs.com/blog/2011/10/silicon-valley-codecamp-2011/#comments</comments>
		<pubDate>Mon, 10 Oct 2011 06:42:55 +0000</pubDate>
		<dc:creator>Administrator</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Perl]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Tech]]></category>
		<category><![CDATA[Toys]]></category>

		<guid isPermaLink="false">http://www.jebriggs.com/blog/2011/10/</guid>
		<description><![CDATA[Silicon Valley Codecamp 2011 was held this weekend at Foothill College near San Jose. There were 209 sessions and 3,414 registered attendees. I did not attend, but have seen some of the talks at other conferences in the past year. &#8230; <a href="http://www.jebriggs.com/blog/2011/10/silicon-valley-codecamp-2011/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.siliconvalley-codecamp.com/">Silicon Valley Codecamp</a> 2011 was held this weekend at Foothill College near San Jose.</p>
<p>There were <a href="http://www.siliconvalley-codecamp.com/Sessions.aspx">209 sessions</a> and 3,414 registered attendees.</p>
<p>I did not attend, but have seen some of the talks at other conferences in the past year. Pretty impressive list of speakers!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.jebriggs.com/blog/2011/10/silicon-valley-codecamp-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>OSCON 2011, Portland</title>
		<link>http://www.jebriggs.com/blog/2011/07/oscon-2011-portland/</link>
		<comments>http://www.jebriggs.com/blog/2011/07/oscon-2011-portland/#comments</comments>
		<pubDate>Sun, 31 Jul 2011 06:03:21 +0000</pubDate>
		<dc:creator>Administrator</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Conferences]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[Japanese]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[OSCON]]></category>
		<category><![CDATA[Perl]]></category>
		<category><![CDATA[Photography]]></category>
		<category><![CDATA[Psychology]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Tech]]></category>
		<category><![CDATA[Toys]]></category>
		<category><![CDATA[Travel]]></category>
		<category><![CDATA[User Groups]]></category>

		<guid isPermaLink="false">http://www.jebriggs.com/blog/?p=3735</guid>
		<description><![CDATA[Once again, the O&#8217;Reilly Open Source Conference (OSCON) was held in Portland, Oregon. It was held in parallel at the Oregon Convention Center with the O&#8217;Reilly OSdata and OSjava Conferences at the beginning of the week, and then later a &#8230; <a href="http://www.jebriggs.com/blog/2011/07/oscon-2011-portland/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Once again, the <a href="http://www.oscon.com/oscon2011/">O&#8217;Reilly Open Source Conference (OSCON)</a> was held in Portland, Oregon.</p>
<p>It was held in parallel at the Oregon Convention Center with the O&#8217;Reilly OSdata and OSjava Conferences at the beginning of the week, and then later a knitting conference.</p>
<p>The conferences were well-managed, as usual. Great economy: lots of job notices and recruiting appeals. There was some chatter about Tim&#8217;s <a href="http://radar.oreilly.com/2011/07/sexual-harassment-at-technical.html">anti-harassment blog post.</a></p>
<p><strong>Executive Summary:</strong></p>
<ul>
<li>HTML5/CSS3/AppCache are what should have been available 20 years ago, and are significant improvements that allow both desktop and mobile development in HTML. Although the HTML5 video tag gets a lot of press, HTML5 includes equally important forms improvements.
<li>DNSSEC is <a href="http://tools.ietf.org/html/draft-ietf-dnsop-rfc4641bis-07">complex</a> and new signatures should be generated every 30 days or less (to reduce replay attacks by limiting the signature validity period), which is a burden on companies without a full-time DNS hostmaster. Third-party DNS hosting companies are salivating over DNSSEC.
<li>MySQL long-term stewardship is still in question, with Oracle hemorrhaging MySQL developers and closing access to their bugs database, but MontyProgram and Percona maintaining strong forks.
</ul>
<p>Here&#8217;s my notes on some of the tutorials and talks I attended:</p>
<p><strong>Monday</strong></p>
<p><strong><a href="http://www.oscon.com/oscon2011/public/schedule/detail/19216">HTML5 &#038; CSS3: The Good Enough Parts</a></strong><br />
Estelle Weyl, Standardista.com<br />
<a href="http://www.standardista.com/forms/oscon/">Slides</a></p>
<p>- transform-origin is key to snowflake demo looking realistic, easy to use<br />
- background resets everything, so use individual properties<br />
- background-position &#8211; use all 4 values<br />
- background-size auto contain cover, handy for iPhones<br />
- text-overflow: ellipsis<br />
- minimal HTML5 document:<br />
<code><br />
&lt;!doctype html5&gt;<br />
&lt;meta charset=utf8&gt;<br />
&lt;title&gt;&lt;/title&gt;<br />
</code><br />
(head and body are implied)<br />
- or even send tags in server headers<br />
- changed most elements<br />
- &lt;i lang=&#8221;"&gt; useful to style<br />
- small tag useful for legal smallprint, since there&#8217;s no copyright metatag yet<br />
- <a href="http://code.google.com/p/html5shim/">html5shim</a><br />
- <a href="http://www.modernizr.com/">Modernizr</a><br />
- <a href="http://html5boilerplate.com/">HTML5 Boilerplate</a> &#8211; good way to learn HTML5 and CSS3<br />
- tabindex=&#8221;-1&#8243; allows JS to set focus and not bother user otherwise<br />
- spellcheck=&#8221;true&#8221; | &#8220;false&#8221;<br />
- itemtype=&#8221;http://data-vocabulary.org/Person&#8221;<br />
- new input types<br />
- placeholder, pattern, required, spellcheck, validate<br />
- a@b is deliverable for internal email servers. hmm.<br />
- meter, progress, output widgets<br />
- <a href="http://www.html5rocks.com/">HTML5Rocks</a><br />
- button generator at <a href="http://css3button.net/">css3button.net</a><br />
- <a href="http://en.wikipedia.org/wiki/Voronoi_diagram">Voronoi diagram</a> demo<br />
- <a href="http://code.google.com/p/webglsamples/">aquarium.js</a><br />
- web workers</p>
<p><strong>Monday Lunch</strong></p>
<p>Benjamin, Ubuntu<br />
- loves <a href="https://www.cloudflare.com/">CloudFlare</a><br />
- likes Linode<br />
- <a href="http://nimbula.com/">nimbula</a></p>
<p>Talked to an open mapping data fellow about various projects. Google ToS is scary when it comes to that kind of data.</p>
<p><strong>Monday Afternoon</strong></p>
<p><a href="http://www.oscon.com/oscon2011/public/schedule/detail/17828">Moose is Perl: A Guide to the New Revolution</a><br />
Ricardo Signes, Pobox.com<br />
<a href="http://assets.en.oreilly.com/1/event/61/Moose is Perl_ A Guide to the New Revolution Presentation 1.pdf">Slides</a></p>
<p>- detailed talk about Moose features and syntax<br />
- chatted with other folks at break time about topics like <a href="http://search.cpan.org/~kamelkev/CSS-Inliner/">CSS::Inliner</a> and <a href="http://search.cpan.org/~sri/Mojolicious/">Mojolicious</a> web framework (with minimal dependencies) by Sebastian Riedel.</p>
<p><strong>Tuesday Afternoon</strong></p>
<p><a href="http://www.oscon.com/oscon2011/public/schedule/detail/18984">Three For Five &#8211; Functional HTML5 &#038; CSS3 for Designers &#038; Developers</a><br />
Jason VanLue, Envy Labs and CodeSchools.com</p>
<p>- good training session with fun sample &#8211; a beer menu created from 1 photo (CSS3 text scaling and rotation) and HTML5/CSS3 styled text<br />
- <a href="http://threeforfive.codeschool.com/">training class is available online</a> for $75 (also jQuery and 2 Rails classes)<br />
<center><br />
<a href="http://jebriggs.com/php/3-for-5-beer.png"><img src="http://jebriggs.com/php/3-for-5-beer.png" alt="3-for-5 Beer Menu" title="3-for-5 Beer Menu" width="95%" height="95%"/></a><br />
<a href="http://jebriggs.com/php/3-for-5-beer.png">Click to Enlarge</a><br />
</center></p>
<p><strong>Tuesday Night</strong><br />
<center><br />
<img src="http://jebriggs.com/php/puppet_labs_logo.jpg" alt="Puppet Labs Logo" title="Puppet Labs Logo"/><br />
</center><br />
- went to <a href="http://www.puppetlabs.com/">Puppet Labs</a> office for CloudCamp lightning talks, which started about 90 minutes late<br />
- nice office, typical start-up look across from a small park. Comfy little meeting rooms with leather sofas a la Netflix.<br />
- about 5 lightning talks total, 2 were sales pitches, 2 had 40 slides crammed into 5 minutes. ick.<br />
- Puppet Labs CEO gave a good talk on optimizing Puppet for a client with 10,000+ nodes. Converted XML::RPC to REST, which doubled performance from 500 to 1,000 qps (I talked to Randy Ray about that, and he wasn&#8217;t surprised and that would be the case on simple requests), did some more work and maxed out at 2,500 qps. Enabling SSL did not slow down requests.<br />
- got too crowded for me, and also fire department, who manned the exits and counted people as they entered and left.<br />
<center><br />
<a href="http://www.flickr.com/photos/jacob_helwig/5979946129/sizes/m/in/photostream/"><img src="http://jebriggs.com/php/puppet_labs_office.jpg" alt="Puppet Labs Office" title="Puppet Labs Office" /></a><br />
Photo credit: Jacob Helwig<br />
</center><br />
<strong>Wednesday</strong></p>
<p><a href="http://www.oscon.com/oscon2011/public/schedule/detail/18480">Creating a Scalable JavaScript Application Architecture</a><br />
Nicholas Zakas, NCZConsulting<br />
<a href="http://slideshare.net/nzakas">Slides</a></p>
<p>An AJAX client only cares about getting the data it wants, not response codes, etc. </p>
<p>Use layered JavaScript client architecture:</p>
<p>- sandbox<br />
- application<br />
- library (Dojo, YUI, <a href="http://mootools.net/">MooTools,</a> etc. )</p>
<p>www.nczonline.net<br />
@slicknet<br />
Author of &#8220;High Performance JavaScript&#8221;</p>
<p><strong>Wednesday Lunch</strong></p>
<p>I talked to Ben Golub, CEO of <a href="http://www.gluster.com/">Gluster.</a></p>
<p>- 80% business, 20% scientific<br />
- users include <a href="http://www.box.net/">box.net,</a> <a href="http://www.pandora.com/">Pandora.com</a><br />
- written in C<br />
- minimum is 2 nodes for replication<br />
- lots of people use it in EC2<br />
- office located in Sunnyvale.</p>
<p><strong>Wednesday Afternoon</strong></p>
<p><a href="http://www.oscon.com/oscon2011/public/schedule/detail/18792">HTML5: All about Web Forms</a><br />
Estelle Weyl, standardista.com</p>
<p>- use label tag with forms to ease navigation for end-users<br />
- use placeholder attribute, better for screenreaders than JS coding<br />
- multiple autofocus defaults to last one in HTML5<br />
- type=&#8221;text&#8221; is default, so tel, email, etc. degrades on all browsers back to text<br />
- form element can disassociate parent form, useful for AJAX multiform pages<br />
- input types good for mobile devices to show useful soft keyboard for url or email input types<br />
- numeric step options<br />
- test date and numeric input types for usability. Scrolling birthdays or zip codes is painful<br />
- still need JS<br />
- Opera is first with new UI features but last with artistic design, so currently has hideous tooltip appearance<br />
- list and datalist like exploded select. Include select for IE backward compatibility<br />
- meter, progress and output UI elements<br />
- input type=text x-webkit-speech, now on Google homepage</p>
<p><a href="http://www.oscon.com/oscon2011/public/schedule/detail/19154">HTML5 in Your Pocket: Application Cache and Local Storage </a><br />
Scott Davis, ThirstyHead.com</p>
<p>- 4 million Macs, 32 million iDevices in last quarter<br />
- Basecamp Mobile<br />
- <a href="http://diveintohtml5.org/">&#8220;Dive into HTML5&#8243;</a> by Mark Pilgrim online<br />
- cookies should be called thimbles, only 4k<br />
- HTML5 localstorage supported in IE8, FF 3.5, so practically all<br />
- 5 MB, QUOTA_EXCEEDED_ERR, can&#8217;t increase now<br />
- <a href="http://statcounter.com/">StatCounter</a> browser stats<br />
- <a href="https://gist.github.com/350433">gist 350433: Storage polyfill</a> using window.name and cookies<br />
- no version of IE or FF support web SQL and they probably won&#8217;t, FF for philosophical reasons<br />
- cache manifest<br />
- <a href="http://jameswragg.com/experiments/genmanifest/">genManifest</a> bookmarklet<br />
- FF <a href="http://about:cache">about:cache</a> and Firebug are handy to see caches<br />
- appcache has no expiry date<br />
- date stamping manifest file causes re-download<br />
- 404 causes none to be saved<br />
- treat appcache as only slightly more secure than cookies, which are round-tripped<br />
- webplication<br />
- still sandboxed from local file access, could use node.js or signed app<br />
- See W3C <a href="http://www.w3.org/TR/html5/offline.html">HTML5 offline</a></p>
<p><a href="http://www.oscon.com/oscon2011/public/schedule/detail/18972">Profiling and Detecting Bottlenecks in Software</a><br />
Bryan Call, Yahoo!/Apache Committer<br />
<a href="http://assets.en.oreilly.com/1/event/61/Profiling%20and%20Detecting%20Bottlenecks%20in%20Software%20Presentation.pptx">PowerPoint .pptx</a></p>
<p>- usual savings (machines, moving parts, get smart)<br />
- top, <a href="http://htop.sourceforge.net/">htop</a><br />
- vmstat, dstat<br />
- time cmd<br />
- Boost logging does small writes, allocates memory when it gets behind, causing both IO and memory pressure<br />
- profilers like oprofile and google profile cause 1% to 8% slowdown<br />
- valgrind&#8217;s callgrind much more resources<br />
- oprofile has script to convert output to <a href="http://kcachegrind.sourceforge.net/html/Home.html">kcachegrind</a><br />
- opcontrol &#8211;deinit<br />
- sysctl nmi_watchdog off<br />
- opcontrol &#8211;no-vmlinux<br />
- opcontrol &#8211;daemon<br />
- google profiler userland, LD_PRELOAD<br />
- env CPUPROFILE=/tmp/mybin.prof /usr/local/bin/my_binary_compiled_with_libprofiler_so<br />
- caching: don&#8217;t do the same work twice<br />
- choose the correct algorithms and data structures:  dqueue vs. List, hash vs. trees, locks vs. r/w locks, bloom filter<br />
- reuse memory, stack vs. heap, <a href="http://goog-perftools.sourceforge.net/doc/tcmalloc.html">tcmalloc</a><br />
- make fewer system calls (larger reads and writes)<br />
- faster hardware, bonded NICs, SSDs, RAID, CPU, more cores<br />
- read <a href="https://cwiki.apache.org/TS/profiling.html">How to Profile Apache Traffic Server</a><br />
- <a href="http://www.bootchart.org/">bootchart</a><br />
- <a href="http://acme.com/software/http_load/">http_load</a> now uses epoll<br />
- he made ab multi-core</p>
<p>From <a href="http://www.mjmwired.net/kernel/Documentation/sysctl/kernel.txt">kernel.txt</a>: &#8220;nmi_watchdog: Enables/Disables the NMI watchdog on x86 systems.  When the value is non-zero the NMI watchdog is enabled and will continuously test all online cpus to determine whether or not they are still functioning properly. Currently, passing &#8220;nmi_watchdog=&#8221; parameter at boot time is required for this function to work. If LAPIC NMI watchdog method is in use (nmi_watchdog=2 kernel parameter), the NMI watchdog shares registers with oprofile. By disabling the NMI watchdog, oprofile may have more registers to utilize.&#8221;</p>
<p><a href="http://www.oscon.com/oscon2011/public/schedule/detail/21187">CoffeeScript: A New Hope for JavaScript</a><br />
Scott Davis, ThirstyHead.com</p>
<p>- trainer, author, worked on Comcast/Time Warner TVs which mostly use WebKit<br />
- little language that compiles into JS<br />
- JS V8 headless, like node.js<br />
- PhantomJS is headless HTML, handy for testing<br />
- Google GWT compiles Java to JS<br />
- &#8220;transpiler&#8221;<br />
- install node.js<br />
- install npm<br />
- npm install -g coffeescript<br />
- &#8211;tokens, &#8211;nodes like java p<br />
- immediately invoked function expression IIFE<br />
- coffeescript: string interpolation #{name}, &#8220;&#8221;"<br />
- objects with left-hand spacing like python</p>
<p><strong>Thursday</strong></p>
<p><a href="http://www.oscon.com/oscon2011/public/schedule/detail/18809">DNSSEC @ Mozilla</a><br />
Shyam Mani, Mozilla Corporation<br />
<a href="http://people.mozilla.org/~shyam/presentations/oscon-2011.pdf">Slides</a></p>
<p>- BIND 9.7 is nice for DNSSEC<br />
- Keys are everything, protect them. Have a backup plan.<br />
- Cisco core routers by default don&#8217;t expect large DNS transfers:<br />
<code><br />
policy-map global policy class inspection_default inspect dns maximum-length 4096<br />
</code><br />
- DS was live, no signed zones<br />
- watch log levels, can be chatty and quickly fill disk with logs<br />
- DNSSEC has no immediate benefit to end-users, since resolvers don&#8217;t honor it<br />
- their logs show 1000:1 dns vs dnssec queries for last 6 months, but growing<br />
- <a href="https://supportforums.cisco.com/thread/2013194">IOS Firewall DNSSEC</a></p>
<p><a href="http://www.oscon.com/oscon2011/public/schedule/detail/18795">Ask Google Engineers Anything</a><br />
Chris DiBona, Google</p>
<p>- 55 Google employees attending OSCON this year<br />
- mostly end-user questions about Google+ circles and API<br />
- or running Go on android<br />
- or why does my telco not do firmware releases for my smartphone<br />
- or not happy with Google search results this month<br />
- I asked about original reason for GFS. Originally, the hardware was really that flaky, and Google even actively bought bulk refurbed computers and RAM, sometimes off the back of a truck. Got a gopher plushie in return.<br />
- also some good feedback complaints: google groups UI inadequate for managing 350 groups in an Education scenario<br />
- inadequate data import tools for non-profit users of groups, mentioned by a religious charity.</p>
<p><a href="http://www.oscon.com/oscon2011/public/schedule/detail/19925">How Not to Design Like a Developer: Open Source Can Look Good Too!</a><br />
Chrissie Brodigan, Mozilla/Firefox</p>
<p>- KPI vs. git (different goals)<br />
- @sirupsen<br />
- story about the <a href="http://glow.mozilla.org">downloads map graphic</a> for FF 4 &#8211; a developer silently removed social button graphics, limiting participation of wider audience. Marketing needs to explain why and how other staff fit into outreach programs.<br />
- hang out on #projectdesign<br />
- design contests are a good way to get them to come out of the woodwork<br />
- designers hang out on twitter, not irc<br />
- programmers should avoid big red buttons that scare users, and improve accessibility<br />
- Inkscape, Blender, HTML and CSS are some Open Source tools for design mockups<br />
- do AB testing or survey users<br />
- designers want to be martyrs, so be careful they don&#8217;t offer more than you are willing to accept (start with 1 icon rather than the whole set)<br />
- take a look at graphics libre for icons<br />
- <a href="http://quitestrong.com">Quitestrong.com</a> 5 girls who do design</p>
<p><strong><a href="http://gearman.org/">Gearman</a> BOF</strong><br />
Brian Aker</p>
<p>- Gearman polls, beanstalk busy waits<br />
- nice to have feature to give up to another thread<br />
- monitor projects handle launching of workers<br />
- Gearman has durable and non-durable queues<br />
- is a superset of the crap you handrolled. Most of the homegrown apps peak at 50% to 60% of Gearman&#8217;s features<br />
- Gearman is production ready, but the postgresql driver less so because of fewer test cases and Brian&#8217;s lesser familiarity<br />
- setup ntp and use Gearman coalescence for redundant cron servers<br />
- can inspect queue<br />
- agnostic to backend<br />
- 99designs.com looking at this, same use case as original developer<br />
- I still think that if you already have a database app, adding a status column gets you a lot of Gearman functionality without one more moving part.</p>
<p><strong><a href="http://mariadb.org/">MariaDB</a> BOF</strong><br />
Monty Widenius, MontyProgram AB</p>
<p>- Monty mentioned that the latest release of <a href="http://kb.askmonty.org/en/what-is-mariadb-53">MariaDB 5.3-beta</a> has faster replication from group commit and performance improvements on the master, which also help the slave. Also subqueries and joins work much better.<br />
- Monty talked about his Aria storage engine, which is a replacement for MyISAM that has both transaction and non-transaction modes. It&#8217;s intended for users who want the space savings of MyISAM. Over time it may compete with InnoDB.<br />
- Monty&#8217;s responsibility is to convince Percona to merge into 1 source base sometime<br />
- it&#8217;s estimated that although Oracle still has the InnoDB team, they may only have 1 general MySQL server programmer left.<br />
- he explained that MontyProgram developers work 50% on feature requests from end-users, and 50% Open Source-related. So paid requests for 1 week of work really need to cover 2 weeks of developer time for that model to work. Typically a medium-sized change is roughly $12,000 and includes development, testing and documentation.<br />
- Zmanda got FLUSH TABLES WITH READ LOCK NO CHECKPOINT for a beer, though. <img src='http://www.jebriggs.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Monty was able to find a code path that could be locked to prevent Aria and MyISAM from writing anything during the backup.<br />
- I sponsored <a href="http://askmonty.org/worklog/Server-RawIdeaBin/?tid=232">WL#232</a> for USD$100 to add a SHUTDOWN statement to MySQL<br />
- Monty explained that MERGE tables may be a better choice than MySQL partitions for logging applications.<br />
- attendees from MontyProgram, SkySQL, Percona, DeNA</p>
<p><a href="http://kb.askmonty.org/en/1631">AskMonty: MySQL &#8220;Wishlist&#8221; Session from an online travel agency</a><br />
<a href="http://download.oracle.com/docs/cd/E17952_01/refman-5.5-en/flush.html">MySQL Manual 5.5: FLUSH Syntax</a></p>
<p><strong>Perl Lightning Talks</strong><br />
Hosted by Geoff Avery</p>
<p>- a talk on why arrogant community members telling others that &#8220;they need a thick skin&#8221; is unhelpful<br />
- a talk by a young Perl community member on getting commit access, and how others can get the spirit and contribute<br />
- Larry did several talks, mostly encouraging backporting Perl6 features to Perl5 it seemed, perhaps as a replacement to going Moose<br />
- nice song on the importance of public libraries, which face shutdown due to economic budgeting problems in Australia and USA<br />
- nice comedy juggling act comparing programming languages. Perl6 was omitted as &#8220;nothing has been updated in 5 or 6 years&#8221;, prompting Larry to say that he was happy he has a thick skin. See above. <img src='http://www.jebriggs.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /><br />
- afterward, I talked to a booking.com rep about why a European company needed to actively recruit in USA and world-wide. He said that European developers are happy working where they are now, and it&#8217;s easier to recruit in places with mobile workforces like the USA. He would like to hire a couple developers per week to meet their development schedule.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.jebriggs.com/blog/2011/07/oscon-2011-portland/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DataStax Apache Cassandra Training in SF</title>
		<link>http://www.jebriggs.com/blog/2011/07/datastax-apache-cassandra-training-in-sf/</link>
		<comments>http://www.jebriggs.com/blog/2011/07/datastax-apache-cassandra-training-in-sf/#comments</comments>
		<pubDate>Wed, 13 Jul 2011 03:20:35 +0000</pubDate>
		<dc:creator>Administrator</dc:creator>
				<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Perl]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Tech]]></category>
		<category><![CDATA[Toys]]></category>

		<guid isPermaLink="false">http://www.jebriggs.com/blog/?p=3576</guid>
		<description><![CDATA[I went to the DataStax training for Apache Cassandra today at the Hilton San Francisco Financial District Hotel. Ben Coverston, Director of Operations at DataStax, did a great job presenting some very technical material. Topics included data modeling, client programs, &#8230; <a href="http://www.jebriggs.com/blog/2011/07/datastax-apache-cassandra-training-in-sf/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>I went to the <a href="http://www.datastax.com/">DataStax</a> training for <a href="http://cassandra.apache.org/">Apache Cassandra</a> today at the Hilton San Francisco Financial District Hotel.</p>
<p><a href="http://www.linkedin.com/pub/ben-coverston/4/680/225">Ben Coverston,</a> Director of Operations at DataStax, did a great job presenting some very technical material.</p>
<p>Topics included data modeling, client programs, server configuration, replication, tuning, performance monitoring and operations.</p>
<p>Labs were done using the provided USB drive containing a VM (system requirements: 2 GB RAM and 12 GB disk free) of Cassandra and other tools like <a href="http://java.sun.com/developer/technicalArticles/J2SE/jconsole.html">JConsole</a> running on Debian Linux.</p>
<p>(I should have used <a href="http://www.virtualbox.org/">VirtualBox,</a> which is free (GPL) and doesn&#8217;t require registration, instead of the VMware Fusion trial, which has restrictive licensing and only works for a month.</p>
<p>To use VirtualBox4, create a new VM and instead of creating new disk space, attach &#8220;Riptano Training.vmdk&#8221;. You may also need to rename the parent directory from &#8220;datastax_training_dist client.vmware&#8221; to &#8220;datastax_training_dist client&#8221; for the navigator to open it.)</p>
<p>About 45 people attended, a new class record.</p>
<p>If you get a chance to take Ben&#8217;s class, I highly recommend it &#8211; send your whole team.</p>
<p>To get the most out of the class, I recommend reading about Cassandra data modeling and trying the various tools (cassandra-cli, cql, nodetool, JConsole, JNA, JMX, <a href="http://www.datastax.com/products/opscenter">Opscenter,</a> etc.) and hanging out on the #cassandra channel on irc.freenode.net. beforehand.</p>
<p>The hotel was a good choice, located near the financial district and Chinatown, with reliable WiFi, conveniently organized meeting space, and a very good lunch.</p>
<p>I had an interesting discussion with some veteran DBAs about the popularity of NoSQL solutions on the East Coast. The most popular request was for &#8211; get this &#8211; SQL access via Hive or cql. <img src='http://www.jebriggs.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p><a href="http://www1.hilton.com/en_US/hi/hotel/SFOFDHF-Hilton-San-Francisco-Financial-District-California/index.do">Hilton San Francisco Financial District</a><br />
750 Kearny Street, San Francisco, California, United States 94108</p>
<p><a href="http://twitter.com/#!/bcoverston">@bcoverston</a><br />
Ben Coverston &#8211; The Apache Cassandra Project<br />
<a href="http://www.slideshare.net/MStarTechTalks/ben-coverston-the-apache-cassandra-project"><a href="http://events.sfgate.com/san-francisco-ca/events/show/189792846-datastax-training-for-apache-cassandra-sf">sfgate.com: DataStax Training for Apache Cassandra &#8211; SF</a></a><br />
<a href="http://geekaustin.org/news/2011/02/27/apache-cassandra-training-30-jbellis-datastax-no-way">Apache Cassandra training for $30? from @spyced of Datastax? No way! </a><br />
<a href="http://blog.octo.com/en/nosql-lets-play-with-cassandra-part-13/">Let’s play with Cassandra… (Part 1/3)</a></p>
<p>Some notes about 0.8 and previous releases:</p>
<p><strong>- Cassandra Clients</strong></p>
<ul>
<li>cassandra-cli</li>
<li><a href="http://prettyprint.me/2010/02/23/hector-a-java-cassandra-client/">Hector</a></li>
<li>CQL</li>
<li><a href="http://pycassa.github.com/pycassa/">pycassa</a></li>
<li>Perl and PHP clients are not as reliable because of poor Thrift bindings, should be rewritten to use cql binary protocol</li>
<li>Java</li>
</ul>
<p><strong>- 3 common cassandra-cli commands are:</strong></p>
<ol>
<li>show</li>
<li>describe</li>
<li>list</li>
</ol>
<p><strong>- Counters </strong></p>
<ul>
<li>If you use counters, then don&#8217;t delete SStables. decommission and bootstrap.</li>
<li>TimeoutException can cause overcount with counters.</li>
<li>No secondary indexes on counters.</li>
<li>Repair does not fix counters.</li>
</ul>
<p><strong>- nodetool</strong></p>
<ul>
<li>nodetool flush</li>
<li>nodetool drain</li>
</ul>
<p><strong>- Repair</strong></p>
<ul>
<li>Send schema changes 5 seconds apart to avoid wedging other nodes. To fix a wedge, delete old schema (leave data alone) and restart.</li>
<li>mlockall is enabled by default if JNA is found on the classpath (must be installed separately because of licensing). Also a good idea to turn swap off.</li>
<li>nodetool repair (or rsync all files then nodetool repair)</li>
<li>read at consistency level of all (ghetto repair)</li>
<li>adding a new node can be more efficient than removing a node then adding it.</li>
</ul>
<p><strong>- Compaction</strong></p>
<ul>
<li>rows over 64 MB need 2-pass compaction</li>
<li>minor compaction can purge tombstones > 0.6.6</li>
<li>avoid major compaction, but keep an eye on SStables-per-read histogram</li>
</ul>
<p><strong>- Monitoring</strong></p>
<ul>
<li>watch -d -n 5 nodetool info or cfstats or tpstats</li>
<li>JConsole and JMX</li>
<li>storageproxy latency, reads, writes</li>
<li>oadcdb storageproxy</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.jebriggs.com/blog/2011/07/datastax-apache-cassandra-training-in-sf/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Cassandra SF 2011, San Francisco</title>
		<link>http://www.jebriggs.com/blog/2011/07/cassandra-sf-2011-san-francisco/</link>
		<comments>http://www.jebriggs.com/blog/2011/07/cassandra-sf-2011-san-francisco/#comments</comments>
		<pubDate>Tue, 12 Jul 2011 07:02:17 +0000</pubDate>
		<dc:creator>Administrator</dc:creator>
				<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Tech]]></category>

		<guid isPermaLink="false">http://www.jebriggs.com/blog/?p=3571</guid>
		<description><![CDATA[I went to the Cassandra SF 2011 conference today, at the UCSF Mission Bay Conference Center. It was hosted by Datastax.com, where many of the Cassandra developers work. Here&#8217;s an overview of some of the talks I attended. Keynote Jonathan &#8230; <a href="http://www.jebriggs.com/blog/2011/07/cassandra-sf-2011-san-francisco/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>I went to the <a href="http://www.datastax.com/events/cassandrasf2011">Cassandra SF 2011</a> conference today, at the <a href="http://www.acc-missionbayconferencecenter.com/">UCSF Mission Bay Conference Center.</a></p>
<p>It was hosted by <a href="http://www.datastax.com/">Datastax.com</a>, where many of the Cassandra developers work.</p>
<p>Here&#8217;s an overview of some of the talks I attended.</p>
<p><strong>Keynote</strong><br />
<strong>Jonathan Ellis, DataStax CTO and Apache Project Chair of Cassandra</strong></p>
<p>Talked about the latest features and use cases for Cassandra and Brisk, Hadoop powered by Cassandra. </p>
<p><strong>Cassandra Use Cases at Twitter</strong><br />
<strong>Chris Goffinet, Twitter (ex-Yahoo! Performance Engineer)</strong></p>
<p>Ext4-writeback-512k best, xfs also good<br />
Noop best for reads p90, p99, avg, max<br />
All schedulers same for writes because of journal.<br />
Multi-threaded compaction all the time:<br />
<a href="https://issues.apache.org/jira/browse/CASSANDRA-2191">CASSANDRA-2191</a> Multithread across compaction buckets<br />
<a href="https://issues.apache.org/jira/browse/CASSANDRA-2191">CASSANDRA-2156</a> Compaction Throttling<br />
Compression reduces data by 7x<br />
memcached most efficient space-wise<br />
99th percentile went from 200-800 ms -> 2.5 ms<br />
Cuckoo replaces <a href="http://ganglia.sourceforge.net/">Ganglia,</a> updates every 60s<br />
140 MB/s writes, 70 MB/s reads<br />
500 GB new data / hour<br />
x3 replication factor<br />
Slab allocator fixed size allocs of 2 MB<br />
<a href="https://issues.apache.org/jira/browse/CASSANDRA-2252">CASSANDRA-2252</a> off-heap memtables<br />
Reduce GC time by 1000x<br />
Don&#8217;t use TTL, drop SSTables after N days<br />
<a href="https://issues.apache.org/jira/browse/CASSANDRA-2819">CASSANDRA-2819</a> Split rpc timeout for read and write ops<br />
mlockall recommended with JNA jar to fix JVM allocation problems<br />
git, Hudson/Jenkins, <a href="https://github.com/lg/murder/">murder</a> <a href="http://twitter.com/about/opensource">(Twitter Open Source),</a> <a href="http://www.bittorrent.com/help/faq">Bittorrent</a> to deploy to 100s of nodes in 20s<br />
Cassie is a light-weight Cassandra Client<br />
Based on finagle<br />
<a href="https://github.com/brianfrankcooper/YCSB/wiki">Yahoo! Cloud Serving Benchmark (YCSB)</a>-based test framework<br />
Generate 1,200 reports per build, takes 48 hours to run<br />
Understand your stack<br />
Measure everything<br />
Invest in your storage technology (right components, understand failure modes, etc.)<br />
Automate<br />
Expect everything to fail<br />
Linux page cache did not have good read performance since not LRU and fixed 4K, thus memcached esp. with narrow rows<br />
Need to evaluate alternative JVMs like <a href="http://www.oracle.com/technetwork/middleware/jrockit/overview/index.html">JRockit</a> and <a href="http://www.azulsystems.com/">Azul</a> for GC<br />
Apache <a href="http://www.mesosproject.org/">Mesos</a> cluster manager and virtualization project.<br />
Twitter Ops improvements &#8211; 6 people for 6 months.<br />
<a href="http://www.bittorrent.com/help/faq">Bittorrent</a> originally used to update 1000s of apache servers, create 1 seed and have it then do neighbor. Can update 300-node Cassandra cluster in 20s. <img src='http://www.jebriggs.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>(An attendee said speed of deployment is important more for recovering after a bad push than anything else in his case. It&#8217;s also helpful for software developers writing deployment scripts, and also when you have network architecture issues like overcommitment or latency-sensitive production traffic.)</p>
<p><strong>Eric Onnen, Urban Airship</strong></p>
<p>Committer on HBase, Cassandra, Zookeeper projects<br />
Hosting for mobile services<br />
United API for services across platforms<br />
Common SLA is 10,000 or 120,000 messages / s<br />
Over 160 million app installs on 80 million devices<br />
Half of requests are device check-ins (ie. when an app foregrounds)<br />
Now 25 million per day<br />
Used early MongoDB, not so good<br />
EC2 does not offer enough RAM for caching all their data<br />
Cassandra starting in summer 2010 for android<br />
6 EC2 himems, 1000 reads/s, 750 writes/s<br />
30 GB node<br />
Rolling upgrades<br />
Column TTLs<br />
Nice community<br />
Good with EC2<br />
No SPOF<br />
Ability to alter CLs on a per-operation basis like checkins vs schema changes<br />
Problems<br />
Know your data model<br />
Creating indexes later is a PITA<br />
Wide rows are bad &#8211; IO, thrift, count<br />
JSON better than packed binaries<br />
Careful with thrift in the stack<br />
Read timeout vs connection refused, GC vs failure, library assumptions and exception handling<br />
Verify client<br />
Ops<br />
Ensure dynamic switch is enabled<br />
IO<br />
Avoid EBS except for snapshot backups or use S3<br />
Stripe epehemerals, not EBS volumes<br />
Use common init scripts<br />
Avoid smaller instances &#8211; more steal<br />
Bare metal will rock your world (compared to VMs or EC2)<br />
Uselargepages<br />
Look at thread dumps<br />
Java <a href="http://java.net/projects/tda">TDA (Thread Dump Analyzer)</a> &#8211; love it, only good thing to come out of SAP <img src='http://www.jebriggs.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /><br />
<a href="http://www.eclipse.org/mat/">Eclipse Memory Analyzer (MAT)</a><br />
Perform major compactions daily in lull periods<br />
Monitor JMX religiously<br />
Cassandra exposes a lot of performance counters, use them<br />
Still too much time looking at GC<br />
Upgrading from 0.6 to 0.7 was rocky, had to do a read repair surprise<br />
Need CQL or Pig to offload ad hoc reporting from 12 developers to 11 administrative staff <img src='http://www.jebriggs.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /><br />
Sample end-user ad hoc query: How many android devices are using this version of software title &#8220;whatever&#8221;?</p>
<p><strong>Introduction to Brisk</strong><br />
<strong>Jake Luciani, DataStax</strong></p>
<p>Great talk on Brisk.</p>
<p><strong>Highly Available DNS and Request Routing Using Apache Cassandra</strong><br />
<strong>David Strauss, Founder/CTO, Pantheon Systems</strong></p>
<p>We accept HTTP requests and forward them to app servers (mostly for hosted Drupal)<br />
Reconfiguring Varnish, nginx, and proxies not fun, disruptive<br />
DNS has a nice model<br />
And client tools like dig<br />
nginx did not do round-robin<br />
Node.js ok using DNS, HTTP proxy and Fugue modules, 99 lines of code<br />
Why another? DNS has no ring replication, defined master<br />
MS AD and ApacheDS too heavy<br />
DNS needs to withstand DoS<br />
Github cassandra-dns<br />
Twisted doesn&#8217;t leak memory, but clients may<br />
TxSQL is another way, and is non-blocking<br />
Other ideas, point client resolvers at haproxy</p>
<p><strong>Replacing Datacenter Oracle with global Apache Cassandra on AWS</strong><br />
<strong>Adrian Cockcraft, Netflix</strong></p>
<p>DC capacity is 4 Billion requests per month<br />
exponential growth to 30 billion requests per month<br />
10 minutes to create a cluster in SG, would take 9 months to build a DC there<br />
Full backup &#8211; daily cron snapshot<br />
Incremental -<br />
Continuous &#8211; commit log to EBS<br />
Archive &#8211; S3, encrypt, region, separate account, different cloud<br />
Guide to NoSQL, redux. Mark Atwood<br />
Practicalcloudcomputing.com<br />
Chaos monkey runs 9 am to 3 pm, need to opt-out<br />
For BI take backup after midnite, but restore and chop off extra seconds after 00:00:00</p>
<p><strong>Real World Capacity Planning Cassandra on Blades and Big Iron</strong><br />
<strong>Ed Capriolo, Media6degrees</strong></p>
<p>Compacting tombstones at nite (batch time) can help with day-time traffic (real-time)<br />
nodetool<br />
JBOD not commonly used, but you could, like Hadoop or Riak<br />
Puppet<br />
Cacti graphs to acquire JMX data<br />
Uses all SCSI/SAS drives<br />
Likes blades, though only 1 or 2 drives per blade<br />
Centos 5.5 ext4<br />
Writing &#8220;High Performance Cassandra Cookbook&#8221;</p>
<p><strong>Lightning Talks</strong></p>
<p><strong>Mike Weir, Mktg VP, Datastax</strong></p>
<p>Compaction performance improvements C-1608</p>
<p><strong>Ankit Shah, Principal Software Engineer, Verisign Authentication Group, Symantec</strong></p>
<p>DB is 1.2 GB or so (small) but want HA<br />
Can be read-intensive, some writes<br />
Want good read performance, failover, replication<br />
Secure gossip over IPSec or similar<br />
Doesn&#8217;t like existing AWS database product limitations<br />
So Cassandra is it</p>
<p><strong>The Auto-Clustering Brisk AMI</strong><br />
<strong>Joaquin Casares, Datastax</strong></p>
<p>Also RAIDs ephemeral volumes</p>
<p><strong>Twitter</strong></p>
<p>Largely done programming:</p>
<p><a href="https://issues.apache.org/jira/browse/CASSANDRA-47">CASSANDRA-47</a> SSTable compression<br />
<a href="https://issues.apache.org/jira/browse/CASSANDRA-674">CASSANDRA-674</a> New SSTable Format<br />
<a href="https://issues.apache.org/jira/browse/CASSANDRA-16">CASSANDRA-16</a> Memory efficient compactions<br />
<a href="https://issues.apache.org/jira/browse/CASSANDRA-2319">CASSANDRA-2319</a> Promote row index<br />
<a href="https://issues.apache.org/jira/browse/CASSANDRA-293">CASSANDRA-293</a> remove_key_range operation and<br />
<a href="https://issues.apache.org/jira/browse/CASSANDRA-494">CASSANDRA-494</a> add remove_slice to the api<br />
<a href="https://issues.apache.org/jira/browse/CASSANDRA-808">CASSANDRA-808</a> Need a way to skip corrupted data in SSTables<br />
<a href="https://issues.apache.org/jira/browse/CASSANDRA-2398">CASSANDRA-2398</a> Type specific compression</p>
<p><strong>Mike Bulman, Opscenter Team, Datastax</strong></p>
<p>Uses Cassandra itself for logging<br />
Ring view graph<br />
Add a node to ring tool</p>
<p><strong>Cassandra Anti-Patterns</strong><br />
<strong>Matt Dennis, Datastax</strong></p>
<p>Use Sun JVM u22 or newer, not OpenJDK or Blackdown<br />
6-8 GB for java heap, not above 16 GB<br />
Separate spindles for. data and commit log unless EBS<br />
No EBS, use ephemeral<br />
Don&#8217;t read before write if possible<br />
Use bulk loader in 0.8, Java SStable program<br />
Supercolumns need to be rewritten.</p>
<p><a href="http://www.slideshare.net/shotaz/cql-cassandra-query-language">SlideShare: Cassandra Query Language (CQL)</a><br />
<a href="http://www.infoq.com/news/2007/11/ibm-sap">IBM and SAP Open Source their JVM Diagnostics Tools</a><br />
<a href="http://www.slideshare.net/adrianco/migrating-netflix-from-oracle-to-global-cassandra">Migrating Netflix from Datacenter Oracle to Global Cassandra</a><br />
<a href="http://news.cnet.com/8301-1023_3-57319459-93/urban-airship-raises-$15-million/">Urban Airship raises $15 million (Nov., 2011)</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.jebriggs.com/blog/2011/07/cassandra-sf-2011-san-francisco/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>O&#8217;Reilly Velocity Conference Web Performance and Operations Conference 2011, Santa Clara</title>
		<link>http://www.jebriggs.com/blog/2011/06/oreilly-velocity-conference-2011/</link>
		<comments>http://www.jebriggs.com/blog/2011/06/oreilly-velocity-conference-2011/#comments</comments>
		<pubDate>Sun, 26 Jun 2011 23:22:57 +0000</pubDate>
		<dc:creator>Administrator</dc:creator>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Perl]]></category>
		<category><![CDATA[Tech]]></category>
		<category><![CDATA[Toys]]></category>

		<guid isPermaLink="false">http://www.jebriggs.com/blog/?p=3502</guid>
		<description><![CDATA[I was too busy to make it to the Velocity Conference this year from June 14-16, but here&#8217;s some talk links. YouTube: O&#8217;Reilly Velocity 2011 Playlist YouTube: Velocity 2011 Search Results YouTube: O&#8217;Reilly Velocity 2010 Playlist Overclocking SSL at Google &#8230; <a href="http://www.jebriggs.com/blog/2011/06/oreilly-velocity-conference-2011/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>I was too busy to make it to the <a href="http://velocityconf.com/velocity2011">Velocity Conference</a> this year from June 14-16, but here&#8217;s some talk links.</p>
<p><a href="http://www.youtube.com/user/OreillyMedia#p/c/C394849408B5F203">YouTube: O&#8217;Reilly Velocity 2011 Playlist</a><br />
<a href="http://www.youtube.com/results?search_query=velocity+2011">YouTube: Velocity 2011 Search Results</a></p>
<p><a href="http://www.youtube.com/user/OreillyMedia#p/c/D1D3B0B233F2AD66">YouTube: O&#8217;Reilly Velocity 2010 Playlist</a><br />
<a href="http://www.imperialviolet.org/2010/06/25/overclocking-ssl.html">Overclocking SSL at Google (2010)</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.jebriggs.com/blog/2011/06/oreilly-velocity-conference-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>MySQL 5.5 News from Oracle OpenWorld</title>
		<link>http://www.jebriggs.com/blog/2010/10/mysql-5-5-news-from-oracle-openworld/</link>
		<comments>http://www.jebriggs.com/blog/2010/10/mysql-5-5-news-from-oracle-openworld/#comments</comments>
		<pubDate>Sun, 03 Oct 2010 06:18:08 +0000</pubDate>
		<dc:creator>Administrator</dc:creator>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Tech]]></category>
		<category><![CDATA[Toys]]></category>

		<guid isPermaLink="false">http://www.jebriggs.com/blog/?p=2390</guid>
		<description><![CDATA[I received an interesting email about MySQL news from Oracle. It was badly formatted in my email client, so I have reformatted it below. Sender: &#8220;Oracle MySQL&#8221; Subject: News from MySQL Sunday at Oracle OpenWorld Date: Sat, 2 Oct 2010 &#8230; <a href="http://www.jebriggs.com/blog/2010/10/mysql-5-5-news-from-oracle-openworld/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>I received an interesting email about MySQL news from Oracle.</p>
<p>It was badly formatted in my email client, so I have reformatted it below.</p>
<div style="color:#000000; background-color:#faf0e6;">
<p>Sender: &#8220;Oracle MySQL&#8221;<br />
Subject: News from MySQL Sunday at Oracle OpenWorld<br />
Date: Sat, 2 Oct 2010 11:17:17 -0700</p>
<hr />
Did you miss the first MySQL Sunday at Oracle OpenWorld?</p>
<p>Here&#8217;s your opportunity to catch up on the exciting news and developments. In his keynote, Edward Screven, Oracle&#8217;s Chief Corporate Architect, underlined Oracle&#8217;s investment in MySQL and demonstrated how Oracle is making MySQL better every day.</p>
<p>As a testament to that investment, Edward announced the immediate availability of the release candidate version of MySQL 5.5, delivering major enhancements in performance &#038; scalability, availability and usability.</p>
<p><a href="http://medianetwork.oracle.com/media/show/15664?n=subCategory&#038;nid=35">Watch Edward&#8217;s presentation</a></p>
<p>With MySQL 5.5 You Get:</p>
<ol>
<li>Improved performance and scalability: Benchmarks showed up to 1,561% performance gains for Read/Write operations on Windows, and up to 364% performance gain in Read/Write operations on Linux!</li>
<li>Higher availability: New semi-synchronous replication and replication Heart Beat improve the reliability of data and the speed of failover for continuous application availability.</li>
<li>Improved usability: Enhanced index and table partitioning, signal/re-signal support and diagnostics further improve the usability of MySQL 5.5.</li>
</ol>
<p>Learn More and Get Going with MySQL 5.5:</p>
<ul>
<li><a href="http://dev.mysql.com/downloads/mysql/">Download MySQL 5.5 RC</a></li>
<li>Join us Live Webinar: <a href="http://dev.mysql.com/news-and-events/web-seminars/display-572.html">Delivering Scalability and High Availability with MySQL 5.5 Replication Enhancements</a></li>
</ul>
<p><a href="http://www.mysql.com/contact">Contact</a> your MySQL sales representative for more information online or at the number for your region below:</p>
<ul>
<li>USA/Canada &#8211; Toll Free: +1-866-221-0634 USA. From abroad: +1-208-338-8100</li>
<li><a href="http://www.mysql.com/about/contact/">Additional phone numbers</a> for EMEA, Latin America and APAC</li>
</ul>
<p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.jebriggs.com/blog/2010/10/mysql-5-5-news-from-oracle-openworld/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Defcon 18, Las Vegas</title>
		<link>http://www.jebriggs.com/blog/2010/08/defcon-18-las-vegas/</link>
		<comments>http://www.jebriggs.com/blog/2010/08/defcon-18-las-vegas/#comments</comments>
		<pubDate>Mon, 02 Aug 2010 02:13:15 +0000</pubDate>
		<dc:creator>Administrator</dc:creator>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Tech]]></category>
		<category><![CDATA[Toys]]></category>
		<category><![CDATA[Travel]]></category>

		<guid isPermaLink="false">http://www.jebriggs.com/blog/?p=2200</guid>
		<description><![CDATA[DEF CON 18 was held once again in Las Vegas at the Riviera Convention Center. There were a handful of talks on the subjects of DNS and IPv6. The hacker Jeopardy session was a lot of fun. I think the &#8230; <a href="http://www.jebriggs.com/blog/2010/08/defcon-18-las-vegas/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><a href="http://defcon.org/html/defcon-18/dc-18-index.html">DEF CON 18</a> was held once again in Las Vegas at the Riviera Convention Center.</p>
<p>There were a handful of talks on the subjects of DNS and IPv6.</p>
<p>The hacker Jeopardy session was a lot of fun. I think the audience got more correct answers than the panel. I was impressed with the software somebody wrote to show the game categories &#8211; very convincing. Afterward, the <a href="http://www.eff.org/">EFF</a> had an interesting fundraiser (your photo beside a &#8220;model&#8221;.)</p>
<p>The weather was hot but clear. The McDonald&#8217;s across the street is open 24 hours and has free WiFi.</p>
<p>I walked over to the <a href="http://www.thefashionshow.com/">Fashion Show Mall</a> (about 1 mile.) It has a variety of restaurants on different levels, including a <a href="http://www.maggianos.com/locations/detail.asp?unit_id=001.025.0193">Maggiano&#8217;s,</a> the <a href="http://www.thefashionshow.com/dining-entertainment/the-capital-grille">Capital Grille,</a> and a gourmet burger stand.</p>
<p><a href="http://www.theregister.co.uk/2010/08/06/ipv6_security_nightmare/">theregister.co.uk: Defcon speaker calls IPv6 a &#8216;security nightmare&#8217;</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.jebriggs.com/blog/2010/08/defcon-18-las-vegas/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>O&#8217;Reilly Open Source Conference 2010, Portland</title>
		<link>http://www.jebriggs.com/blog/2010/07/oscon-conference-2010-portland/</link>
		<comments>http://www.jebriggs.com/blog/2010/07/oscon-conference-2010-portland/#comments</comments>
		<pubDate>Fri, 23 Jul 2010 23:22:51 +0000</pubDate>
		<dc:creator>Administrator</dc:creator>
				<category><![CDATA[BSD]]></category>
		<category><![CDATA[Business]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Conferences]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[OSCON]]></category>
		<category><![CDATA[Perl]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Tech]]></category>
		<category><![CDATA[Toys]]></category>
		<category><![CDATA[Travel]]></category>
		<category><![CDATA[User Groups]]></category>

		<guid isPermaLink="false">http://www.jebriggs.com/blog/?p=2140</guid>
		<description><![CDATA[Once again, the O&#8217;Reilly Open Source Conference (OSCON) was held in Portland, Oregon. It was a good conference, and we had beautiful weather all week long. Executive Summary The themes promoted by the conference organizers were Cloud Computing, NoSQL, Emerging &#8230; <a href="http://www.jebriggs.com/blog/2010/07/oscon-conference-2010-portland/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Once again, <a href="http://www.oscon.com/oscon2010">the O&#8217;Reilly Open Source Conference (OSCON)</a> was held in Portland, Oregon.</p>
<p>It was a good conference, and we had beautiful weather all week long.</p>
<p><strong>Executive Summary</strong></p>
<p>The themes promoted by the conference organizers were Cloud Computing, NoSQL, Emerging Languages (Scala, Erlang, Parrot, Go) and Android phone development.</p>
<p>The <a href="http://twitter.com/oscon">@oscon</a> twitter channel was heavily used to coordinate amongst organizers and attendees. I used the <a href="http://www.twixtreme.com/">TwiXtreme</a> twitter client program on my BlackBerry.</p>
<p>Plug Computers were very popular in the Expo area. They are 5 watt ARM-based computers running Debian Linux that fit into a power brick-sized case and cost $99 to $129 depending on features. The Marvell booth had a few models on display, from GlobalScale <a href="http://www.globalscaletechnologies.com/c-2-globalscale-technologies-products.aspx">(GuruPlug)</a> and <a href="http://www.ionics-ems.com/plugcomputer.html">Ionics.</a> High-end models have dual gigabit NICs, multiple USB ports, a WiFi access point and other expansion ports.</p>
<p>There was also continuing buzz regarding Facebook&#8217;s <a href="http://www.facebook.com/note.php?note_id=388112370932">Flashcache SSD module (GPL v2)</a> for linux, and also ZFS snapshots.</p>
<p><strong>Tutorials</strong></p>
<p>I went to the <a href="http://gearman.org/">Gearman</a> Cookbook tutorial, the first half of the <a href="http://opscode.com/chef/">Chef</a> tutorial and some of the Cloud Summit talks.</p>
<p>The <a href="http://gearman.org/">Gearman</a> Cookbook tutorial was excellent. After a detailed overview of the Gearman architecture and implementations in Perl and C, a number of use cases were explored in detail, including before and after code samples. The talk was both easy to listen to as an overall survey, as well as providing immediately useful info for those wanting to deploy it.</p>
<p>The <a href="http://opscode.com/chef/">Chef</a> tutorial was very detailed &#8211; too much so perhaps. I went to the first half only, since I am not planning to implement Chef soon (I use PXE and anaconda/kickstart with CentOS), and did not need that level of detail at this time. cfengine, puppet and chef are ops tools for configuring servers. Chef uses Ruby data structures for its configuration files, and has include files and other useful syntax. Basically, users can &#8220;code&#8221; server configuration, as if they were traditional apps.</p>
<p>I went to some of the <a href="http://www.oscon.com/oscon2010/public/schedule/detail/15295">Cloud Summit talks</a> and BOFs, but found that anybody who has done a simple project using EC2 knew as much or more than the speakers, some I would call blowhards.</p>
<p>Marten Mickos, president of Eucalyptus, is refreshing in that he is always clear about being in it for the money, while also promoting Open Source.</p>
<p><strong>Sessions</strong></p>
<p>Some of the most memorable sessions to me were:</p>
<p><strong>Introduction to MongoDB, Kristina Chodorow (MongoDB)</strong></p>
<p><a href="http://www.snailinaturtleneck.com/blog/">Kristina</a> is the maintainer of the Perl and PHP drivers for MongoDB. She gave an overview of MongoDB, a NoSQL document store, and its command-line interface, which uses JavaScript. </p>
<p>Some day she will release <a href="http://www.snailinaturtleneck.com/blog/2010/06/30/managing-your-mongo-horde-with-genghis-khan/">a sharding tool</a> for MongoDB.</p>
<p><strong>Scaling SourceForge with MongoDB, Nosh Petigara (10gen), Rick Copeland (SourceForge.net / GeekNet) </strong></p>
<p>Nosh and Rick gave an excellent review of incorporating MongoDB into the SourceForge site.</p>
<p>- SF query load is mostly read-only<br />
- ops team benchmarked a few NoSQL candidates, and MongoDB won on performance<br />
- original MySQL servers had 64 GB RAM. After migration to MongoDB, same server machines but only 8 GB RAM<br />
- backup dumps are verified to be bitwise the same as masters<br />
- have to be careful not to dump all documents in your database to the network or it will max out switches<br />
- SF relies on first-class data centers and replication slaves, less worried about MongoDB mmap (not crash-safe)<br />
- I personally looked at their performance numbers and site graphs (on an iPad), and the end result was impressive.</p>
<p><strong>Perl Lightning Talks</strong></p>
<p>As always, the Perl Lightning Talks are a highpoint of the conference.</p>
<p>The &#8220;cartoon&#8221; of <a href="http://www.math.u-bordeaux1.fr/~pit/">Vincent Pit&#8217;s</a> remarkable CPAN module<a href="http://search.cpan.org/~vpit/">(VPIT)</a> contributions was both informative and hilarious. Vincent is a French Ph.D. candidate in advanced geometry.</p>
<p><strong>Cloud BOF (3 Hours)</strong></p>
<p>The Cloud BOF was disorganized, starting 30 minutes late and for some reason was subdivided into 4 audience groups. Startups and vendors trying to make a cloud sales push led the BOF, including cloud and DNS service providers.</p>
<p>The Health Regulations subgroup came up with a couple ways to make the Cloud palatable to regulators by using encryption on all data due to the multi-tenancy issues with sharing public VMs.</p>
<p>I was in the NoSQL group, which discussed general issues and particular successes. <a href="http://www.memcached.org/">Memcached</a> was the clearest winner, while some people also had success with MongoDB and Redis.</p>
<p>My neighbor was an engineer at <a href="http://www.postrank.com/">Postrank.com</a>. He said that they were happy with HAProxy, but much less happy with the unpredictable IO available when running MySQL on EC2. He also said to carefully look at storage volumes available to your instance, as one is a useful tmpfs. They use <a href="http://www.authsmtp.com/">AuthSMTP</a> to get around EC2 being generally blacklisted for outbound email.</p>
<p><strong>Database BOFs</strong></p>
<p><strong>MySQL BOF</strong></p>
<p>The MySQL AB engineering staff has left Oracle. <a href="http://askmonty.org">Monty Program AB</a> (21 staff) has the core developers, and Percona Inc. (32 staff) has the consultants. Oracle still has some of the InnoDB programmers.</p>
<p>The business plan for Monty Program AB is 60% commercially-sponsored MySQL development, and 40% community-request development. Monty would like commercial users of MySQL to sponsor patches that would benefit them.</p>
<p>Mark mentioned that using Nehalem instructions for CRC were much faster, and that Facebook was using partitions for truncating tables instead of doing multi-record deletes. (See his blog for more details.)</p>
<p>One person mentioned using a commercial backup tool, <a href="http://www.r1soft.com/">R1Soft</a>, that inserts a linux kernel module to allow filesystem snapshots. He said to carefully test backup and restore in your environment, especially for filesystems greater than 1 TB which may exceed certain block counter limits. Peter said that some of his clients had used it with varying success.</p>
<p>It worked for him in his environment, and the file browser allows selective file restore (he uses it to restore by priority where a system runs multiple applications.) It starts at $299 for the Standard Edition, and also has MySQL Add-on and Enterprise Editions. </p>
<p><strong>PostgreSQL BOF</strong></p>
<p>The PostgreSQL BOF talked about 30 or so changes that went into version 9.</p>
<p>One of the most exciting new features is a native replication feature, called streaming replication (block-based.) The advantage over <a href="http://www.slony.info/">Slony-I</a> replication is that Slony-I is trigger-based, so has a variety of issues included inability to replicate DDL commands.</p>
<p>Some of the developers mimed replication events, which was rather amusing to watch. Yes, it was taped.</p>
<p>PostgreSQL is released under the <a href="http://www.opensource.org/licenses/postgresql">PostgreSQL Licence</a>, which is BSDish.</p>
<p>Peter Zaitsev, co-founder of <a href="http://www.percona.com/">Percona</a>, organized 3 BOFs, including XtraDB, XtraBackup, Maatkit, Percona Server, <a href="http://www.sphinxsearch.com/">Sphinx Search</a> and Running Databases on Flash Storage.</p>
<p><strong>Sphinx Search BOF</strong></p>
<p>Andrew Aksyonoff, the original programmer of Sphinx Search (GPL v2), couldn&#8217;t make it to OSCON (the good excuse was that he was busy coding), so Richard Kelm (Sphinx sales/customer support honcho) and Peter filled in (Percona is a business partner with Sphinx, and many of Percona&#8217;s clients use it.)</p>
<p>Some of the attendees were existing users, like myself, and some from HP and other companies were looking for a large-scale search solution or alternative to Lucene.</p>
<p>Monty mentioned that the latest MySQL 5.1 should be used, as there have been a number of performance and reliability improvements. Full-text search is supposed to be 10x faster than 5.0, and replication is nearly bug-free by now.</p>
<p>Sphinx Search now has <a href="http://www.sphinxsearch.com/docs/current.html#rt-indexes">real-time index updates</a> in version 1.1.0 beta. Another very nice feature is SQL+FS indexing.</p>
<p>Here is the full Sphinx 1.1.0 <a href="http://www.sphinxsearch.com/docs/current.html#rel110">changelog.</a></p>
<p><strong>Running Databases on Flash Storage BOF</strong></p>
<p>The Running Databases on Flash Storage BOF had a combination of MySQL and Postgres users who have tested or used most of the SSD products: FusionIO, violin, Intel, OCZ, etc. Everybody was happy with SSD IOPS performance, but less so with cost and metadata RAM requirements with the add-in boards (FusionIO may require 4 GB RAM for metadata.)</p>
<p>Peter said that 20% to 30% of his clients are already using SSD &#8211; across the spectrum of vendors and models. Some are also trying &#8220;massive RAM&#8221; solutions, like Cisco servers with 384 GB RAM.</p>
<p>Some users had 1+ TB Postgres databases with very thorny backup and mgmt. issues. One solution was to start a snapshot, but not do the copy operation.</p>
<p><strong>Expo Notes</strong></p>
<p>I had an enjoyable talk with Austin Hook, who has operated the OpenBSD Store for many years. He lives near Calgary, the center of OpenBSD/OpenSSH/PF development. He mentioned that some perennial financial contributors had stopped because of the recession, so here&#8217;s <a href="http://www.openbsd.org/donations.html">the donations link.</a></p>
<p>I also talked to some reps from a Brazilian outsourcing firm, <a href="http://www.actminds.com/">ActMinds.</a> They currently have 400 employees across Brazil and a sales office in Philadelphia. Brazil is only 2 hours ahead of EST. They said the minimum project size is 2 developers and developer turnover a low 5%/annum. Their pricing is $35 to $45/hour.</p>
<p>And I had fun handling the plug computers on display at the Marvell booth. The Ionics boards are amazingly densely populated.</p>
<p><strong>Discussions</strong></p>
<p>I had the opportunity to talk to a long-time Portland resident who works as a computer consultant. He said that the Portland economy is not doing great, and really hasn&#8217;t done well since old-growth logging was stopped after 90% of the forests were cleared. And although hundreds of miles of fiber optic has been laid downtown, it&#8217;s not available for residential use. However, the Beaverton area does have ubiquitous FTTH.</p>
<p>I also talked to somebody who attended the Emerging Languages talks. He&#8217;s working on his M.Sc. in Computer Science, so found those talks fascinating.</p>
<p><strong>Twitter Humor</strong></p>
<p>There were some humorous tweets:</p>
<p>- &#8220;my MongoDB and CouchDB mugs are fighting each other.&#8221;<br />
- &#8220;I got one MongoDB mug, but need two to safely store coffee.&#8221;</p>
<p><strong>Notes</strong></p>
<p>Note to self: skip the nightly parties unless you have a date. The bars are too loud to talk to anybody.</p>
<p>Note to the O&#8217;Reilly conference organizers: use meetup.com for the BOFs like ApacheCon does. The average audience was about 10 people, and with meetup it would  be 4x that.</p>
<p><a href="http://www.oscon.com/oscon2010/public/schedule/proceedings">OSCON 2010 Slides</a><br />
<a href="http://www.tbray.org/ongoing/When/201x/2010/07/21/DPH">Tim Bray: Desperate Perl Hacker</a><br />
<a href="http://www.youtube.com/results?search_query=oscon+2010&#038;aq=f">Youtube: OSCON 2010 videos</a><br />
<a href="http://blip.tv/?search=oscon2010;s=search">blip.tv: OSCON2010 videos</a><br />
<a href="http://en.wikipedia.org/wiki/Plug_computer">wikipedia: Plug Computer</a><br />
<a href="http://blog.zawodny.com/2010/05/22/mongodb-early-impressions/#comments">Jeremy Zawodny: MongoDB Early Impressions</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.jebriggs.com/blog/2010/07/oscon-conference-2010-portland/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>MySQL Conference 2010</title>
		<link>http://www.jebriggs.com/blog/2010/04/mysql-conference-2010/</link>
		<comments>http://www.jebriggs.com/blog/2010/04/mysql-conference-2010/#comments</comments>
		<pubDate>Mon, 12 Apr 2010 18:58:43 +0000</pubDate>
		<dc:creator>Administrator</dc:creator>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Perl]]></category>
		<category><![CDATA[Tech]]></category>
		<category><![CDATA[Toys]]></category>

		<guid isPermaLink="false">http://www.jebriggs.com/blog/?p=1774</guid>
		<description><![CDATA[The MySQL Conference was this week in Santa Clara. It was a well-organized and educational event with everybody involved in the MySQL community showing up once again. Executive Summary The highlights were: after 2 years of effort, the performance schema &#8230; <a href="http://www.jebriggs.com/blog/2010/04/mysql-conference-2010/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://en.oreilly.com/mysql2010/">MySQL Conference</a> was this week in Santa Clara. It was a well-organized and educational event with everybody involved in the MySQL community showing up once again.</p>
<p><strong>Executive Summary</strong></p>
<p>The highlights were:</p>
<ul>
<li>after 2 years of effort, the performance schema foundation is available as a 5.5.x patch. With another year of effort, it could be useful.
<li>the various community forks (Percona/XtraDB, MariaDB, OurDelta) will merge in the next 3 months into a maintenance fork by Monty Program, since MP has the most original MySQL developers.
<li>the various MySQL vendors are soldiering along, all releasing new, improved versions of their hw and sw products this year.
<li>The largest independent MySQL-centric consulting companies are Percona with 32 staff, and Monty Program 40 with staff, with a target of 50 employees.
<li>the MySQL source code will have to be modified to make MySQL fast enough to keep up with Fusion IO SSD devices. Currently, better than SSD performance can be gained by installing enough RAM to fit the entire database in buffer pool.
<li><a href="http://drizzle.org/">Drizzle</a> development is going nicely, but note that it&#8217;s not backward compatible with MySQL. Drizzle is a 64-bit only fork of MySQL with emphasis on community code development, increasing performance and maintainability through a plug-in architecture and strict code cleanliness.
</ul>
<p><strong>Monday Morning Tutorial</strong></p>
<p><strong>Using Partitioning in MySQL 5.1 and 5.5 with Giuseppe Maxia (Oracle)</strong></p>
<p>- available in MySQL 5.1 and later only<br />
- TO_DAYS and YEAR() special and recommended as they can prune partitions from lookups.<br />
- when using TO_DAYS() as a partitoning function, the first partition matters. Use value less than zero for first partition to create NULL partition to double performance as a bug workaround.<br />
- consider lock before inserting for all table types<br />
- for performance, consider non-partitioned on masters, partitions on slaves.<br />
- or different partition types</p>
<p>He also gave a nice tutorial on mysql sandboxes script.</p>
<p>Partition Limitations:</p>
<p>- cannot mix table types<br />
- cannot make read-only</p>
<p>I talked to some advanced users, and none have found a practical use for partitions in their environment that was better than using regular table types for logging type applications.</p>
<p>This is due to the fact that partitions do not increase fault-tolerance, often don&#8217;t benchmark any faster, and have little in the way of administrative mgmt. support after partition creation.</p>
<p>Partitions can increase performance in applications where the index serves to stripe operations, but most people are just using dates for logging, with no practical benefit, as most operations fall into the current date partition.</p>
<p><a href="http://tinyurl.com/par-tut-2010/">Slides</a></p>
<p><strong>Monday Afternoon Tutorial</strong></p>
<p>Talked to Arjen Lenz and a friend at lunch. </p>
<p>- OpenQuery is suitable for affordable, long-term contract databae admin, not firefighting<br />
- former partition tester and bugfixer<br />
- replication bug with TCP errors, nagios plugin should compare both replication lag seconds and log position<br />
- need SSL or heartbeat to detect/fix</p>
<p>memcached</p>
<p>- set all clients to same values<br />
- use JSON or YAML, not Storable or Pickl</p>
<p><strong>Tuesday</strong></p>
<p>Performance Schema with Peter Gulutzan</p>
<p>- coded by Alff, but not GA yet<br />
- PERFORMANCE_SCHEMA database optionally populated with events (mutex, lock, io) timing and count info<br />
- allows simple SQL reporting of performance</p>
<p>EXPLAIN Demystified with Baron Schwartz (Percona Inc.) </p>
<p>- perennial nice EXPLAIN overview<br />
- nice example of using mysql command prompt as pipeline for non-trivial processing</p>
<p>Introduction to InnoDB Monitoring System and Resource &#038; Performance Tuning with   Jimmy Yang</p>
<p>An Overview of Flash Storage for Databases with Vadim Tkach, Percona Inc.</p>
<p>- nice talk with useful performance graphs</p>
<p>Linux Performance Tuning and Stabilization Tips with Yoshinori Matsunobu, Sun Microsystems</p>
<p>- nice talk with detailed slide examples &#8211; he&#8217;s a hard worker<br />
- he&#8217;s a fan of xfs, so some info not always useful for ext3. ie. deadline scheduler may be better on xfs, but it feels the same to me as cfq on ext3.</p>
<p><strong>Wednesday</strong></p>
<p>More Mastering the Art of Indexing with Yoshinori Matsunobu, Sun Microsystems</p>
<p>- second-part continuation of his talk from last year (!) Were you there?<br />
- his understanding of the space requirements of blobs in Innodb is different than Peter Zaitsev&#8217;s.</p>
<p>Faster Than Alter &#8211; Less Downtime with Chris Schneider (Ning.com) </p>
<p>- Hipster presentation on doing practical DBA tasks<br />
- likes doing dump and restore on Innodb tables, 30% faster afterwards on his tables.</p>
<p>InnoDB Architecture and Performance Optimization with Peter Zaitsev, Percona Inc.</p>
<p>- perennial comprehensive overview of Innodb<br />
- talked about differences between <a href="http://www.innodb.com/doc/innodb_plugin-1.0/innodb-file-format.html#innodb-file-format-identifying">Antelope and Barracuda file formats</a><br />
<a href="http://lwn.net/Articles/329626/">lwn.net: A look at the MySQL forks</a></p>
<p><strong>BOFs</strong></p>
<p>O&#8217;Reilly failed to use meetup.com to promote the BOFs once again at this conference, so turnout was light to moderate as in past years.</p>
<p><strong>Sphinx BOF hosted by Andrew Aksyonoff</strong></p>
<p>I&#8217;ve been familiar with <a href="http://www.sphinxsearch.com/">SphinxSearch</a> for years and am a production user, so the general audience discussion was not interesting to me.</p>
<p>However, I had a chance to talk to Andrew about my take on the October Apache (Search!) Conference last October and suggested a few things:</p>
<ul>
<li>explain collections on the Sphinx homepage, since many users insist on this feature. The question, of course, is what does the term &#8216;collections&#8217; mean to various people?
<li>make it possible for a non-technical end-user (like a marketing asst.) to highlight 10 items for feature on the first page of results
<li>Microsoft is EOLing FAST for linux users, so think about promotion to that segment, who is considering migration to Lucene mainly &#8211; because Lucene is free, and migration is the same cost to any other product.
<li>look at the myriad &#8220;value-added features&#8221; of commercial search engines, mostly related to adserver integration, and decide what can be supported.
</ul>
<p><strong>MariaDB BOF hosted by Monty</strong></p>
<p>Not much talk about MariaDB, but lots of drinking! (See Monty&#8217;s keynote for more detailed info.)</p>
<p><strong>Conference Wrapup</strong></p>
<p>Overall, another good MySQL conference. The organizers restored balance to the presentations, with a fair number of independent consultants and end-users doing talks. (Though I miss the awesome Percona Performance Conference from last year.)</p>
<p>BOFs should be promoted on meetup.com to double participation.</p>
<p>There should be a room with exotic hardware to demonstrate high performance MySQL and MySQL Cluster configurations &#8211; SANs, Infiniband, failover, etc.</p>
<p>The lunch food was quite good on all days, as noted by several people. (Important because the suburban venue is not within walking distance to outside restaurants.)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.jebriggs.com/blog/2010/04/mysql-conference-2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ApacheCon 2009 Oakland</title>
		<link>http://www.jebriggs.com/blog/2009/11/apachecon-2009-oakland/</link>
		<comments>http://www.jebriggs.com/blog/2009/11/apachecon-2009-oakland/#comments</comments>
		<pubDate>Fri, 06 Nov 2009 21:52:43 +0000</pubDate>
		<dc:creator>Administrator</dc:creator>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Perl]]></category>
		<category><![CDATA[Tech]]></category>
		<category><![CDATA[Toys]]></category>
		<category><![CDATA[Travel]]></category>

		<guid isPermaLink="false">http://www.jebriggs.com/blog/?p=1225</guid>
		<description><![CDATA[I went to ApacheCon 2009 in Oakland. Why Oakland? The ASF was founded here 10 years ago. Executive Summary Most of the attendees that I talked to were primarily interested in search technologies, or were Apache project comitters. The search &#8230; <a href="http://www.jebriggs.com/blog/2009/11/apachecon-2009-oakland/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>I went to <a href="http://us.apachecon.com/c/acus2009/">ApacheCon 2009</a> in Oakland. Why Oakland? The ASF was founded here 10 years ago.</p>
<p><strong>Executive Summary</strong></p>
<p>Most of the attendees that I talked to were primarily interested in search technologies, or were Apache project comitters. The search users were already using either Lucene and Solr, or were using commercial software and evaluating Lucene and Solr.</p>
<p>Also a lot of interest in Hadoop, Zookeeper and NoSQL projects.</p>
<p>I added a wikipedia <a href="http://en.wikipedia.org/wiki/Nosql#Implementations">NoSQL project features table</a> after the NoSQL BoF.</p>
<p>The conference was very well-organized, with tutorials, BoFs, a BarCamp, and sessions. Meetup.com was used to generate the highest BoF turnout that I&#8217;ve ever seen &#8211; close to 100 at the Lucene and Hadoop BoFs. (O&#8217;Reilly Conferences can learn from that.)</p>
<p>The Oakland Convention Center was a good venue for this conference, though the attached Oakland Marriott hotel is $$$$ and fond of surcharges, like $33/day for parking, $5 draught beer and $3.75 for a bottle of water in-room.</p>
<p>The keynotes and one track per day were recorded and are available for $99 at <a href="https://streaming.linux-magazin.de/en/registration.php">Linux Pro Magazine Streaming</a>.</p>
<p>StoneCircle Productions was the conference organizer.</p>
<p><strong>Conference Notes</strong></p>
<p><strong>Monday</strong></p>
<p>Although I live in San Jose, Oakland is far enough away that I&#8217;ve never been there. Oakland  has a compact downtown full of historical-era buildings, and Alameda is also nice, but things get less pretty at night.</p>
<p>I went to the Lucene tutorial on Monday.</p>
<p><strong>Lunch Conversations</strong></p>
<p>- awesome views of Bay Area past Golden Gate bridge from 21st floor<br />
- FAST pretty good indexing and search solution, but bought by Microsoft recently (going to continue linux support or not?)<br />
- FAST has FQL (users pronounce it fecal) query language <img src='http://www.jebriggs.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /><br />
- 150 FAST servers replaced by 40 lucene servers by 1 company<br />
- FAST4 to FAST5 upgrade tough, similar to port to say lucene, forced upgrades for support<br />
- linguistics is 60% of value of Fast according to Monster, 13 languages supported<br />
- &#8220;bad stems&#8221; can be a nightmare<br />
- SOLR gives you 90% of what you would need to program in java, built on top of Lucene<br />
- Open Source search is not really about price, but about control and flexibility</p>
<p><strong>Monday Afternoon &#8211; Lucene Tutorial</strong></p>
<p>- user-assigned document id not mandatory, but great idea for many reasons, including after an index-rebuild<br />
- lucene-assigned id only valid for that snapshot (life of score doc)<br />
- parameter to keep or delete old index directory<br />
- StringBuilder is more efficient than strcat<br />
- populating title column is a good idea<br />
- results boosting handy for ecommerce, specials, etc.<br />
- LUKE &#8211;  handy tool for index statistics, etc.<br />
- Searcher class, snapshot in time, won&#8217;t see new merges<br />
- contrib/ has more analyzers<br />
- snowball stemmers<br />
- use 1 tokenizer and 0 or more token filters<br />
- precision-recall curve ??<br />
- n-grams and shingles (&#8220;the president&#8221;, &#8220;United states&#8221;)<br />
- pre-2.9 lucene, numbers and dates really strings<br />
- 2.9 NumericField builds tri structure, help optimize range queries<br />
- SOLR analysis tool apache-solr<br />
- relevance feedback with MoreLikeThis</p>
<p><strong>Monday BoFs</strong></p>
<p><strong>Couchdb</strong></p>
<p>- &#8220;ground computing&#8221;<br />
- &#8220;offline by default&#8221;<br />
- now an ubuntu service<br />
- mozilla raindrop to combine chat client msgs<br />
- lockless<br />
- append-only btree<br />
- rsyncable since append-only, also replication<br />
- checksums everywhere<br />
- windows not first class yet, mozilla improving it</p>
<p><strong>@mozilla</strong></p>
<p>- browsercouch<br />
- don&#8217;t like sql<br />
- brasstacks test tool storage<br />
- store now, index later<br />
- replicate to handle large indexing load<br />
- testbot ci</p>
<p><strong>Marklogic</strong></p>
<p>- commercial<br />
- xml-centric<br />
- great for articles, books<br />
- transactional<br />
- search-centric<br />
- structure-aware<br />
- schema-free<br />
- xquery-driven<br />
- extremely fast, largest 200 TB xml, 166 on hosts<br />
- clustered<br />
- database server<br />
- 180 clients, 150 employees<br />
- <a href="http://markmail.org/">markmail.org</a> demo contains 42 million email messages, very impressive performance with 5 views in almost realtime. Search is distributed across 160 nodes.</p>
<p><strong>JCR in 15 minutes</strong></p>
<p>- Bertrand Del<br />
- <a href="http://jackrabbit.apache.org/">JCR</a> is JackRabbit,<br />
a fully conforming implementation of the Content Repository for Java Technology API (JCR). A content repository is a hierarchical content store with support for structured and unstructured content, full text search, versioning, transactions, observation, and more.<br />
- the ultimate content store<br />
- content repo, union of database and filesystem, best of both worlds<br />
- full-text search combined with structured search</p>
<p><strong>Solr Flair</strong></p>
<p>- information forage<br />
- &#8220;resume-driven design&#8221;</p>
<p><strong>Lucene Numerics</strong></p>
<p>- available in 1.4<br />
- tune by modifying precisionStep</p>
<p><strong>HBASE</strong></p>
<p>One bewildered attendee wished for a NoSQL product matrix, so I added that to the wikipedia <a href="http://en.wikipedia.org/wiki/Nosql">NoSQL</a> page.</p>
<p><strong>Wednesday Sessions</strong></p>
<p><strong>Becoming a Pig Developer, Alan Gates</strong></p>
<p>- Apache Pig is a sub-project of Apache Hadoop.<br />
- this talk was really how to use PIG as an end-user, not to become a Pig project developer</p>
<p><strong>Apache Hadoop in the Cloud, Tom White</strong></p>
<p>- general comments on using EC2 with Hadoop mostly</p>
<p><strong>Practical HBase, Michael Stack</strong></p>
<p>- Apache HBase is the Apache Hadoop database, similar to BigTable.<br />
- HBASE usage</p>
<p><strong>mod_jk / mod_proxy and others, Jean-Frederic Clere and 2 others</strong></p>
<p>- mod_jk, mod_proxy, mod_serf and mod_cluster original topics<br />
- mostly focused on mod_jk, mod_proxy and isapi_redirect<br />
- good talk by 3 long-term project contributors<br />
- jk is kind of Java-centric, with support for <a href="http://en.wikipedia.org/wiki/Apache_JServ_Protocol">Apache JServ Protocol (AJP)</a> only available in Java back-end servers for now, like Tomcat<br />
- isapi_redirect is primary way to do redirects on Windows IIS<br />
- survey of audience showed several mod_proxy users, maybe one intentional mod_jk user</p>
<p><strong>Thursday Sessions</strong></p>
<p>&#8220;Apache Lucene and Apache Solr Performance Tuning with Mark Miller&#8221; was packed, so moving along to a different room &#8230;</p>
<p><strong>Scalable Internet Architectures, Theo Schlossnagle</strong></p>
<p>- amazing and thought-provoking talk, also one of the most popular<br />
- think about performance from network packet level to application level<br />
- carp, vrrp, whackamole<br />
- alterdns, neustar<br />
- dynact<br />
- <a href="http://en.wikipedia.org/wiki/Anycast">anycast</a> (shared IP), geoip (but need actually accurate database)<br />
- activemq, rabbitmq instead of Spread<br />
- &#8220;memcached is the worst thing that ever happened to our industry &#8211; it solves a problem, just not the original problem&#8221;</p>
<p>- many apps today are so poorly designed that network issues never become scalability concerns &#8211; ie. RoR applications <img src='http://www.jebriggs.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /><br />
- max out at 500 requests per second across 40 boxes &#8211; RoR<br />
- firebug and yslow have been fantastic at making front-end engineers aware of networking performance<br />
- 10 gb nics suck<br />
- instead of one big 20 Gbps loadbalancer, use anycast from core router to 5x 4 gpbs cheaper load balancers<br />
- spiky load or DDoS &#8211; announce a /32 to separate load balancer, use symmetric return path</p>
<p>- jms, aqmp, spread</p>
<p>durable message queues</p>
<p>- activemq (java)<br />
- openamq (c) &#8211; hard to use<br />
- rabbitmq (erlang) &#8211; nice except in durable mode because erlang disk io blows</p>
<p>- most common protocol <a href="http://stomp.codehaus.org/Protocol">Stomp</a> is awful and slow (hard to read 100k messages per second) and not binary, but lots of clients exist.</p>
<p>- activemq and stomp is a good start.<br />
- rabbitmq and native connectors are better, but no perl client.</p>
<p>- PCI compliance requires a stateful firewall. Hard to do 1.5 million packets per second traffic for most medium-sized data centers, need to use a CDN to distribute static requests and distribute the packets somewhere else<br />
- leaving trailing / off causes 302, doubles traffic<br />
- <a href="http://lethargy.org/~jesus/misc/Scalable-acus2009.pdf">Slides</a><br />
- read/write ratio is 1 &#8230; likely IM or email?<br />
- went over some networking details with Paul L. afterwards</p>
<p><strong>Recent Developments in SSL and Browsers, Rick Andrews, Thawte</strong></p>
<p>- 1.6 billion OCSP requests per day, need good infrastructure to support that<br />
- intermediate CA allows root CA to be offline &#8211; chained hierarchy &#8211; SSLCertificateChainFile,<br />
needs intermediate certificates before cross-certificates, some  clients need in proper order<br />
- EV hierarchy more complex. wanted new EV root, but older browsers don&#8217;t know about it.<br />
- browser ubiquity problem with any new feature, hash or crypto algorithm<br />
- logotypes &#8211; trademark and copyright issues with using other companies&#8217; logos in a product<br />
- Verisign does not have apache httpd committers, but should<br />
- 1 attendee wanted to sign JavaScript files, but what does it mean if most sites link to 10 advertising and tracking scripts? what do you tell the user if 1 JS is not signed?</p>
<p><strong>Subversion Meetup</strong></p>
<p>Organizers didn&#8217;t show up, so spent 10 minutes talking to a handful of end-users about subversion gripes and moved along to &#8230;</p>
<p><strong>Hadoop Meetup</strong></p>
<p><strong>Zookeeper</strong></p>
<p>- zk is persistent to disk<br />
- can run on one node, but 3 is minimum non-toy<br />
- zk is popular in academia now for some reason<br />
- avoid split-brain partitioning between 2 data centers &#8211; bad<br />
- very recent merge to fix -368, not ready for production yet<br />
- people using it for a message queue, perhaps more reliable than many other Open Source ones<br />
- need 1 zk node for testing, but 3 zk nodes for non-trivial implementation</p>
<p><strong>Scribe</strong></p>
<p>- github<br />
- 4x to 5x compression with lzo. similar disk bw improvement</p>
<p>A local owner of a gelato store handed out 6 free samples from a portable gelato freezer. <img src='http://www.jebriggs.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p><strong>Friday Sessions</strong></p>
<p><strong>Building Intelligent Search Applications with the Lucene Ecosystem, Ted Dunnin</strong></p>
<p>- some matrix math<br />
- using his matrix math optimization, a perl program on 1 server was faster than Mahout running on a $250k  cluster <img src='http://www.jebriggs.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /><br />
- tdunning.blogspot.com</p>
<p>- the original LLR in NLP paper<br />
&#8220;Accurate Methods for the Statistics of Surprise and Coincidence&#8221; check on citeseer<br />
- Mahout project<br />
tdunning [at] apache.org</p>
<p><strong>Realtime Search, Jason Rutherglen</strong></p>
<p>- many technical issues prevent Lucene from being able to do realtime search<br />
- lots of patches done, lots to do<br />
- audience member thanked author for great work so far</p>
<p><strong>Closing Plenary: Brian Behlendorf on Open Source and Charity</strong></p>
<p>Talked to <a href="http://www.jroller.com/akarasulu/">Alex Karasulu</a> a little after the final presentation. He&#8217;s a committer on the Apache Directory project. He suggested adding dbm to the <a href="http://en.wikipedia.org/wiki/Nosql">NoSQL</a> product matrix. Wants a MacBook Air with 8 GB RAM to run his Java apps. <img src='http://www.jebriggs.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p><a href="http://www.us.apachecon.com/c/acus2009/schedule/grid">Conference Schedule Grid</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.jebriggs.com/blog/2009/11/apachecon-2009-oakland/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk
Page Caching using disk (enhanced)
Database Caching 1/48 queries in 0.012 seconds using disk
Object Caching 819/918 objects using disk

Served from: www.jebriggs.com @ 2012-02-06 05:42:58 -->
