<?xml version="1.0" encoding="utf-8"?>
<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>Disqus - Latest Comments for gengstrand</title><link>http://disqus.com/by/gengstrand/</link><description></description><atom:link href="http://disqus.com/gengstrand/comments.rss" rel="self"></atom:link><language>en</language><lastBuildDate>Sat, 10 Jan 2015 15:04:50 -0000</lastBuildDate><item><title>Re: Open Source Is Data Science's Missing Ingredient - by Matt Asay</title><link>http://readwrite.com/2015/01/08/open-source-big-data-science-python#comment-1785430978</link><description>&lt;p&gt;An important message, especially for the fortune 500. Open source is so successful in big data that you could not have a credible claim about using big data without open source. You have already mentioned the most common open source projects. What I learned this year at &lt;a href="http://it.toolbox.com/blogs/future-of-work/big-data-at-oscon-2014-64531" rel="nofollow noopener" target="_blank" title="http://it.toolbox.com/blogs/future-of-work/big-data-at-oscon-2014-64531"&gt;http://it.toolbox.com/blogs...&lt;/a&gt; is that there are some additional projects like Cascading, Storm, and Druid that are also worth some attention.&lt;/p&gt;&lt;p&gt;The most significant role in big data where I still see proprietary software being used is discussed at &lt;a href="https://about.zoosk.com/zh/engineering-blog/marriage-of-hadoop-and-olap-best-of-both-worlds-to-make-sense-of-200-terabytes-of-data/" rel="nofollow noopener" target="_blank" title="https://about.zoosk.com/zh/engineering-blog/marriage-of-hadoop-and-olap-best-of-both-worlds-to-make-sense-of-200-terabytes-of-data/"&gt;https://about.zoosk.com/zh/...&lt;/a&gt; where commercial OLAP cubes are used as a kind of front end GUI to Hive. Other places include connectors to proprietary databases for ETL purposes and cluster management dashboard tools.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Sat, 10 Jan 2015 15:04:50 -0000</pubDate></item><item><title>Re: Big Data Source Code: Getting Better All The Time - by Matt Asay</title><link>http://readwrite.com/2014/12/11/big-data-open-source-software-improving-coverity#comment-1740969166</link><description>&lt;p&gt;It's hard to measure and compare proprietary code quality to open source code since the source code to most proprietary software cannot be analyzed for errors. I think that most companies today have more trust in open source because of the communities surrounding the more popular projects and because their already trusted engineers can analyze the open source code.&lt;/p&gt;&lt;p&gt;Here are a couple more big data open source projects that you missed. Cascalog makes it much easier for developers to write hadoop map reduce jobs. Apache Spark is a similar technology known for its high performance and has gotten quite a bit of industry attention. See &lt;br&gt;&lt;a href="http://glennengstrand.info/analytics/distributed/functional/programming" rel="nofollow noopener" target="_blank" title="http://glennengstrand.info/analytics/distributed/functional/programming"&gt;http://glennengstrand.info/...&lt;/a&gt; &lt;br&gt;for  how and why these two technologies accelerate big data engineering. Both Cascalog and Spark are now supported by the major Hadoop vendors Hortonworks and Cloudera.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Sat, 13 Dec 2014 21:47:23 -0000</pubDate></item><item><title>Re: Hadoop 2.0 &amp;amp; YARN: Get Ready For This Summer&amp;#039;s Big Data Breakthrough  - by Brian Proffitt</title><link>http://readwrite.com/2013/05/24/hadoop-20-yarn-bid-data-mapreduce#comment-910041527</link><description>&lt;p&gt;YARN, or MRv2, has been out for over a year now. From &lt;a href="http://www.dynamicalsoftware.com/analytics/oss" rel="nofollow noopener" target="_blank" title="http://www.dynamicalsoftware.com/analytics/oss"&gt;http://www.dynamicalsoftwar...&lt;/a&gt; you can see that it is easy to build Java apps that run inside the Hadoop cluster. As with many open source big data technologies, the sweet spot for ISVs is enterprise integration tools such as ETL 4 OLAP.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Mon, 27 May 2013 13:44:52 -0000</pubDate></item><item><title>Re: Hadoop Adoption Accelerates, But Not For Data Analytics - by Matt Asay</title><link>http://readwrite.com/2013/05/10/hadoop-adoption-accelerates-but-not-for-what-you-might-think#comment-894880458</link><description>&lt;p&gt;I believe that labeling the use of open source software in managing big data as a "poor man's ETL" is a bit disingenuous. Also, I am not so sure about the term "unsupervised landfill" which feels to me to be overly provocative.&lt;/p&gt;&lt;p&gt;I have found ( see &lt;a href="http://www.dynamicalsoftware.com/analytics/oss" rel="nofollow noopener" target="_blank" title="http://www.dynamicalsoftware.com/analytics/oss"&gt;http://www.dynamicalsoftwar...&lt;/a&gt; ) that companies use Hadoop to crunch their big data down into something that can be loaded into OLAP cubes. Perhaps most companies aren't really aspiring to something as ambitious as a recommendation engine but that is a far cry from "unsupervised landfill."&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Sun, 12 May 2013 14:58:11 -0000</pubDate></item><item><title>Re: Open Source Search with Lucene &amp; Solr</title><link>http://www.igvita.com/2010/10/22/open-source-search-with-lucene-solr/#comment-872927074</link><description>&lt;p&gt;No wonder lucene is so prevalent in so many open source projects. It is high performance yet easy to use. You can find another tutorial embedding the next major release lucene 4 at &lt;a href="http://www.dynamicalsoftware.com/embedded/lucene" rel="nofollow noopener" target="_blank" title="http://www.dynamicalsoftware.com/embedded/lucene"&gt;http://www.dynamicalsoftwar...&lt;/a&gt; with emphasis on multi-threading concerns.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Tue, 23 Apr 2013 02:21:00 -0000</pubDate></item><item><title>Re: Programming and Data Interpretation Are the New Literacy</title><link>https://danielmiessler.com/blog/programming-and-data-interpretation-are-the-new-literacy#comment-731804766</link><description>&lt;p&gt;Programming and the ability to analyze has always been pretty important in my career. Also, consider &lt;a href="http://www.dynamicalsoftware.com/internet/skills" rel="nofollow noopener" target="_blank" title="http://www.dynamicalsoftware.com/internet/skills"&gt;http://www.dynamicalsoftwar...&lt;/a&gt; for 9 more skills that are a part of basic knowledge worker literacy, especially if some members of your team are remote.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Sun, 09 Dec 2012 14:29:40 -0000</pubDate></item><item><title>Re: Explorations in Unix</title><link>http://www.drbunsen.org/explorations-in-unix.html#comment-726127619</link><description>&lt;p&gt;You might also like &lt;a href="http://www.dynamicalsoftware.com/analytics/selection" rel="nofollow noopener" target="_blank" title="http://www.dynamicalsoftware.com/analytics/selection"&gt;http://www.dynamicalsoftwar...&lt;/a&gt; which goes into many unix commands (such as grep, cut, join, paste, awk) to reshape application log file data into something that can be analyzed and graphed.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Mon, 03 Dec 2012 11:17:42 -0000</pubDate></item><item><title>Re: Big Data: What Do You Think It Is?</title><link>http://www.readwriteweb.com/cloud/2012/07/big-data-what-do-you-think-it-is.php#comment-601026913</link><description>&lt;p&gt;I work at a start up who owns the world's largest romantic social network. Like many start ups, big data means the classic three Vs of volume, velocity, and variety. Unlike other start ups, big data (especially social big data) does not mean hadoop. We used &lt;a href="http://www.dynamicalsoftware.com/nosql/solr" rel="nofollow noopener" target="_blank" title="http://www.dynamicalsoftware.com/nosql/solr"&gt;http://www.dynamicalsoftwar...&lt;/a&gt; to implement our social big data app.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Sat, 28 Jul 2012 15:35:39 -0000</pubDate></item><item><title>Re: How to Prevent Blogger Burnout: 3 Good Tips</title><link>http://www.hellobloggerz.com/blogging/blogger-burnout#comment-508520680</link><description>&lt;p&gt;Good advice, especially the blogging tools. It is still incubating but there is a personal assistant called opportunity eye that helps bloggers discover new topics to blog about that their users have shown interest in. Knowing what will be a hit is another way to prevent blogger burn out.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Wed, 25 Apr 2012 02:46:07 -0000</pubDate></item><item><title>Re: 4 Blogger Tools for Breaking Your Writing Block</title><link>http://www.socialmediaexaminer.com/4-blogger-tools-for-breaking-your-writing-block/#comment-506416922</link><description>&lt;p&gt;Keyword based suggestions are fine for a start but, to get better quality topics to blog about, you should get hints based on what your real readers are interested in. That is why you might be interested in the opportunity eye which is a project where your blog readers get to discover related opportunities in a widget surfaced on your blog site. Users are allowed to follow, discuss, or visit opportunities related to the blog topic. Engagement metrics are tracked and published for the blogger to explore.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Sun, 22 Apr 2012 18:37:25 -0000</pubDate></item><item><title>Re: Hippo Integration with PugPig for iPad Magazine Publishing | CMS Report</title><link>https://www.socpub.com/articles/hippo-integration-pugpig-ipad-magazine-publishing-14931#comment-458489119</link><description>&lt;p&gt;I can corroborate that Hippo CMS is easy to integrate with. During my work on &lt;a href="http://www.dynamicalsoftware.com/convocontent/ccm.html" rel="nofollow noopener" target="_blank" title="http://www.dynamicalsoftware.com/convocontent/ccm.html"&gt;http://www.dynamicalsoftwar...&lt;/a&gt; I was easily able to integrate Hippo CMS with Real-Time Communications technology. Congratulations to those folks for taking the time to do JCR-170 the right way.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Wed, 07 Mar 2012 01:32:26 -0000</pubDate></item><item><title>Re: Alfresco 4.0 to Upgrade CMS for the Cloud</title><link>http://cloudtimes.org/2012/02/05/alfresco-4-0-to-upgrade-cms-for-the-cloud/#comment-458485905</link><description>&lt;p&gt;I was working with Alfresco 3.4 on EC2 during my time with &lt;a href="http://www.dynamicalsoftware.com/convocontent/ccm.html" rel="nofollow noopener" target="_blank" title="http://www.dynamicalsoftware.com/convocontent/ccm.html"&gt;http://www.dynamicalsoftwar...&lt;/a&gt; and I can't say that I ran into any cloud related problems.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Wed, 07 Mar 2012 01:24:31 -0000</pubDate></item><item><title>Re: comSysto and Hippo Team Up To Focus On The Customer</title><link>https://www.socpub.com/articles/comsysto-and-hippo-team-up-to-focus-on-the-customer-3362#comment-456154052</link><description>&lt;p&gt;Hippo CMS has great support for standards compliant inter-operability which puts it in a great position for attracting Enterprise customers. For example, &lt;a href="http://www.dynamicalsoftware.com/convocontent/ccm.html" rel="nofollow noopener" target="_blank" title="http://www.dynamicalsoftware.com/convocontent/ccm.html"&gt;http://www.dynamicalsoftwar...&lt;/a&gt; is a beta where group meetings get summarized and published on a Hippo CMS automatically. That was possible only because of Hippo's commitment to JCR.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Sun, 04 Mar 2012 14:41:39 -0000</pubDate></item><item><title>Re: Alfresco Makes its CMS More Social</title><link>http://www.readwriteweb.com/cloud/2012/02/alfresco-makes-its-cms-more-so.php#comment-456150335</link><description>&lt;p&gt;I like Alfresco. It's standards based approach to integration has always allowed third parties such as &lt;a href="http://www.dynamicalsoftware.com/convocontent/ccm.html" rel="nofollow noopener" target="_blank" title="http://www.dynamicalsoftware.com/convocontent/ccm.html"&gt;http://www.dynamicalsoftwar...&lt;/a&gt; to enhance Alfresco in ways social and otherwise.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Sun, 04 Mar 2012 14:37:11 -0000</pubDate></item><item><title>Re: One language everywhere</title><link>http://codebetter.com/drusellers/2011/10/20/one-language-everywhere/#comment-340142102</link><description>&lt;p&gt;I believe this approach to be fundamentally flawed but, if you must, then take a look at clojure.&lt;/p&gt;&lt;p&gt;On a tangent, the difference between developers and engineers is in attitude. A developer believes in the swiss army knife approach to toolsets and wants to show the world how clever he is that he can make his favorite coding language do things way beyond its original scope. An engineer believes in the right tool for the job. Engineers don't identify with any single programming language. Instead, they pride themselves on knowing how to code in lots of different environments and API stacks.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Thu, 20 Oct 2011 15:49:56 -0000</pubDate></item><item><title>Re: Unlocking Big Data with R</title><link>http://www.readwriteweb.com/hack/2011/09/unlocking-big-data-with-r.php#comment-305696840</link><description>&lt;p&gt;R is great for engineers and developers who need to analyze big data. Accountants will still want to use OLAP and Excel. If you are more into the data mining part, then take a look at weka. I go into more detail on this at &lt;a href="http://www.dynamicalsoftware.com/analytics/graphing" rel="nofollow noopener" target="_blank" title="http://www.dynamicalsoftware.com/analytics/graphing"&gt;http://www.dynamicalsoftwar...&lt;/a&gt; including a short introduction to loading data and generating graphs using R.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Fri, 09 Sep 2011 01:20:46 -0000</pubDate></item><item><title>Re: Don&amp;#8217;t Let Your Babies Grow Up to be Knowledge Workers</title><link>http://andrewmcafee.org/2011/03/mcafee-india-jobs-skills-outsourcing-technology-digitization/#comment-171443581</link><description>&lt;p&gt;I've been on the other side of this equation as a technical lead in several small organisations who decided to offshore some of their software development responsibilities. Inevitably, the quality of what gets delivered is over promised and under delivered. I attribute the lack to the wide gap in time zone and culture and in the high turn-over that is characteristic of Indian developer shops. Here's the dirty little secret to software development. Engineering quality is the difference between success and failure. &lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Fri, 25 Mar 2011 01:07:41 -0000</pubDate></item><item><title>Re: Unlocking the Mayor Badge of Meaninglessness</title><link>http://blogs.hbr.org/haque/2010/12/unlocking_the_mayor_badge_of_m.html#comment-114858675</link><description>&lt;p&gt;Very relevant article. It reminds me of stack overflow where programmers share advice. There is an elaborate reputation system at play over at that franchise. At first, I found their badges to be a motivation but now I find the whole thing to be tainted. I now question the validity of the advice as posters play to the perceived predominant paradigm of the group in order to up their stats. &lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Sun, 19 Dec 2010 14:43:33 -0000</pubDate></item><item><title>Re: You’ve Got FMail</title><link>http://techcrunch.com/2010/11/13/youve-got-fmail/#comment-96958258</link><description>&lt;p&gt;Well, I'm not surprised. Google and Facebook already compete on Instant Messaging (did you know that you can chat with your facebook friends via XMPP?) and they compete on activity streams (Google Buzz vs Facebook Open Stream) so it only stands to reason that email would be next. Maybe Facebook will cook up a collaborative office suite too.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Sat, 13 Nov 2010 17:22:21 -0000</pubDate></item><item><title>Re: A Beginner&amp;#8217;s Guide to Integrated Development Environments</title><link>http://mashable.com/2010/10/06/ide-guide/#comment-88429855</link><description>&lt;p&gt;I cast my vote for Eclipse. If you want to learn more tips and tricks about Eclipse plugins and plugin management, then check out &lt;a href="http://www.dynamicalsoftware.com/intro/plugins/eclipse" rel="nofollow noopener" target="_blank" title="http://www.dynamicalsoftware.com/intro/plugins/eclipse"&gt;http://www.dynamicalsoftwar...&lt;/a&gt;&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Wed, 20 Oct 2010 00:42:50 -0000</pubDate></item><item><title>Re: Ubuntu 10.04 Lucid Lynx - A Fresh Look At What Is On Offer</title><link>http://linuxhelp.blogspot.com/2010/04/ubuntu-1004-lucid-lynx-fresh-look-at.html#comment-87499208</link><description>&lt;p&gt;Anyone here know when Lucid Lynx users will be able to synchronize their evolution contacts to Ubuntu One Cloud? &lt;a href="https://wiki.ubuntu.com/UbuntuOne/Status#Contacts" rel="nofollow noopener" target="_blank" title="https://wiki.ubuntu.com/UbuntuOne/Status#Contacts"&gt;https://wiki.ubuntu.com/Ubu...&lt;/a&gt; indicates that "backporting of the CouchDB and Erlang packages" must happen first.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Sat, 16 Oct 2010 19:53:22 -0000</pubDate></item><item><title>Re: Eclipse Plugins</title><link>http://www.abdulqabiz.com/blog/archives/2010/07/28/eclipse-plugins/#comment-83503183</link><description>&lt;p&gt;Thanks for sharing your list. I would definitely vote up your top three and will check out the Amazon plugin. Here are some more favorite Eclipse plug ins (see &lt;a href="http://www.dynamicalsoftware.com/intro/plugins/eclipse)" rel="nofollow noopener" target="_blank" title="http://www.dynamicalsoftware.com/intro/plugins/eclipse)"&gt;http://www.dynamicalsoftwar...&lt;/a&gt; for you to check out.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Sun, 03 Oct 2010 14:43:45 -0000</pubDate></item><item><title>Re: XMPP servers head-to-head: ejabberd vs Openfire (first impressions)</title><link>http://freeasinbeard.org/post/1192039651#comment-83502706</link><description>&lt;p&gt;If scalability is key, then check out Tigase which is a highly scalable XMPP server written in Java. I have compared Tigase and Openfire (see &lt;a href="http://www.dynamicalsoftware.com/comparison/tigase/openfire)" rel="nofollow noopener" target="_blank" title="http://www.dynamicalsoftware.com/comparison/tigase/openfire)"&gt;http://www.dynamicalsoftwar...&lt;/a&gt; including notes on how to write extensions. Tigase is just as easy to extend as Openfire and offers more ways in which to extend.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Sun, 03 Oct 2010 14:40:11 -0000</pubDate></item><item><title>Re: XMPP becoming common standard for real-time chat applications</title><link>http://www.anas.co.in/2010/09/xmpp-becoming-common-standard-for-real.html#comment-77094761</link><description>&lt;p&gt;Thanks for this interesting introduction. Can you be more specific about the XMPP interface for Facebook?&lt;/p&gt;&lt;p&gt;You mentioned jabber and open source. Both openfire and tigase are two modern open source XMPP server projects that you might be interested int. &lt;a href="http://www.dynamicalsoftware.com/comparison/tigase/openfire" rel="nofollow noopener" target="_blank" title="http://www.dynamicalsoftware.com/comparison/tigase/openfire"&gt;http://www.dynamicalsoftwar...&lt;/a&gt; is an article that compares and contrasts those two technologies.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Sun, 12 Sep 2010 12:35:58 -0000</pubDate></item><item><title>Re: Yammer 2.0 To Launch As A Powerful, Full-Fledged Social Network For The Enterprise</title><link>http://techcrunch.com/2010/09/02/yammer-2-0-to-launch-as-a-powerful-full-fledged-social-network-for-the-enterprise/#comment-75395201</link><description>&lt;p&gt;We use yammer at work. It's great at broadcasting changes (kicking a server, upgrading a package, announcing a new service) to interested parties in the company. An activity stream is a good way to make announcements that are searchable for those who are interested without the heavy digital footprint of email. Yammer makes a lot of sense; however, the web interface does not. It's too much of a distraction. That's why I use Gabble which brings out the best of micro-blogging while filtering out the excessive noise of a full web portal.&lt;/p&gt;&lt;p&gt;These specialized clients are the way to go for real time communications. That's why I use Gwibber for micro-blogs and Empathy for IM. XMPP is for real time what HTTP was for publishing.&lt;/p&gt;&lt;p&gt;RTC is not just for human consumption. Check out &lt;a href="http://it.toolbox.com/blogs/future-of-work/your-next-esb-may-just-be-im-41045" rel="nofollow noopener" target="_blank" title="http://it.toolbox.com/blogs/future-of-work/your-next-esb-may-just-be-im-41045"&gt;http://it.toolbox.com/blogs...&lt;/a&gt; for an example of how these advances help the enterprise in other communication intensive activities.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Glenn Engstrand</dc:creator><pubDate>Sun, 05 Sep 2010 18:53:03 -0000</pubDate></item></channel></rss>