<?xml version="1.0" encoding="utf-8"?>
<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>Disqus - Latest Comments for dtunkelang</title><link>http://disqus.com/by/dtunkelang/</link><description></description><atom:link href="http://disqus.com/dtunkelang/comments.rss" rel="self"></atom:link><language>en</language><lastBuildDate>Tue, 13 Aug 2019 17:57:33 -0000</lastBuildDate><item><title>Re: Asking the Right Questions: Query Expansion in Enterprise Search</title><link>https://www.cmswire.com/information-management/asking-the-right-questions-query-expansion-in-enterprise-search/#comment-4577338409</link><description>&lt;p&gt;Martin, as always I appreciate your posts. And I have great respect for Liz and Tony. As for our differences on the uses of query expansion, I think we may have different applications in mind. I'm specifically focused on using it to increase recall, rather than to increase precision. A lot of IR folks focus on the latter, e.g., query expansion using pseudo-relevance feedback. So I'm not sure we disagree -- we may have just been talking past each other.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Tue, 13 Aug 2019 17:57:33 -0000</pubDate></item><item><title>Re: 5 Things to Know About A/B Testing</title><link>https://www.kdnuggets.com/2018/09/5-things-know-about-ab-testing.html#comment-4088916384</link><description>&lt;p&gt;Nice post! For those specifically interested in applying A/B testing to search, I encourage you to read my post on the subject: &lt;a href="https://medium.com/@dtunkelang/a-b-testing-for-search-is-different-f6b0f6f4d0f5" rel="nofollow noopener" target="_blank" title="https://medium.com/@dtunkelang/a-b-testing-for-search-is-different-f6b0f6f4d0f5"&gt;https://medium.com/@dtunkel...&lt;/a&gt;&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Tue, 11 Sep 2018 02:19:05 -0000</pubDate></item><item><title>Re: 8 Useful Advices for Aspiring Data Scientists</title><link>https://www.kdnuggets.com/2018/05/8-useful-advices-aspiring-data-scientists.html#comment-3917280085</link><description>&lt;p&gt;My above advice feels pretty dated. Today, an aspiring data scientist should be learning Python, Scala, and Spark.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Fri, 25 May 2018 16:38:43 -0000</pubDate></item><item><title>Re: How To Do Startup Technical Due Diligence – code.dblock.org | tech blog</title><link>http://code.dblock.org/2017/10/29/how-to-do-startup-technical-due-diligence.html#comment-3591715533</link><description>&lt;p&gt;Great post! One question I don't believe you addressed: what do you feel is the required specificity of expertise on the part of the person doing due diligence? For example, do you need an AI expert to evaluate an AI startup? Given the scarcity of people who are qualified and available to perform technical due diligence in general, adding specific expertise as a requirement can harshly limit the options or make the process significantly more expensive. But sometimes it seems necessary.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Mon, 30 Oct 2017 11:20:21 -0000</pubDate></item><item><title>Re: What We Learned Analyzing Hundreds of Data Science Interviews</title><link>https://www.springboard.com/blog/data-science-interviews-lessons/#comment-2841099077</link><description>&lt;p&gt;Thanks Roger!&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Tue, 16 Aug 2016 17:50:11 -0000</pubDate></item><item><title>Re: What We Learned Analyzing Hundreds of Data Science Interviews</title><link>https://www.springboard.com/blog/data-science-interviews-lessons/#comment-2835884709</link><description>&lt;p&gt;Interesting post. But your information about LinkedIn is a bit dated. I only ran part of the data science team -- the folks with more of an engineering bent who built data products -- and I stopped doing that over 3 years ago. The company has had a few re-orgs since then, cf. &lt;a href="http://venturebeat.com/2014/10/31/linkedin-data-science-team/" rel="nofollow noopener" target="_blank" title="http://venturebeat.com/2014/10/31/linkedin-data-science-team/"&gt;http://venturebeat.com/2014...&lt;/a&gt;&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Sat, 13 Aug 2016 14:45:16 -0000</pubDate></item><item><title>Re: Is It Time For Google To Rank Paid News Content Better?</title><link>http://searchengineland.com/time-for-google-to-rank-paid-news-content-better-220918#comment-2030412364</link><description>&lt;p&gt;Interesting idea. But I think you're eliding over a big differences between news and music / videos. News has a high degree of fungibility -- if I can't read an article that's behind a paywall, it's likely that I can legally read something almost as good (yes, I realize that's a subjective measure) that isn't behind a paywall. For music and video, there's no comparable legal substitute unless the rights owners opt for one.  Your proposed solution may work, but I don't think it's obvious that what works for music / video will generalize to news.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Sun, 17 May 2015 13:48:05 -0000</pubDate></item><item><title>Re: The Case for More Women in Data Science</title><link>http://www.wired.com?p=1599267&amp;preview_id=1599267#comment-1644639728</link><description>&lt;p&gt;Claudia, thanks for having the courage to publish a piece like this -- I know it's not your usual publication material.&lt;/p&gt;&lt;p&gt;To those of you not familiar with Claudia's work, I encourage you to look at her papers in top-tier computer science journals and conferences:&lt;br&gt;&lt;a href="http://scholar.google.com/citations?user=DA-m1cYAAAAJ&amp;amp;hl=en" rel="nofollow noopener" target="_blank" title="http://scholar.google.com/citations?user=DA-m1cYAAAAJ&amp;amp;hl=en"&gt;http://scholar.google.com/c...&lt;/a&gt;&lt;/p&gt;&lt;p&gt;Or, if that's too technical for you, at her more accessible material on SlideShare:&lt;br&gt;&lt;a href="http://www.slideshare.net/SessionsEvents/mlconf-nyc-claudia-perlich" rel="nofollow noopener" target="_blank" title="http://www.slideshare.net/SessionsEvents/mlconf-nyc-claudia-perlich"&gt;http://www.slideshare.net/S...&lt;/a&gt;&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Mon, 20 Oct 2014 09:56:34 -0000</pubDate></item><item><title>Re: The Myths &amp;#038; Realities Of The EU’s New “Right To Be Forgotten” In Google Works</title><link>http://searchengineland.com/myths-realities-eus-new-right-forgotten-google-works-191604#comment-1390722071</link><description>&lt;p&gt;Outstanding and thorough explanation, Danny. That said, it only reinforces my outrage at the European Court of Justice for taking such an irrational approach to suppress the freedom of expression under the guise of protecting privacy. I'm no Google fanboy, but in this case the company has been acting as a voice of reason and sanity against governments that disgracefully haven't learned that history cannot and should not be forgotten.  Not to mention the inconsistency of going after the search engines but not the original publishers. Ah well, let's see them try to enforce this madness.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Sat, 17 May 2014 10:47:37 -0000</pubDate></item><item><title>Re: Google is researching ways to make encryption easier to use in Gmail | VentureBeat | Business | by Harrison Weber</title><link>http://venturebeat.com/2014/04/21/google-is-researching-ways-to-make-encryption-easier-to-use-in-gmail/#comment-1352243521</link><description>&lt;p&gt;Perhaps I wasn't clear. I'm talking about processing encrypted email in the browser / app client in a way that retrieves targeted ads but doesn't share the unencrypted email back to Google at all. At most it would share a few keywords or some kind of vector that minimizes disclosure of the email content. This is technically feasible -- but I have no idea if Google or anyone else is doing this. Which is why I'm curious to see a reference for Bob Bigellow's statement that Gmail ads already work client-side.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Wed, 23 Apr 2014 14:42:30 -0000</pubDate></item><item><title>Re: Google is researching ways to make encryption easier to use in Gmail | VentureBeat | Business | by Harrison Weber</title><link>http://venturebeat.com/2014/04/21/google-is-researching-ways-to-make-encryption-easier-to-use-in-gmail/#comment-1349219825</link><description>&lt;p&gt;Interesting, I didn't know that. Could you point me to a reference?&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Mon, 21 Apr 2014 15:45:39 -0000</pubDate></item><item><title>Re: Google is researching ways to make encryption easier to use in Gmail | VentureBeat | Business | by Harrison Weber</title><link>http://venturebeat.com/2014/04/21/google-is-researching-ways-to-make-encryption-easier-to-use-in-gmail/#comment-1348990677</link><description>&lt;p&gt;Actually, Google -- or any other ad-supported mail provider -- could implement ad targeting in a way that processes the page client-side, without ever storing the clear text of the emails on its servers. Probably a lot more complex, but that may ultimately be the price of secure messaging. Unless consumers decide that security is worth paying for.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Mon, 21 Apr 2014 13:06:15 -0000</pubDate></item><item><title>Re: Forgotify: The Tool for Discovering Spotify&amp;#039;s 4 Million Unheard Tracks</title><link>http://www.theatlantic.com/technology/archive/2014/01/forgotify-the-tool-for-discovering-spotifys-4-million-unheard-tracks/283484/#comment-1228454102</link><description>&lt;p&gt;Interesting -- trying it out now. Any idea how much of the song do you have to listen to for it to qualify as "played"?&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Sun, 02 Feb 2014 22:55:10 -0000</pubDate></item><item><title>Re: The Bay Area’s 1 Percenters    </title><link>http://www.nationalreview.com/article/361828/bay-areas-1-percenters-victor-davis-hanson#comment-1098802115</link><description>&lt;p&gt;Let's try to move beyond personal experience and bring some data to the conversations.&lt;/p&gt;&lt;p&gt;Palo Alto public high schools have class sizes above the state average:&lt;/p&gt;&lt;p&gt;&lt;a href="http://paloalto.patch.com/groups/schools/p/crowded-compare-class-sizes-at-silicon-valley-schools" rel="nofollow noopener" target="_blank" title="http://paloalto.patch.com/groups/schools/p/crowded-compare-class-sizes-at-silicon-valley-schools"&gt;http://paloalto.patch.com/g...&lt;/a&gt;&lt;/p&gt;&lt;p&gt;In Mountain View, overcrowding has become enough of a concern at a local public elementary school to motivate a policy study on alleviating it:&lt;/p&gt;&lt;p&gt;&lt;a href="http://publicpolicy.stanford.edu/system/files/shared/Alleviating_Overcrowding_at_Castro_2.pdf" rel="nofollow noopener" target="_blank" title="http://publicpolicy.stanford.edu/system/files/shared/Alleviating_Overcrowding_at_Castro_2.pdf"&gt;http://publicpolicy.stanfor...&lt;/a&gt;&lt;/p&gt;&lt;p&gt;Is it possible that some of the "stampede to enroll students in upscale private academies" is simply overflow from overcrowded public schools in an area that has been struggling to manage an increase in the student population?&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Sun, 27 Oct 2013 17:21:52 -0000</pubDate></item><item><title>Re: The Bay Area’s 1 Percenters    </title><link>http://www.nationalreview.com/article/361828/bay-areas-1-percenters-victor-davis-hanson#comment-1098765099</link><description>&lt;p&gt;I think it's pretty accurate to say that Hanson is pretending that people like me don't exist by painting a one-sided picture. That's his prerogative, much as it's mine to call him on it.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Sun, 27 Oct 2013 16:37:29 -0000</pubDate></item><item><title>Re: The Bay Area’s 1 Percenters    </title><link>http://www.nationalreview.com/article/361828/bay-areas-1-percenters-victor-davis-hanson#comment-1096593664</link><description>&lt;p&gt;And here is someone who has responded to Vic's claims with some data. It's pretty opinionated, but so is Vic. In any case, everyone is entitled to their own opinions, but not their own facts.&lt;/p&gt;&lt;p&gt;&lt;a href="http://www.cjr.org/the_audit/silicon_valley_regressive_prog.php" rel="nofollow noopener" target="_blank" title="http://www.cjr.org/the_audit/silicon_valley_regressive_prog.php"&gt;http://www.cjr.org/the_audi...&lt;/a&gt;&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Fri, 25 Oct 2013 13:23:44 -0000</pubDate></item><item><title>Re: The Bay Area’s 1 Percenters    </title><link>http://www.nationalreview.com/article/361828/bay-areas-1-percenters-victor-davis-hanson#comment-1095029334</link><description>&lt;p&gt;Vic doesn't offer data or even define a "stampede", other than implying that he talks with friends who live in Silicon Valley. So it's hard to evaluate the truth of his unfalsifiable and unsubstantiated claim.&lt;/p&gt;&lt;p&gt;What I do know is that that the hot real estate market on those areas is driven in significant part by "1 percenters" wanting their kids to be able to go to those local public schools -- a big part of property value comes from the associated school district. That argues more for a stampede into the local public schools than out of them.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Thu, 24 Oct 2013 09:17:05 -0000</pubDate></item><item><title>Re: The Bay Area’s 1 Percenters    </title><link>http://www.nationalreview.com/article/361828/bay-areas-1-percenters-victor-davis-hanson#comment-1094667973</link><description>&lt;p&gt;Victor, I have to wonder if your preconceptions are leading you to only see what you want to see. My wife and I, like many Silicon Valley parents, send our child to public school even though we can afford to send her to private school. We deliberately chose a school that was majority Hispanic. Partly because we want her to learn Spanish (it's a dual-immersion program), but also because we want her to get to know people with different backgrounds. Pretending that people like us don't exist makes me wonder if you're even trying to make an objective analysis. Don't caricature people just because you disagree with them politically -- it's petty.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Thu, 24 Oct 2013 00:42:15 -0000</pubDate></item><item><title>Re: What Mugshots Mean For Public Data</title><link>http://www.hilarymason.com/blog/what-mugshots-mean-for-public-data/#comment-1072746009</link><description>&lt;p&gt;This debate only reinforces what Jim Adler has been preaching for a while: we need to regulate data use rather than data access.&lt;/p&gt;&lt;p&gt;&lt;a href="http://jimadler.me/post/26615240224/is-big-data-a-thoughtcrime" rel="nofollow noopener" target="_blank" title="http://jimadler.me/post/26615240224/is-big-data-a-thoughtcrime"&gt;http://jimadler.me/post/266...&lt;/a&gt;&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Sun, 06 Oct 2013 15:23:29 -0000</pubDate></item><item><title>Re: a16z</title><link>http://cdixon.org/2012/11/19/a16z/#comment-714502985</link><description>&lt;p&gt;Congrats Chris! Looking forward to seeing you here in the Bay Area. And congrats to a16z for landing you!&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Mon, 19 Nov 2012 22:31:16 -0000</pubDate></item><item><title>Re: Hubris and the Data Scientist</title><link>https://cloudofdata.com/2012/03/hubris-and-the-data-scientist/#comment-457253866</link><description>&lt;p&gt;Paul, I wish the discussion had been recorded. As I wrote in my blog post, the question proposed was absurdly Manichean: if you had to hire your first data scientist and could only hire one, would you pick a domain expert or a machine learning expert? Most of the room disagreed with the premise of the question, but the debaters made the most of it by taking extreme positions and defending them with gusto. It was a lot of fun, with enthusiastic audience participation and the debaters exploiting their inside knowledge of their opponents’ work histories.&lt;/p&gt;&lt;p&gt;I'm not denying that we data scientists have to watch out for hubris. But the community is not as extreme as to not find a place for domain expertise. In fact, much of our work should help to both objectively validate and maximally benefit from domain expertise.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Mon, 05 Mar 2012 17:52:57 -0000</pubDate></item><item><title>Re: Strata 2012: Is Privacy a Big Data Prison?</title><link>http://jimadler.me/post/18618791545#comment-455553908</link><description>&lt;p&gt;Loved your session, even if it was way too short. Eager to pick up on the question of whether it makes sense to consider inference illegal or unethical and, if so, how we don't devolve into creating a class of thoughtcrime -- which, ironically, would be the ultimate government invasion of privacy.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Sat, 03 Mar 2012 16:51:29 -0000</pubDate></item><item><title>Re: Don't write on the whiteboard by Joseph Perla</title><link>http://www.jperla.com/blog/post/don-t-write-on-the-whiteboard#comment-407410325</link><description>&lt;p&gt;Joseph, thanks for all the traffic this morning! :-)&lt;/p&gt;&lt;p&gt;I hope this is what you mean by a "Dan Tunkelang type problem":&lt;/p&gt;&lt;p&gt;1) It is a real problem that has come up in the couse of developing production software.&lt;/p&gt;&lt;p&gt;2)  It does not require any specialized knowledge — just strings, sets, maps, recursion, and other basics that are covered in a first- or second-year undergraduate course in computer science.&lt;/p&gt;&lt;p&gt;3) The code is non-trivial but compact enough to use under the tight conditions of a 45-minute interview, whether in person or over the phone using a tool like Collabedit.&lt;/p&gt;&lt;p&gt;4)  The problem is challenging, but it isn’t a gotcha problem. Rather, it requires a methodical analysis of the problem and the application of basic computer science tools.&lt;/p&gt;&lt;p&gt;5) The candidate’s performance on the problem isn’t binary. The worst candidates don’t even manage to implement the fizzbuzz solution in 45 minutes. The best implement a great solution in 10 minutes, allowing you to make the problem even more interesting. Most candidates perform somewhere in the middle.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Tue, 10 Jan 2012 16:45:22 -0000</pubDate></item><item><title>Re: Duck Duck Go</title><link>http://www.usv.com/posts/duck-duck-go#comment-333647031</link><description>&lt;p&gt;Brad, congrats to you and to Gabe! Was an early user of DDG -&lt;a href="http://thenoisychannel.com/2008/10/16/duck-duck-go/" rel="nofollow noopener" target="_blank" title="http://thenoisychannel.com/2008/10/16/duck-duck-go/"&gt;http://thenoisychannel.com/...&lt;/a&gt; - and happy to see folks trying to make web search better. It's an uphill battle, but I'm sure you all know that first-hand!&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Thu, 13 Oct 2011 11:15:32 -0000</pubDate></item><item><title>Re: Whither TechCrunch?</title><link>http://avc.com/2011/09/whither-techcrunch/#comment-305015689</link><description>&lt;p&gt;As my friend Rob Gonzalez says, structured data repositories are a public good that no one is ever willing to pay for. Freebase has always sounded great in theory, but has never lived up to its promise because it can't deliver on coverage.&lt;/p&gt;&lt;p&gt;&lt;a href="http://thenoisychannel.com/2011/05/15/in-search-of-structure/" rel="nofollow noopener" target="_blank" title="http://thenoisychannel.com/2011/05/15/in-search-of-structure/"&gt;http://thenoisychannel.com/...&lt;/a&gt;&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Daniel Tunkelang</dc:creator><pubDate>Thu, 08 Sep 2011 11:03:23 -0000</pubDate></item></channel></rss>