<?xml version="1.0" encoding="utf-8"?>
<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>Disqus - Latest Comments for yaroslavvb</title><link>http://disqus.com/by/yaroslavvb/</link><description></description><atom:link href="http://disqus.com/yaroslavvb/comments.rss" rel="self"></atom:link><language>en</language><lastBuildDate>Fri, 12 Nov 2021 17:39:02 -0000</lastBuildDate><item><title>Re: Matrices as Tensor Network Diagrams</title><link>https://www.math3ma.com/blog/matrices-as-tensor-network-diagrams#comment-5606448558</link><description>&lt;p&gt;BTW, there's could be something special about the tensor network that's not completely captured by looking at the graphical model corresponding to its line graph. There's a fast algorithm for contracting planar tensor networks, no equivalent for graphical models is known -- Jakes-Schauer, J., D. Anekstein, and P. Wocjan. 2019. “Carving-Width and Contraction Trees for Tensor Networks.” arXiv [cs.DM]. arXiv. &lt;a href="https://doi.org/10.1016/j.jpdc.2014.06.002" rel="nofollow noopener" target="_blank" title="https://doi.org/10.1016/j.jpdc.2014.06.002"&gt;https://doi.org/10.1016/j.j...&lt;/a&gt;.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yaroslav Bulatov</dc:creator><pubDate>Fri, 12 Nov 2021 17:39:02 -0000</pubDate></item><item><title>Re: What About Extra Virgin Olive Oil?</title><link>https://nutritionfacts.org/2017/10/17/what-about-extra-virgin-olive-oil/#comment-5403750009</link><description>&lt;p&gt;The first study is interventional so that's good. To summarize, adding "extra virgin olive oil" to Mediterranean diet was slightly better than adding extra nuts, but not statistically significant (57% reduction vs 55% reduction in risk), whereas going for control "low-fat" diet was 9% increase in risk. TLDR; replacing some nuts with a tea-spoon of extra-virgin olive oil doesn't seem to impact CVD mortality whereas going from "low-fat" to Mediterranean diet has a huge effect&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yaroslav Bulatov</dc:creator><pubDate>Mon, 31 May 2021 19:53:38 -0000</pubDate></item><item><title>Re: 
        IGraph/M: a Mathematica interface for igraph</title><link>http://szhorvat.net/pelican/igraphm-a-mathematica-interface-for-igraph.html#comment-4739272371</link><description>&lt;p&gt;I see, btw is there anything relating to treewidth in igraph? (ie, finding tree decomposition)&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yaroslav Bulatov</dc:creator><pubDate>Sun, 29 Dec 2019 22:31:14 -0000</pubDate></item><item><title>Re: 
        IGraph/M: a Mathematica interface for igraph</title><link>http://szhorvat.net/pelican/igraphm-a-mathematica-interface-for-igraph.html#comment-4738821204</link><description>&lt;p&gt;Just tried and it worked out of the box, great to see it's being maintained. One minor nit, the initial message prints "It can now be loaded using the command Get["IGraphM`"]". If I copy paste this, I get "Get[\" IGraphM` \"]". Maybe this last line could be emitted as a code cell that user could run directly&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yaroslav Bulatov</dc:creator><pubDate>Sun, 29 Dec 2019 12:51:49 -0000</pubDate></item><item><title>Re: Distributed TensorFlow - A Gentle Introduction</title><link>http://localhost:4000/distributed-tensorflow-a-gentle-introduction#comment-3635261336</link><description>&lt;p&gt;Nice overview!&lt;/p&gt;&lt;p&gt;Regarding worker leaving, it should be no problem if the worker permanently leaves AS LONG as the other worker doesn't restart. If it does restart, the first &lt;a href="http://session.run" rel="nofollow noopener" target="_blank" title="session.run"&gt;session.run&lt;/a&gt; call will hang since it sets up the cluster and needs all the workers to be available. The solution to this is to use "sparse job config" -- use dictionary of worker-&amp;gt;ip mapping for necessary workers only. This way any worker not in this list can be down without affecting current worker. In a Parameter Server environment, workers don't need to know about other workers.&lt;/p&gt;&lt;p&gt;For failure tolerance, it's a bit annoying, but you have to recreate session each time there's any error and wait until things are OK (session created successfully and tf.report_unininitialized_variables gives empty list). So if a parameter server restarts, this causes &lt;a href="http://session.run" rel="nofollow noopener" target="_blank" title="session.run"&gt;session.run&lt;/a&gt; to crash in all the workers which go into the waiting loop. The chief worker has a similar loop, except it only tries to create session and then call initialization op. Eventually initialization op succeeds, workers stop waiting and training continues. I have a simpler that implements failure robustness for a set of workers adding 1's to a central parameter server here -- &lt;a href="https://github.com/diux-dev/cluster/commit/0ba0587452ea9ee02e52cfbf02e08a126058dfd6#diff-eac03bf307e32a2ede706d76d8487120" rel="nofollow noopener" target="_blank" title="https://github.com/diux-dev/cluster/commit/0ba0587452ea9ee02e52cfbf02e08a126058dfd6#diff-eac03bf307e32a2ede706d76d8487120"&gt;https://github.com/diux-dev...&lt;/a&gt;&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yaroslav Bulatov</dc:creator><pubDate>Mon, 27 Nov 2017 14:24:30 -0000</pubDate></item><item><title>Re: Tensorflow I Love You, But You're Bringing Me Down</title><link>http://blog.nateharada.com/tensorflow-i-love-you-but#comment-3353977109</link><description>&lt;p&gt;GraphDef issue is similar to issue of "compiled vs interpreted". Compiled programs run faster at the expense of being harder to debug and longer iteration cycle. You need GraphDef in order to be able to optimize the program. But a lot of engineering work is needed to bring the ease of use back in. I'm not sure Google is best-positioned to make a good high-level neural net library. Applications are somewhat different and incentives aren't there. I've seen TensorPack gaining in popularity and it's not made by Google&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yaroslav Bulatov</dc:creator><pubDate>Sun, 11 Jun 2017 14:50:49 -0000</pubDate></item><item><title>Re: Fisher Information and the Hessian of Log Likelihood</title><link>http://mark.reid.name/blog/fisher-information-and-log-likelihood.html#comment-3169876525</link><description>&lt;p&gt;BTW, there's a mistake in derivation, which gets fixed by another mistake in the last line, when you use product rule to compute derivative, you use Di,jpθ(x)/p(x) for the first term, but it should be Di,jp(x) . (ps, this was the first result I found when searching for derivation of the connection, by searching for fisher hessian)&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yaroslav Bulatov</dc:creator><pubDate>Wed, 22 Feb 2017 16:36:21 -0000</pubDate></item><item><title>Re: Calculus on Computational Graphs: Backpropagation</title><link>http://colah.github.io/posts/2015-08-Backprop/#comment-2638815003</link><description>&lt;p&gt;There's another cool algebraic view: for f(g(h(...))) the derivative is F*G*H where * is matmul and F,G,H are Jacobian matrices. If you have many inputs and one output, f is R^n-&amp;gt;R^1, then your last matrix is skinny and tall, then Matrix Chain Multiplication solution tells you to do (F G)H, which is reverse mode AD. But if you have many outputs and one input, your H is wide and short, so most efficient is to do F(G H) which is forward mode AD. But also there are cases where neither forward nor reverse mode AD are the most efficient, and those are the "other" solutions of the MCM problem&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yaroslav Bulatov</dc:creator><pubDate>Sat, 23 Apr 2016 11:12:49 -0000</pubDate></item><item><title>Re: New Hack: CPU/Memory process monitor for Google Chrome</title><link>http://insomanic.me.uk/post/52026854106#comment-1566770329</link><description>&lt;p&gt;Doesn't seem to work on the latest from Stable channel, stuck at "Loading"&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yaroslav Bulatov</dc:creator><pubDate>Sat, 30 Aug 2014 17:30:48 -0000</pubDate></item><item><title>Re: Updated List of Datasets &amp; Video Lectures</title><link>http://www.datawrangling.com/updated-list-of-datasets-video-lectures#comment-14008535</link><description>&lt;p&gt;Hey, I've just put together another digit OCR dataset. It's 20k digit crops taken from natural scene photographs, and I believe this dataset is more challenging than MNIST &lt;a href="http://yaroslavvb.blogspot.com/2009/08/new-robust-ocr-dataset.html" rel="nofollow noopener" target="_blank" title="http://yaroslavvb.blogspot.com/2009/08/new-robust-ocr-dataset.html"&gt;http://yaroslavvb.blogspot....&lt;/a&gt;&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yaroslav Bulatov</dc:creator><pubDate>Wed, 05 Aug 2009 19:16:45 -0000</pubDate></item></channel></rss>