http://compbio.fmph.uniba.sk/vyuka/mbi/index.php?title=CI05&feed=atom&action=history
CI05 - Revision history
2024-03-28T15:43:09Z
Revision history for this page on the wiki
MediaWiki 1.23.3
http://compbio.fmph.uniba.sk/vyuka/mbi/index.php?title=CI05&diff=2966&oldid=prev
Brona: /* Approximating Jaccard similarity: MinHash */
2023-10-20T10:10:44Z
<p><span dir="auto"><span class="autocomment">Approximating Jaccard similarity: MinHash</span></span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr style='vertical-align: top;'>
<td colspan='2' style="background-color: white; color:black; text-align: center;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black; text-align: center;">Revision as of 10:10, 20 October 2023</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 175:</td>
<td colspan="2" class="diff-lineno">Line 175:</td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* For each hash function we can also build a dictionary that maps a minHash value to a list of documents, and then compare only those pairs of documents that ended up in the same list somewhere (i.e. their estimate of ''J''(''A'',''B'') is nonzero)</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* For each hash function we can also build a dictionary that maps a minHash value to a list of documents, and then compare only those pairs of documents that ended up in the same list somewhere (i.e. their estimate of ''J''(''A'',''B'') is nonzero)</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>Alternative: instead of ''s'' different functions we use just one and take not only the minimum but the ''s'' smallest hash values. We then estimate ''J''(''A'',''B'') by <math><del class="diffchange diffchange-inline">J(</del>S_A<del class="diffchange diffchange-inline">, </del>S_B<del class="diffchange diffchange-inline">)</del></math>, where <math><del class="diffchange diffchange-inline">s_A</del></math> is the set of values in the sketch of the set <math>A</math>. This saves time when computing the sketch.</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>Alternative: instead of ''s'' different functions we use just one and take not only the minimum but the ''s'' smallest hash values. We then estimate ''J''(''A'',''B'') by <math><ins class="diffchange diffchange-inline">|</ins>S_A<ins class="diffchange diffchange-inline">\cap </ins>S_B<ins class="diffchange diffchange-inline">|/s</ins></math>, where <math><ins class="diffchange diffchange-inline">S_A</ins></math> is the set of values in the sketch of the set <math>A</math>. This saves time when computing the sketch<ins class="diffchange diffchange-inline">, because we do not have to hash every element ''s'' times</ins>.</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* Broder AZ. On the resemblance and containment of documents<del class="diffchange diffchange-inline">. InProceedings</del>. Compression and Complexity of SEQUENCES 1997 <del class="diffchange diffchange-inline">(Cat. No. 97TB100171) 1997 Jun 13 </del>(pp. 21-29). IEEE. [https://www.cs.princeton.edu/courses/archive/spring13/cos598C/broder97resemblance.pdf]</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* Broder AZ. On the resemblance and containment of documents. Compression and Complexity of SEQUENCES 1997 (pp. 21-29). IEEE. [https://www.cs.princeton.edu/courses/archive/spring13/cos598C/broder97resemblance.pdf]</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>=== Finding similar sequences ===</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>=== Finding similar sequences ===</div></td></tr>
</table>
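The single-hash-function alternative in the revision above can be sketched in Python as follows. This is only an illustrative sketch: the function names, the use of a SHA-1 prefix as the hash function, and the use of k-mers as set elements are assumptions made for this example and do not come from the page itself. The estimator is the one stated in the text, |S_A ∩ S_B| / s.

```python
import hashlib
import heapq


def kmers(text, k):
    """Return the set of all substrings of length k (assumed set representation)."""
    return {text[i:i + k] for i in range(len(text) - k + 1)}


def hash_value(item):
    """Deterministic 64-bit hash of a string (assumed choice: first 8 bytes of SHA-1)."""
    digest = hashlib.sha1(item.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def bottom_s_sketch(items, s):
    """Hash every element once and keep the s smallest distinct hash values."""
    return set(heapq.nsmallest(s, {hash_value(x) for x in items}))


def estimate_jaccard(sketch_a, sketch_b, s):
    """Estimate J(A,B) as |S_A ∩ S_B| / s, the formula given in the text."""
    return len(sketch_a & sketch_b) / s


def exact_jaccard(a, b):
    """Exact Jaccard similarity, for comparison with the estimate."""
    return len(a & b) / len(a | b)


if __name__ == "__main__":
    a = kmers("AGTGGCTGCCAGGCTGG", 3)
    b = kmers("CGAGGCTGCCTGGTTGG", 3)
    s = 5
    print("estimate:", estimate_jaccard(bottom_s_sketch(a, s), bottom_s_sketch(b, s), s))
    print("exact:   ", exact_jaccard(a, b))
```

Each element is hashed exactly once, which is where the time saving over ''s'' independent hash functions comes from. If a set has fewer than ''s'' elements, its sketch is correspondingly smaller and the estimate is biased downwards.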
Brona
http://compbio.fmph.uniba.sk/vyuka/mbi/index.php?title=CI05&diff=2965&oldid=prev
Brona: /* Minimizers: how to save memory and time */
2023-10-20T10:06:13Z
<p><span dir="auto"><span class="autocomment">Minimizers: how to save memory and time</span></span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr style='vertical-align: top;'>
<td colspan='2' style="background-color: white; color:black; text-align: center;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black; text-align: center;">Revision as of 10:06, 20 October 2023</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 52:</td>
<td colspan="2" class="diff-lineno">Line 52:</td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* We are guaranteed to find a seed if there are at least k+s-1 consecutive matches</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* We are guaranteed to find a seed if there are at least k+s-1 consecutive matches</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* Could we also look up only every s-th k-mer of the query?</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* Could we also look up only every s-th k-mer of the query?</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>** Čo ak by db a query boli to isté, iba v query <del class="diffchange diffchange-inline">ba </del>chýbalo prvé písmeno?</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>** Čo ak by db a query boli to isté, iba v query <ins class="diffchange diffchange-inline">by </ins>chýbalo prvé písmeno?</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>Example with k=5, s=3: the k-mers on the left are stored, the k-mers on the right are looked up, and one seed is found</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>Example with k=5, s=3: the k-mers on the left are stored, the k-mers on the right are looked up, and one seed is found</div></td></tr>
<tr><td colspan="2" class="diff-lineno">Line 74:</td>
<td colspan="2" class="diff-lineno">Line 74:</td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* A cleverer idea is the '''minimizer''': consider all groups of s consecutive k-mers (a sliding window), find the alphabetically smallest k-mer (the minimizer) in each group, and store it in the dictionary</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* A cleverer idea is the '''minimizer''': consider all groups of <ins class="diffchange diffchange-inline">''</ins>s<ins class="diffchange diffchange-inline">'' </ins>consecutive k-mers (a sliding window), find the alphabetically smallest k-mer (the minimizer) in each group, and store it in the dictionary</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* When the window shifts one position to the right, the smallest k-mer often stays the same and does not need to be stored again, which saves memory (and time)</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* When the window shifts one position to the right, the smallest k-mer often stays the same and does not need to be stored again, which saves memory (and time)</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* The difference is in the lookup: we do not look up all k-mers of the second sequence in the dictionary, only its minimizers, which can save time</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* The difference is in the lookup: we do not look up all k-mers of the second sequence in the dictionary, only its minimizers, which can save time</div></td></tr>
</table>
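The minimizer scheme described in the revision above can be sketched in Python as follows. This is a minimal, unoptimized sketch: the function names, the tie-breaking rule (leftmost k-mer wins), and the use of the page's example sequences in upper case are assumptions made for illustration.

```python
from collections import defaultdict


def minimizers(seq, k, s):
    """Return the set of (position, k-mer) pairs chosen as minimizers.

    Every window of s consecutive k-mers contributes its alphabetically
    smallest k-mer; ties are broken by the leftmost position (an assumption).
    """
    kmer_list = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    chosen = set()
    for start in range(len(kmer_list) - s + 1):
        best = min(range(start, start + s), key=lambda i: (kmer_list[i], i))
        # A set stores a repeated minimizer only once, which is the memory saving.
        chosen.add((best, kmer_list[best]))
    return chosen


def build_index(db, k, s):
    """Dictionary mapping each minimizer k-mer of db to its positions."""
    index = defaultdict(list)
    for pos, kmer in sorted(minimizers(db, k, s)):
        index[kmer].append(pos)
    return index


def find_seeds(index, query, k, s):
    """Look up only the minimizers of the query; return (db_pos, query_pos) hits."""
    hits = []
    for qpos, kmer in sorted(minimizers(query, k, s)):
        for dpos in index.get(kmer, []):
            hits.append((dpos, qpos))
    return hits


if __name__ == "__main__":
    db = "AGTGGCTGCCAGGCTGG"
    query = "CGAGGCTGCCTGGTTGG"
    index = build_index(db, 5, 3)
    print(len(minimizers(db, 5, 3)), "minimizers instead of", len(db) - 5 + 1, "k-mers")
    print(find_seeds(index, query, 5, 3))
```

With k=5 and s=3, the two sequences share the stretch GGCTGCC of length k+s-1=7, which covers one complete window of three k-mers. Both sequences therefore pick the same minimizer in that window, so the shared seed is found even though only a subset of the k-mers is stored and looked up.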
Brona
http://compbio.fmph.uniba.sk/vyuka/mbi/index.php?title=CI05&diff=2843&oldid=prev
Brona at 14:55, 20 October 2022
2022-10-20T14:55:53Z
<p></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr style='vertical-align: top;'>
<td colspan='2' style="background-color: white; color:black; text-align: center;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black; text-align: center;">Revision as of 14:55, 20 October 2022</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 1:</td>
<td colspan="2" class="diff-lineno">Line 1:</td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>==Formula for computing seed sensitivity==</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>==Formula for computing seed sensitivity==</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* Consider a seed of length <del class="diffchange diffchange-inline">w </del>(as in BLAST for nucleotides)</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* Consider a seed of length <ins class="diffchange diffchange-inline">k </ins>(as in BLAST for nucleotides<ins class="diffchange diffchange-inline">; in the lecture the seed length was denoted w, here it is k</ins>)</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* Consider a probabilistic model of an alignment in which each position is a match with probability p and a mismatch or gap with probability (1-p); the alignment has length L</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* Consider a probabilistic model of an alignment in which each position is a match with probability p and a mismatch or gap with probability (1-p); the alignment has length L</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* Random variable X_i = 1 if position i is a match, 0 otherwise</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* Random variable X_i = 1 if position i is a match, 0 otherwise</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* Random variable Y_i = 1 if a seed starts at position i, i.e. if <math>X_i=1, X_{i+1}=1, \dots, X_{i+<del class="diffchange diffchange-inline">w</del>-1}=1</math></div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* Random variable Y_i = 1 if a seed starts at position i, i.e. if <math>X_i=1, X_{i+1}=1, \dots, X_{i+<ins class="diffchange diffchange-inline">k</ins>-1}=1</math></div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* <math>P(Y_i = 1) = p^<del class="diffchange diffchange-inline">w</del></math>, because the X_i are mutually independent</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* <math>P(Y_i = 1) = p^<ins class="diffchange diffchange-inline">k</ins></math>, because the X_i are mutually independent</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* Let <math>Y = \sum_{i=1}^{L-<del class="diffchange diffchange-inline">w</del>+1} Y_i</math></div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* Let <math>Y = \sum_{i=1}^{L-<ins class="diffchange diffchange-inline">k</ins>+1} Y_i</math></div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* By linearity of expectation we easily get <math>E(Y) = (L-<del class="diffchange diffchange-inline">w</del>+1)p^<del class="diffchange diffchange-inline">w</del></math></div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* By linearity of expectation we easily get <math>E(Y) = (L-<ins class="diffchange diffchange-inline">k</ins>+1)p^<ins class="diffchange diffchange-inline">k</ins></math></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* What we are interested in, however, is P(Y>0) = 1-P(Y=0)</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* What we are interested in, however, is P(Y>0) = 1-P(Y=0)</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div><del class="diffchange diffchange-inline">** How could we estimate it by simulation? We want a faster way, though.</del></div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* <math>P(Y=0) = P(Y_1=0 \wedge \dots \wedge Y_{L-<ins class="diffchange diffchange-inline">k</ins>+1}=0)</math></div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* <math>P(Y=0) = P(Y_1=0 \wedge \dots \wedge Y_{L-<del class="diffchange diffchange-inline">w</del>+1}=0)</math></div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* Why does <math>P(Y=0) = P(Y_i = 0)^{L-<ins class="diffchange diffchange-inline">k</ins>+1}</math> not hold?</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* Why does <math>P(Y=0) = P(Y_i = 0)^{L-<del class="diffchange diffchange-inline">w</del>+1}</math> not hold?</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>** The individual Y_i are not independent, e.g. <math>P(Y_{i+1}=1|Y_i=1)=p</math></div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>** The individual Y_i are not independent, e.g. <math>P(Y_{i+1}=1|Y_i=1)=p</math></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>** In the sequence of Y_i, the ones tend to cluster together</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>** In the sequence of Y_i, the ones tend to cluster together</div></td></tr>
<tr><td colspan="2" class="diff-lineno">Line 16:</td>
<td colspan="2" class="diff-lineno">Line 15:</td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* Let A[n] be the probability that the first ''n'' columns of the alignment contain no seed (0<=n<=L)</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* Let A[n] be the probability that the first ''n'' columns of the alignment contain no seed (0<=n<=L)</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* We distinguish cases by the number of ones at the end of X_1..X_n</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* We distinguish cases by the number of ones at the end of X_1..X_n</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>** This number can be 0..<del class="diffchange diffchange-inline">w</del>-1 (if it were >=<del class="diffchange diffchange-inline">w</del>, we would have an occurrence of a seed)</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>** This number can be 0..<ins class="diffchange diffchange-inline">k</ins>-1 (if it were >=<ins class="diffchange diffchange-inline">k</ins>, we would have an occurrence of a seed)</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* <math></div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>* <math></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>A[n] = \left\{\begin{array}{ll}</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>A[n] = \left\{\begin{array}{ll}</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>1 & \mbox{if } n < <del class="diffchange diffchange-inline">w</del>\\  </div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>1 & \mbox{if } n < <ins class="diffchange diffchange-inline">k</ins>\\  </div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>\sum_{i=0}^{<del class="diffchange diffchange-inline">w</del>-1} p^i (1-p)A[n-i-1] & \mbox{if } n \ge <del class="diffchange diffchange-inline">w</del>\\  </div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>\sum_{i=0}^{<ins class="diffchange diffchange-inline">k</ins>-1} p^i (1-p)A[n-i-1] & \mbox{if } n \ge <ins class="diffchange diffchange-inline">k</ins>\\  </div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>\end{array}\right.</math></div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>\end{array}\right.</math></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>In the second case, <math>p^i(1-p)</math> corresponds to <math>P(X_1...X_n\mbox{ ends with exactly }i\mbox{ ones})</math> and A[n-i-1] is <math>P(X_1...X_{n-i-1}\mbox{ contains no seed})</math>, which is the same as <math>P(X_1...X_n\mbox{ contains no seed }| X_1...X_n\mbox{ ends with exactly }i\mbox{ ones})</math></div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>In the second case, <math>p^i(1-p)</math> corresponds to <math>P(X_1...X_n\mbox{ ends with exactly }i\mbox{ ones})</math> and A[n-i-1] is <math>P(X_1...X_{n-i-1}\mbox{ contains no seed})</math>, which is the same as <math>P(X_1...X_n\mbox{ contains no seed }| X_1...X_n\mbox{ ends with exactly }i\mbox{ ones})</math></div></td></tr>
<tr><td colspan="2" class="diff-lineno">Line 55:</td>
<td colspan="2" class="diff-lineno">Line 54:</td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>** What if db and query were identical, except that the query was missing its first letter?</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>** What if db and query were identical, except that the query was missing its first letter?</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>Example <del class="diffchange diffchange-inline">w</del>=5, s=3: the k-mers on the left are stored, the k-mers on the right are looked up, and one seed is found</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>Example <ins class="diffchange diffchange-inline">k</ins>=5, s=3: the k-mers on the left are stored, the k-mers on the right are looked up, and one seed is found</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div><pre></div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div><pre></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>AGTGGCTGCCAGGCTGG    cGaGGCTGCCtGGtTGG   </div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>AGTGGCTGCCAGGCTGG    cGaGGCTGCCtGGtTGG   </div></td></tr>
</table>
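<p>The MinHash section touched by this revision estimates ''J''(''A'',''B'') from per-document sketches. The following is a minimal sketch of that idea, not the course's reference implementation: the names <code>min_hash_sketch</code> and <code>estimate_jaccard</code> are hypothetical, and salted BLAKE2b digests are only an assumed stand-in for the random hash functions <math>h_1, h_2, \dots, h_s</math>. It also assumes every document contains at least one word.</p>

```python
import hashlib


def min_hash_sketch(words, s):
    """Return a list of s minHash values, one per salted hash function.

    Assumption: salted BLAKE2b stands in for s random hash functions.
    The word collection must be non-empty.
    """
    unique_words = set(words)
    sketch = []
    for i in range(s):
        salt = i.to_bytes(8, "little")  # a different salt acts as a different h_i
        sketch.append(min(
            int.from_bytes(
                hashlib.blake2b(w.encode("utf-8"), digest_size=8, salt=salt).digest(),
                "little",
            )
            for w in unique_words
        ))
    return sketch


def estimate_jaccard(sketch_a, sketch_b):
    """Estimate J(A, B) as x / s, where x counts matching sketch components."""
    if len(sketch_a) != len(sketch_b):
        raise ValueError("sketches must have the same length")
    matches = sum(x == y for x, y in zip(sketch_a, sketch_b))
    return matches / len(sketch_a)
```

<p>Building a sketch takes time proportional to the number of words times ''s''; comparing two sketches takes time proportional to ''s'', independent of document length.</p>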
Brona
http://compbio.fmph.uniba.sk/vyuka/mbi/index.php?title=CI05&diff=2840&oldid=prev
Brona: /* Odbočka do analýzy web-stránok: Podobnosť textov */
2022-10-20T14:02:55Z
<p><span dir="auto"><span class="autocomment">Odbočka do analýzy web-stránok: Podobnosť textov</span></span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr style='vertical-align: top;'>
<td colspan='2' style="background-color: white; color:black; text-align: center;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black; text-align: center;">Revision as of 14:02, 20 October 2022</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 126:</td>
<td colspan="2" class="diff-lineno">Line 126:</td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>The question "Which pairs of texts are similar?" can then be rephrased, for example, as "Which pairs of texts have a Jaccard similarity higher than <math>\alpha</math>?", where <math>\alpha \in (0, 1)</math> is some threshold.</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>The question "Which pairs of texts are similar?" can then be rephrased, for example, as "Which pairs of texts have a Jaccard similarity higher than <math>\alpha</math>?", where <math>\alpha \in (0, 1)</math> is some threshold.</div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;"></ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">How quickly could we compute the Jaccard similarity of two sets of words, each with ''n'' elements?</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;"></ins></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>Exact computation of the Jaccard similarity is not always fast enough for a given application, so a natural solution is to compute its value only approximately (i.e. to approximate it).</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>Exact computation of the Jaccard similarity is not always fast enough for a given application, so a natural solution is to compute its value only approximately (i.e. to approximate it).</div></td></tr>
</table>
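<p>One possible answer to the question added in this revision, as a minimal sketch: with hash-based sets, the exact Jaccard similarity of two word sets can be computed in expected time linear in ''n''. The function name <code>jaccard</code> and the convention of returning 1.0 for two empty sets are assumptions made for this illustration.</p>

```python
def jaccard(a, b):
    """Exact Jaccard similarity |A ∩ B| / |A ∪ B| of two iterables of words."""
    set_a, set_b = set(a), set(b)
    union_size = len(set_a | set_b)
    if union_size == 0:
        # Assumption: two empty sets are treated as identical.
        return 1.0
    return len(set_a & set_b) / union_size


# Two of the four distinct words are shared.
print(jaccard("a b c".split(), "b c d".split()))  # 0.5
```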
Brona
http://compbio.fmph.uniba.sk/vyuka/mbi/index.php?title=CI05&diff=2837&oldid=prev
Brona: /* Vzorec na výpočet senzitivity jadra */
2022-10-20T11:43:23Z
<p><span dir="auto"><span class="autocomment">Vzorec na výpočet senzitivity jadra</span></span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr style='vertical-align: top;'>
<td colspan='2' style="background-color: white; color:black; text-align: center;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black; text-align: center;">Revision as of 11:43, 20 October 2022</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 22:</td>
<td colspan="2" class="diff-lineno">Line 22:</td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>\sum_{i=0}^{w-1} p^i (1-p)A[n-i-1] & \mbox{ak } n \ge w\\  </div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>\sum_{i=0}^{w-1} p^i (1-p)A[n-i-1] & \mbox{ak } n \ge w\\  </div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>\end{array}\right.</math></div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>\end{array}\right.</math></div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>In the second row, <math>p^i(1-p)</math> corresponds to <math>P(X_1...X_n\mbox{ ends with exactly }i{ ones})</math> and A[n-i-1] is <math>P(X_1...X_{n-i-1}\mbox{ does not contain a seed})</math>, which is the same as <del class="diffchange diffchange-inline"><!-- </del><math>P(X_1...X_n\mbox{ does not contain a seed }| X_1...X_n\mbox{ ends with exactly <del class="diffchange diffchange-inline">$</del>i<del class="diffchange diffchange-inline">$ </del>ones})</math<del class="diffchange diffchange-inline">>--</del>></div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>In the second row, <math>p^i(1-p)</math> corresponds to <math>P(X_1...X_n\mbox{ ends with exactly }i<ins class="diffchange diffchange-inline">\mbox</ins>{ ones})</math> and A[n-i-1] is <math>P(X_1...X_{n-i-1}\mbox{ does not contain a seed})</math>, which is the same as <math>P(X_1...X_n\mbox{ does not contain a seed }| X_1...X_n\mbox{ ends with exactly <ins class="diffchange diffchange-inline">}</ins>i<ins class="diffchange diffchange-inline">\mbox{ </ins>ones})</math></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>==Minimizers: how to save memory and time==</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>==Minimizers: how to save memory and time==</div></td></tr>
</table>
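<p>The recurrence edited in this revision can be evaluated by straightforward dynamic programming. The sketch below rests on assumptions not visible in this excerpt: a seed is a run of ''w'' consecutive ones, each position is 1 independently with probability ''p'', and A[n] = 1 for n &lt; w (a string shorter than ''w'' cannot contain a seed). The function name <code>seed_sensitivity</code> is hypothetical.</p>

```python
def seed_sensitivity(n, w, p):
    """Probability that a random 0/1 string of length n contains w consecutive ones.

    no_seed[m] plays the role of A[m] = P(X_1...X_m does not contain a seed).
    Assumed base case: A[m] = 1 for m < w.
    """
    no_seed = [1.0] * (n + 1)
    for m in range(w, n + 1):
        # Condition on the string ending with exactly i ones (a zero, then i ones).
        no_seed[m] = sum(p**i * (1 - p) * no_seed[m - i - 1] for i in range(w))
    return 1.0 - no_seed[n]


# Of the 16 strings of length 4, three (1110, 0111, 1111) contain a run of 3 ones.
print(seed_sensitivity(4, 3, 0.5))  # 0.1875
```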
Brona
http://compbio.fmph.uniba.sk/vyuka/mbi/index.php?title=CI05&diff=2836&oldid=prev
Brona: /* Vzorec na výpočet senzitivity jadra */
2022-10-20T11:42:47Z
<p><span dir="auto"><span class="autocomment">Vzorec na výpočet senzitivity jadra</span></span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr style='vertical-align: top;'>
<td colspan='2' style="background-color: white; color:black; text-align: center;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black; text-align: center;">Revision as of 11:42, 20 October 2022</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 22:</td>
<td colspan="2" class="diff-lineno">Line 22:</td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>\sum_{i=0}^{w-1} p^i (1-p)A[n-i-1] & \mbox{ak } n \ge w\\  </div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>\sum_{i=0}^{w-1} p^i (1-p)A[n-i-1] & \mbox{ak } n \ge w\\  </div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>\end{array}\right.</math></div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>\end{array}\right.</math></div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>In the second row, <math>p^i(1-p)</math> corresponds to <math>P(X_1...X_n\mbox{ ends with exactly <del class="diffchange diffchange-inline">$</del>i<del class="diffchange diffchange-inline">$ </del>ones})</math> and A[n-i-1] is <math>P(X_1...X_{n-i-1}\mbox{ does not contain a seed})</math>, which is the same as <math>P(X_1...X_n\mbox{ does not contain a seed }| X_1...X_n\mbox{ ends with exactly $i$ ones})</math></div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>In the second row, <math>p^i(1-p)</math> corresponds to <math>P(X_1...X_n\mbox{ ends with exactly <ins class="diffchange diffchange-inline">}</ins>i<ins class="diffchange diffchange-inline">{ </ins>ones})</math> and A[n-i-1] is <math>P(X_1...X_{n-i-1}\mbox{ does not contain a seed})</math>, which is the same as <ins class="diffchange diffchange-inline"><!-- </ins><math>P(X_1...X_n\mbox{ does not contain a seed }| X_1...X_n\mbox{ ends with exactly $i$ ones})</math<ins class="diffchange diffchange-inline">>--</ins>></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>==Minimizers: how to save memory and time==</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>==Minimizers: how to save memory and time==</div></td></tr>
</table>
Brona
http://compbio.fmph.uniba.sk/vyuka/mbi/index.php?title=CI05&diff=2835&oldid=prev
Brona: /* Vzorec na výpočet senzitivity jadra */
2022-10-20T11:41:36Z
<p><span dir="auto"><span class="autocomment">Vzorec na výpočet senzitivity jadra</span></span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr style='vertical-align: top;'>
<td colspan='2' style="background-color: white; color:black; text-align: center;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black; text-align: center;">Revision as of 11:41, 20 October 2022</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 22:</td>
<td colspan="2" class="diff-lineno">Line 22:</td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>\sum_{i=0}^{w-1} p^i (1-p)A[n-i-1] & \mbox{ak } n \ge w\\  </div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>\sum_{i=0}^{w-1} p^i (1-p)A[n-i-1] & \mbox{ak } n \ge w\\  </div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>\end{array}\right.</math></div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>\end{array}\right.</math></div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>In the second row, <math>p^i(1-p)</math> corresponds to P(X_1...X_n ends with exactly i ones).</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>In the second row, <math>p^i(1-p)</math> corresponds to <ins class="diffchange diffchange-inline"><math></ins>P(X_1...X_n<ins class="diffchange diffchange-inline">\mbox{ </ins>ends with exactly <ins class="diffchange diffchange-inline">$</ins>i<ins class="diffchange diffchange-inline">$ </ins>ones<ins class="diffchange diffchange-inline">}</ins>)<ins class="diffchange diffchange-inline"></math> and A[n-i-1] is <math>P(X_1</ins>.<ins class="diffchange diffchange-inline">..X_{n-i-1}\mbox{ does not contain a seed})</math>, which is the same as <math>P(X_1...X_n\mbox{ does not contain a seed }| X_1...X_n\mbox{ ends with exactly $i$ ones})</math></ins></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>==Minimizers: how to save memory and time==</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>==Minimizers: how to save memory and time==</div></td></tr>
</table>
Brona
http://compbio.fmph.uniba.sk/vyuka/mbi/index.php?title=CI05&diff=2834&oldid=prev
Brona: /* Aproximácia Jaccardovej miery: MinHash */
2022-10-20T11:38:11Z
<p><span dir="auto"><span class="autocomment">Aproximácia Jaccardovej miery: MinHash</span></span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr style='vertical-align: top;'>
<td colspan='2' style="background-color: white; color:black; text-align: center;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black; text-align: center;">Revision as of 11:38, 20 October 2022</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 157:</td>
<td colspan="2" class="diff-lineno">Line 157:</td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>We will have random hash functions <math>h_1, h_2, \dots, h_s</math>.  </div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>We will have random hash functions <math>h_1, h_2, \dots, h_s</math>.  </div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>For each hash function we assume that if we apply it to some set <math>A = \{a_1, a_2, \ldots, a_n\}</math>, then <math>h(a_1), h(a_2), \ldots, h(a_n)<<del class="diffchange diffchange-inline">-</del>math> is a random permutation of the set <math>A</math>.</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>For each hash function we assume that if we apply it to some set <math>A = \{a_1, a_2, \ldots, a_n\}</math>, then <math>h(a_1), h(a_2), \ldots, h(a_n)<<ins class="diffchange diffchange-inline">/</ins>math> is a random permutation of the set <math>A</math>.</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>For a set <math>A = \{a_1, a_2, \ldots, a_n\}</math> and a hash function ''h'', <math>minHash_{h}(A)</math> is defined as follows:  </div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>For a set <math>A = \{a_1, a_2, \ldots, a_n\}</math> and a hash function ''h'', <math>minHash_{h}(A)</math> is defined as follows:  </div></td></tr>
<tr><td colspan="2" class="diff-lineno">Line 166:</td>
<td colspan="2" class="diff-lineno">Line 166:</td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>Let <math>X_i</math> be a random variable that takes the value 1 if <math>minHash_{h_i}(A) = minHash_{h_i}(B)</math>, and the value 0 otherwise. Then <math>E[X_i] = J(A, B)</math>, because there are <math>|A\cup B|</math> elements in total, each equally likely to receive the smallest hash value, and <math>J(A,B) = |A\cap B|/|A\cup B|</math> is the fraction of them that lie in the intersection.</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>Let <math>X_i</math> be a random variable that takes the value 1 if <math>minHash_{h_i}(A) = minHash_{h_i}(B)</math>, and the value 0 otherwise. Then <math>E[X_i] = J(A, B)</math>, because there are <math>|A\cup B|</math> elements in total, each equally likely to receive the smallest hash value, and <math>J(A,B) = |A\cap B|/|A\cup B|</math> is the fraction of them that lie in the intersection.</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>We therefore use the values X_i in place of the random samples discussed above.</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>We therefore use the values <ins class="diffchange diffchange-inline"><math></ins>X_i<ins class="diffchange diffchange-inline"></math> </ins>in place of the random samples discussed above.</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>Algorithm:</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>Algorithm:</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* For each document, hash every word with ''s'' functions, find the minHash for each function, and store the vector of these values as the "sketch" of the document. The running time for a document with n words is O(ns)</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* For each document, hash every word with ''s'' functions, find the minHash for each function, and store the vector of these values as the "sketch" of the document. The running time for a document with <ins class="diffchange diffchange-inline">''</ins>n<ins class="diffchange diffchange-inline">'' </ins>words is <ins class="diffchange diffchange-inline">''</ins>O<ins class="diffchange diffchange-inline">''</ins>(<ins class="diffchange diffchange-inline">''</ins>ns<ins class="diffchange diffchange-inline">''</ins>)</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* For two documents, compare the sketch vectors component by component; if we find x matches, estimate J(A,B) as x/s. Running time O(s)</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* For two documents, compare the sketch vectors component by component; if we find <ins class="diffchange diffchange-inline">''</ins>x<ins class="diffchange diffchange-inline">'' </ins>matches, estimate <ins class="diffchange diffchange-inline">''</ins>J<ins class="diffchange diffchange-inline">''</ins>(<ins class="diffchange diffchange-inline">''</ins>A<ins class="diffchange diffchange-inline">''</ins>,<ins class="diffchange diffchange-inline">''</ins>B<ins class="diffchange diffchange-inline">''</ins>) as <ins class="diffchange diffchange-inline">''</ins>x<ins class="diffchange diffchange-inline">''</ins>/<ins class="diffchange diffchange-inline">''</ins>s<ins class="diffchange diffchange-inline">''</ins>. Running time <ins class="diffchange diffchange-inline">''</ins>O<ins class="diffchange diffchange-inline">''</ins>(<ins class="diffchange diffchange-inline">''</ins>s<ins class="diffchange diffchange-inline">''</ins>)</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* We can also build, for each hash function, a dictionary that maps each minHash value to a list of documents, and then compare only those pairs of documents that ended up in the same list somewhere  (i.e. their estimate of J(A,B) is nonzero)</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* We can also build, for each hash function, a dictionary that maps each minHash value to a list of documents, and then compare only those pairs of documents that ended up in the same list somewhere  (i.e. their estimate of <ins class="diffchange diffchange-inline">''</ins>J<ins class="diffchange diffchange-inline">''</ins>(<ins class="diffchange diffchange-inline">''</ins>A<ins class="diffchange diffchange-inline">''</ins>,<ins class="diffchange diffchange-inline">''</ins>B<ins class="diffchange diffchange-inline">''</ins>) is nonzero)</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>Alternative: instead of ''s'' different functions, we use only one and take not just the minimum but the ''s'' smallest elements. We then estimate J(A,B) by J(<del class="diffchange diffchange-inline">s_A</del>, <del class="diffchange diffchange-inline">s_B</del>) where s_A is the set of values in the sketch of the set A. This saves time when computing the sketch.</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>Alternative: instead of ''s'' different functions, we use only one and take not just the minimum but the ''s'' smallest elements. We then estimate <ins class="diffchange diffchange-inline">''</ins>J<ins class="diffchange diffchange-inline">''</ins>(<ins class="diffchange diffchange-inline">''</ins>A<ins class="diffchange diffchange-inline">''</ins>,<ins class="diffchange diffchange-inline">''</ins>B<ins class="diffchange diffchange-inline">''</ins>) by <ins class="diffchange diffchange-inline"><math></ins>J(<ins class="diffchange diffchange-inline">S_A</ins>, <ins class="diffchange diffchange-inline">S_B</ins>)<ins class="diffchange diffchange-inline"></math> </ins>where <ins class="diffchange diffchange-inline"><math></ins>S_A<ins class="diffchange diffchange-inline"></math> </ins>is the set of values in the sketch of the set <ins class="diffchange diffchange-inline"><math></ins>A<ins class="diffchange diffchange-inline"></math></ins>. This saves time when computing the sketch.</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
</table>
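The component-wise comparison and the per-function dictionaries described in this revision can be sketched as follows. This is only an illustration under stated assumptions: the salted BLAKE2b hash `h` and the helper names `signature`, `estimate_jaccard` and `candidate_pairs` are hypothetical and do not come from the course notes.

```python
import hashlib
from collections import defaultdict
from itertools import combinations


def h(i, word):
    """Hypothetical i-th hash function: salt the word with i, then BLAKE2b."""
    digest = hashlib.blake2b(f"{i}:{word}".encode(), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def signature(words, s):
    """Vector of s minHash values, one per hash function (words must be non-empty)."""
    return [min(h(i, w) for w in words) for i in range(s)]


def estimate_jaccard(sig_a, sig_b):
    """x matching components out of s give the estimate x/s, in O(s) time."""
    x = sum(a == b for a, b in zip(sig_a, sig_b))
    return x / len(sig_a)


def candidate_pairs(signatures):
    """Pairs of documents that share a minHash under at least one function."""
    pairs = set()
    s = len(next(iter(signatures.values())))
    for i in range(s):
        buckets = defaultdict(list)  # minHash value -> list of documents
        for name, sig in signatures.items():
            buckets[sig[i]].append(name)
        for docs in buckets.values():
            pairs.update(combinations(sorted(docs), 2))
    return pairs


docs = {"a": {"x", "y", "z"}, "b": {"x", "y", "z"}, "c": {"p", "q"}}
sigs = {name: signature(words, 16) for name, words in docs.items()}
```

Only the pairs returned by `candidate_pairs(sigs)` have a nonzero estimate, so only those need to be passed to `estimate_jaccard`.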
Brona
http://compbio.fmph.uniba.sk/vyuka/mbi/index.php?title=CI05&diff=2833&oldid=prev
Brona: /* Finding similar sequences */
2022-10-20T11:35:53Z
<p><span dir="auto"><span class="autocomment">Finding similar sequences</span></span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr style='vertical-align: top;'>
<td colspan='2' style="background-color: white; color:black; text-align: center;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black; text-align: center;">Revision as of 11:35, 20 October 2022</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 180:</td>
<td colspan="2" class="diff-lineno">Line 180:</td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>=== Finding similar sequences ===</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>=== Finding similar sequences ===</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>As the "words" we use all k-mers of the given sequence. Then, to find two similar <del class="diffchange diffchange-inline">reads </del>in a set of <del class="diffchange diffchange-inline">reads </del>we can use minhash.</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>As the "words" we use all k-mers of the given sequence. Then, to find two similar <ins class="diffchange diffchange-inline">sequences </ins>in a set of <ins class="diffchange diffchange-inline">sequences </ins>we can use minhash.</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* <del class="diffchange diffchange-inline">For example, </del>Mash <del class="diffchange diffchange-inline">uses </del>k=21, s=1000 (the s <del class="diffchange diffchange-inline">smallest </del>under a single function) for <del class="diffchange diffchange-inline">comparing genomes</del>; the sketch <del class="diffchange diffchange-inline">takes </del>about 8 kB per <del class="diffchange diffchange-inline">genome </del>(<del class="diffchange diffchange-inline">a genome has millions </del>or billions of nucleotides)</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* <ins class="diffchange diffchange-inline">For example, </ins>Mash <ins class="diffchange diffchange-inline">uses </ins>k=21, s=1000 (the s <ins class="diffchange diffchange-inline">smallest </ins>under a single function) for <ins class="diffchange diffchange-inline">comparing genomes</ins>; the sketch <ins class="diffchange diffchange-inline">takes </ins>about 8 kB per <ins class="diffchange diffchange-inline">genome </ins>(<ins class="diffchange diffchange-inline">a genome has millions </ins>or billions of nucleotides)</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
</table>
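A minimal sketch of the single-function variant applied to k-mers, in the spirit of the Mash parameters quoted above. The names `kmers`, `hash64`, `bottom_sketch` and `jaccard` are hypothetical, and the BLAKE2b hash is an assumption chosen for convenience; real tools differ in details such as the hash function and how the two DNA strands are handled. With 64-bit hash values, s=1000 values occupy 8000 bytes, which is consistent with the sketch size of about 8 kB mentioned in the text.

```python
import hashlib
import heapq


def kmers(seq, k):
    """Set of all substrings of length k (the "words" of the sequence)."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def hash64(word):
    """Assumed 64-bit hash of a k-mer (BLAKE2b truncated to 8 bytes)."""
    digest = hashlib.blake2b(word.encode(), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def bottom_sketch(seq, k, s):
    """Set of the s smallest hash values over all k-mers, using one function."""
    return set(heapq.nsmallest(s, {hash64(w) for w in kmers(seq, k)}))


def jaccard(x, y):
    """Jaccard similarity of two sets; two empty sets give 0.0 by convention."""
    union = x | y
    return len(x & y) / len(union) if union else 0.0


seq = "ACGTACGTTTGACCA"
sketch = bottom_sketch(seq, k=4, s=5)
```

The similarity of two sequences is then estimated as `jaccard(bottom_sketch(a, k, s), bottom_sketch(b, k, s))`, following the estimate by the Jaccard similarity of the two sketch sets described in the notes.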
Brona
http://compbio.fmph.uniba.sk/vyuka/mbi/index.php?title=CI05&diff=2832&oldid=prev
Brona: /* Finding similar sequences */
2022-10-20T11:31:50Z
<p><span dir="auto"><span class="autocomment">Finding similar sequences</span></span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr style='vertical-align: top;'>
<td colspan='2' style="background-color: white; color:black; text-align: center;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black; text-align: center;">Revision as of 11:31, 20 October 2022</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 181:</td>
<td colspan="2" class="diff-lineno">Line 181:</td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>As the "words" we use all k-mers of the given sequence. Then, to find two similar reads in a set of reads we can use minhash.</div></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"><div>As the "words" we use all k-mers of the given sequence. Then, to find two similar reads in a set of reads we can use minhash.</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* For example, Mash uses k=21, s=1000 for comparing genomes; the sketch takes about 8 kB per genome (a genome has millions or <del class="diffchange diffchange-inline">billions </del>of nucleotides)</div></td><td class='diff-marker'>+</td><td style="color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* For example, Mash uses k=21, s=1000 <ins class="diffchange diffchange-inline">(the s smallest under a single function) </ins>for comparing genomes; the sketch takes about 8 kB per genome (a genome has millions or <ins class="diffchange diffchange-inline">billions </ins>of nucleotides)</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td><td class='diff-marker'> </td><td style="background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;"></td></tr>
</table>
Brona