k-means++: The Advantages of Careful Seeding

todo314 @ ウィキ内検索 / 「k-means++: The Advantages of Careful Seeding」で検索した結果

k-means++: The Advantages of Careful Seeding
k-means++ The Advantages of Careful Seeding David Arthur, Sergei Vassilvitskii In SODA 2007 メモ Stanford大学の人々概要 k-meansアルゴリズム（Lloydのアルゴリズム）の改良版元のアルゴリズムの問題点である初期値依存性を解決し、Θ(log k)近似を達成。 k-meansアルゴリズムの問題点最適コストとの比に上界が無い nとkの値が固定でも、コストがいくらでも悪くなるような入力を作れる k-means++ k個のクラスタ中心c1,…ckの初期値を確率的に選択する。入力点から一様ランダムに初期点c1を選択し、C←{c...
論文一覧
お役立ち情報スケジュール https //confsearch.ethz.ch/?query=STOC+FOCS+SODA+CCC+ICALP+ITCS+LICS+IPCO+ISSAC+SoCG+PODS+COLT+EC+ESA+STACS+APPROX+RANDOM+MFCS+SWAT+WADS+ISAAC+FUN http //www.conferencelist.info/upcoming.html http //community.dur.ac.uk/tom.friedetzky/conf.html http //www.lix.polytechnique.fr/~hermann/conf.html http //csconf.net/deadlines 国際会議・雑誌 MSAR field rati...
気になった論文
理論計算機科学 ACM Symposium on Theory of Computing STOC 2000 Circuit minimization problem A combinatorial, strongly polynomial-time algorithm for minimizing submodular functions Improved algorithms for submodular function minimization and submodular flow On dual minimum cost flow algorithms The small-world phenomenon an algorithm perspective A random graph model for mass...
Yinyang K-Means: A Drop-In Replacement of the Classic K-Means with ...
Yinyang K-Means A Drop-In Replacement of the Classic K-Means with Consistent Speedup Yufei Ding, Yue Zhao, Xipeng Shen, Madanlal Musuvathi, Todd Mytkowicz ICML 2015 概要 k-meansのLloydアルゴリズムの距離計算枝刈り+中心再計算省略による高速化手法提案手法三角不等式 $$ |d(x,b)-d(b,c)| \leq d(x,c) \leq d(x,b)+d(b,c) $$ $$C,c$$ クラスタ中心の集合、クラスタ中心（のどれか） $$b(x)$$ xの属するクラスタ中心 $$C ,c ,b (x)$$ 次の反復...
StreamKM++: A Clustering Algorithm for Data Streams
StreamKM++ A Clustering Algorithm for Data Streams In JEA 2012 Journal of Experimental Algorithmics In ALENEX 2010 概要コアセットを使ったストリームk-meansアルゴリズムアルゴリズムストリームモデルでコアセットを頑張って計算する確かマルチパス最後にk-means++を適用コアセット Pの(k,ε)コアセット重み付き集合Sと重み関数wの組 $$ \forall C \subset \mathbb{R}^{d}(|C|=k) \quad (1-\epsilon)cost(P,C) \leq cost_w (S,C) \...
Maximizing the Spread of Cascades Using Network Design
Maximizing the Spread of Cascades Using Network Design Daniel Sheldon, Bistra Dilkina, Adam N. Elmachtoub, Ryan Finseth, Ashish Sabharwal, Jon Conrad, Carla P. Gomes, David Shmoys, William Allen, Ole Amundsen, William Vaughan UAI 2010 We apply our model to a sustainability problem that is part of an ongoing collaboration with The Conservation Fund to optimize the conservation of ...
Triadic Measures on Graphs: The Power of Wedge Sampling
Triadic Measures on Graphs The Power of Wedge Sampling C. Seshadhri, Ali Pinar, Tamara G. Kolda SDM 2013 概要クラスタ係数、次数毎のとか、色々を統一的なフレームワークでもとめる wedgeを適当な重みでサンプリングしてtriangleを数えるだけサンプルサイズがグラフのサイズに依存しないところがポイントシンプルにboundが出る提案手法サンプリングの方法基本的にはwedgeをとってきて、それがtriangleを構成するか？を判定する global clustering coefficient choose(deg(v), 2)に比例する確率でv...
Co-clustering documents and words using Bipartite Spectral Graph Partitioning
Co-clustering documents and words using Bipartite Spectral Graph Partitioning Inderjit S. Dhillon KDD 2001 概要文書と単語の共クラスタリングカット最小化として妥当な問題設定スペクトラルグラフ理論の知見から固有ベクトル計算問題として緩和後はk-meansすると大体上手い問題設定 $$ G=({\cal D}, {\cal W}, E) $$ 文書と単語の二部グラフ辺重みが謎… 単語・文書クラスタリングの関係文書のクラスタが与えられた時、各単語は{最も重み和の大きい文書クラスタ}に対応する単語クラスタに属すると考えるというわけで、k...
k-means--: A unified approach to clustering and outlier detection
k-means-- A unified approach to clustering and outlier detection Sanjay Chawla, Aristides Gionis SDM 2013 概要 l点除いてかつk-meansも最小除いたLが外れ値問題定義 D(X\L, C)を最小化するC (of size k) とL (of size l) を求めたい Dは最小二乗距離和 k=1ならn^d^3で解けるけど基本NP-hard 提案手法 k-meansの変形に相当各点についてクラスタまでの距離計算距離でソートして大きいl点をLにつっこむ Lを無視してクラスタ更新これを収束するま...
Limiting the Spread of Misinformation in Social Networks
Limiting the Spread of Misinformation in Social Networks Ceren Budak, Divyakant Agrawal, Amr El Abbadi In WWW 2011 概要誤情報に対する訂正情報をイイ感じに流して誤情報の伝播を減らしたい submodular ってからのヒューリスティクス問題 Multi-Campaign Independent Cascade Model 誤情報と訂正情報はそれぞれ辺ごとに異なる確率をとる同時なら訂正優先一度情報を受け取ったら以後変化しない到達時間が大事定式化既に誤情報はいくらか伝わっている rターンたっている k...
Hartigan's Method: k-means Clustering without Voronoi
Hartigan s Method k-means Clustering without Voronoi Matus Telgarsky, Andrea Vattani JMLR 2010 Journal of Machine Learning Research
Stochastic Submodular Maximization: The Case of Coverage Functions
Stochastic Submodular Maximization The Case of Coverage Functions Mohammad Reza Karimi, Mario Lucic, Hamed Hassani, Andreas Krause NIPS 2017 概要確率的劣モジュラ最大化のための新しい手法を提案まずは、被覆関数から連続拡張をさらに緩和した問題を直接解いてから元に戻す問題目的関数 $$ f(S) = \mathbb{E}_{\gamma \sim \Gamma}[f_\gamma(S)] $$ 影響最大化なら、影響グラフからサンプルしたグラフ上で到達可能な頂点数線形連続拡張 $$ F(\bm{x}) = \mathbb{E}...
Nonnegative Spectral Clustering with Discriminative Regularization
Nonnegative Spectral Clustering with Discriminative Regularization Yi Yang, Heng Tao Shen, Feiping Nie, Rongrong Ji, Xiaofang Zhou AAAI 2011 概要（だけ）よくあるクラスタリング手法どの要素がどのクラスタに属するかを表すindicator matrixで目的関数を表現そのままだと解けないので{0,1}から緩和する固有値分解をココらへんで使う頑張って解く ±混ざっている {0,1}にする何が問題？混合符号の行列がもらえた時にそれを量子化する簡単な方法が無い EM-like / k-means / spe...
Debunking the Myths of Influence Maximization: An In-Depth Benchmarking Study
Debunking the Myths of Influence Maximization An In-Depth Benchmarking Study SIGMOD 2017 概要だけ提案されたきた影響最大化の手法は本当に効率的なのか？比較手法 CELF, CELF++, TIM+, IMM, PMC, StaticGreedy, LDAG, SIMPATH, EaSyIM, IRIE, IMRANK 徹底的な実験を決行個々の論文の著者の主張は間違っている！！ • PMC [39] PMC establishes itself as the only technique that consistently provides high spread and scales for bot...
The Role of Network Distance in LinkedIn People Search
The Role of Network Distance in LinkedIn People Search Shih-Wen Huang, Daniel Tunkelang, Karrie Karahalios 2人目はLinkedIn SIGIR 2014 概要 LinkedInのクエリを色々調べてみたクエリの種類（タグ）でクリックの挙動がかなり変わる名前じゃないクエリは自分から2次の人が選ばれやすい 1次はもういらないが，距離1の人のコネを使うと新しいコネができやすいログ解析 Non-name Job Title, Skill, Company Name Name Last Name, First Name, First Nam...
On the Streaming Complexity of Computing Local Clustering Coefficients
On the Streaming Complexity of Computing Local Clustering Coefficients Konstantin Kutzkov, Rasmus Pagh WSDM 2013 概要ワンパスでlocal clustering coefficientを求めたいローカルなので、頂点ごと辺リストが任意の順でもらえる全く三角形が無い or 1/2以上のCCを持つ次数2d以上の頂点がある、かをある程度の確率でワンパスで判定するためにはΩ(m/d)ビット必要 ↑の限界にマッチした乱択アルゴリズムを考案 Lower bound Theorem 1 ワンパス乱択アルゴリズムが次数2dの頂点はクラスタ係数0 ...
Probabilistic Solutions of Influence Propagation on Networks
Probabilistic Solutions of Influence Propagation on Networks Miao Zhang, Chunni Dai, Chris Ding, Enhong Chen 色々いるし名前を知らん CIKM 2013 概要新しいinfluence spreadの計算方法包除原理？実験もしてオリジナルより速くなったよ！ Exact influence spread n=3,4,5について、頑張ってinfluence spreadを厳密計算する 3頂点について 1がseed パターンを全部考えて、2、3がactiveになる確率を求めたでも、もっと簡単にできる 1から2がactiveにな...
Outward Influence and Cascade Size Estimation in Billion-scale Networks
Outward Influence and Cascade Size Estimation in Billion-scale Networks]] Hung T. Nguyen, Tri P. Nguyen, Tam N. Vu, Thang N. Dinh SIGMETRICS 2017 （会議） Proceedings of the ACM on Measurement and Analysis of Computing Systems 2017 （ジャーナル）概要影響力推定の新しい指標outward influenceを作ったよ！…E[拡散サイズ]-|シードサイズ| 相対誤差を保証するのが難しい高速アルゴリズムを作ったよカスケードが小さくなり過ぎようにimportance samplingを...
Simulated Annealing Based Influence Maximization in Social Networks
Simulated Annealing Based Influence Maximization in Social Networks Qingye Jiang, Guojie Song, Cong Gao, Yu Wang, Wenjun Si, Kunqing Xie In AAAI 2011 概要 influence maximizationに対する初の焼きなましベースアルゴリズム influence spreadを高速に近似計算アルゴリズム SA based 適当にseed setを変更するだけ SAEDV (Expected Diffusion Value) Aによりactivateされるノード数の期待値は $$ |A| + \sum_{v \in N^{o...
Analyzing Spammer's Social Networks for Fun and Profit
Analyzing Spammer s Social Networks for Fun and Profit -- A Case Study of Cyber Criminal Ecosystem on Twitter Chao Yang, Robert Harkreader, Jialong Zhang, Seungwon Shin, Guofei Gu Texas A M Universityの人々 In WWW 2012 参考 http //www.slideshare.net/KuoE0/www2012-analyzing-spammers-social-networks-for-fun-and-profit 概要 Twitterのスパムに関するcase study スパム同士は結合...
The Pursuit of a Good Possible World: Extracting Representative Instances of ...
The Pursuit of a Good Possible World Extracting Representative Instances of Uncertain Graphs Panos Parchas, Francesco Gullo, Dimitris Papadias Francesco Bonchi SIGMOD 2014 概要 Uncertain graphsを扱うのは大変サンプリングは標本数が多くなる問題：最短経路長，パターンマイニング，部分グラフ探索問題毎にアドホックに開発されている→既存の枠組みが無駄→辛いあるグラフで代表させたい元の性質を保ったまま，決定的な代表的グラフを作るよ元の性質=（今回は）期待頂点次数平均次数リワイヤ(AD...
Fast Approximation Algorithms for the Diameter and Radius of Sparse Graphs
Fast Approximation Algorithms for the Diameter and Radius of Sparse Graphs Liam Roditty, Virginia Vassilevska Williams In STOC 2013 メモ Y.Yano 直径2近似O(n+m) BFSして最大の高さ additive approximation Aingworth 2つの組み合わせ 2種類のBFS木の高さの最大値 $$ s \in [1,n] $$ O(ns^2 + (s+n/s)m) 論文の内容 ns^2の項を消したい ↑高々s頂点のBFS N_s^out(v)を求めないで頑...
From Dango to Japanese Cakes: Query Reformulation Models and Patterns
From "Dango" to "Japanese Cakes" Query Reformulation Models and Patterns Paolo Boldi, Francesco Bonchi, Carlos Castillo, Sebastiano Vigna 概要 Reformulation model QRT(query reformulation type)の分類学習結果は精度92% Reformulation strategies QRTの列からミッションを探してパターンを見つける手動（小さいデータ）と一致するよ！ Query Flow GraphをQRTでアノテートレコメンドをQFG上のランダムウォークでやる ...
The Importance of Communities for Learning to Influence
The Importance of Communities for Learning to Influence Eric Balkanski, Nicole Immorlica, Yaron Singer NIPS 2017 概要 The Power of Optimization from Samplesはcurvature制限時だった今回は影響最大化をコミュニティ構造が明らかな場合、OPSの枠組みでなんとかするよコミュニティの大きさを捉えられそうなアルゴリズムを提案 SBMの簡単な設定で近似比を証明アルゴリズム COmmunity Pruning from Samples 設定無向なので、基本的に連結成分の大きさ（の和）の期待値が影響力になる underlyingな...
Towards Context-Aware Search by Learning A Very Large Variable Length Hidden ...
Towards Context-Aware Search by Learning A Very Large Variable Length Hidden Markov Model from Search Logs Huanhuan Cao, Daxin Jiang, Jian Pei, Enhong Chen, Hang Li MSRAとUniversity of Science and Technology of China WWW 2009 概要たった今調べたクエリからURLを正しくレコメンドするのは無理例ホントは車のレビューサイトを見たい検索クエリ Ford new cars → Toyota new cars 個々のクエリに着目するとautohome.comは出てこない ...
Top-K Nearest Keyword Search on Large Graphs
Top-K Nearest Keyword Search on Large Graphs Miao Qiao, Lu Qin, Hong Cheng, Jeffrey Xu Yu, Wentao Tian In VLDB 2013 メモ Jeffrey Xu Yu!! 概要 top-k nearest keyword search 頂点 0個以上のキーワード辺長さクエリノード、キーワード、k 出力キーワードを含みノードに近いkノード top-k nearest keyword search に対するアルゴリズム最短経路木 2つ提案 kが小さいよう大きくてもOK 応用 Facebo...
4chan and /b/: An Analysis of Anonymity and Ephemerality in a Large Online ...
4chan and /b/ An Analysis of Anonymity and Ephemerality in a Large Online Community Michael S. Bernstein, Andrés Monroy-hernández, Drew Harry, Paul André, Katrina Panovich, Greg Vargas ICWSM 2011 概要 anonymity（匿名性）とephemerality（短命であること）の面から4chanを理解する 4chanと/b/ /b/はランダム掲示板 4chan全体の30%のトラフィックリプライしたりすると最初のページに浮上 15ページ目の最下層に行くと消されて`Page Not Found ...
Minimizing the Spread of Contamination by Blocking Links in a Network
Minimizing the Spread of Contamination by Blocking Links in a Network Masahiro Kimura, Kazumi Saito, Hiroshi Motoda AAAI 2008 概要タイトルまんまの論文汚染最小化問題目的関数は各頂点のσの平均既存の研究（Kempe依然）では，出次数の大きい順に頂点を消していけば大体良いとしてた今回は辺を消す問題定義 min 1/|V|Σσ(v) Eからk辺取り除ける提案手法最も目的関数が小さい辺を選ぶ貪欲アルゴリズム σの計算がやばいこの著者らが考えたBond Percolationで高速化実験 ...
Information Diffusion and External Influence in Networks
Information Diffusion and External Influence in Networks Seth Myers, Chenguang Zhu, Jure Leskovec In KDD 2012 メモ http //cs.stanford.edu/people/jure/pubs/ext-kdd12.pdf アブスト influence の始まりってどこ？ってのを考える influence の伝わり方は2パターンあるノード-ノードを伝ってネットワーク外からこれに注目新しいモデルフィッティングしやすいパラメータ Twitterに適用完全なひと月のデータ情報がネットワークをジャンプしている ...
Maximizing the Extent of Spread in a Dynamic Network
Maximizing the Extent of Spread in a Dynamic Network Habiba, Tanya Y. Berger-Wolf ? 2007 概要動的ネットワークでinfluence maximizationやるお！問題定義 Dynamic Network G_t = (V_t, E_t)の列特定の時点のものだから，辺は消えたりする Independent Cascade Modelの拡張 uが時刻tでactiveだったら(u_t, v_t)なv_tを伝播確率でactivate 成功したらvは時刻t+1でactiveになる一旦activeになったらずっとactive 他の時刻で辺があったらまた試行できる ...
In Search of Influential Event Organizers in Online Social Networks
In Search of Influential Event Organizers in Online Social Networks Kaiyu Feng, Gao Cong, Sourav S. Bhowmick, Shuai Ma SIGMOD 2014 概要影響最大化 + 集合被覆のような問題タイトルの通りイベント主催者を見つけるのが動機づけ貪欲アルゴリズムと近似比2のアルゴリズムを提案イントロ Plancast，Meetupというサービスが出てきているイベント主催者も影響力が有ったほうがいいねでも，分野横断とかだと色々な内容をカバーしてないと駄目だね this paper 問題定義グラフ G = (V, E, w, A) ...
Spheres of Influence for More Effective Viral Marketing
Spheres of Influence for More Effective Viral Marketing Yasir Mehmood, Francesco Bonchi, David Garcia-Soriano SIGMOD 2016 概要確率的な挙動の典型的なカスケードが欲しい期待Jaccardを最小化する頂点集合を計算する問題ありうるカスケード皆に程々に近い、安定性の指標でもある O(1)サンプルで定数倍近似貪欲に勝った！動機づけ「少数のスーパースター」よりも「多数の凡人」の方が信頼できる個々の影響力は小さいけれど、クリティカルマス到達最確カスケードは良くないカスケードは2^n種類あるので、最大の確率はかなり小さく、代表的とは...
Subgraph Frequencies: Mapping the Empirical and Extremal Geography of Large ...
Subgraph Frequencies Mapping the Empirical and Extremal Geography of Large Graph Collections Johan Ugander, Lars Backstrom, Jon Kleinberg In WWW 2013 メモ岩田さん概要 social networkを全体を見て解析するのではなく、密な部分ごとに見る 3-4頂点の部分グラフの出現頻度を見ると特徴がわかる実験データ Facebook Lars BackstromがFacebookの人だからしょーがないね(´・ω・`) 抽出する部分グラフ Neighborhoods 友人 Groups...
Real-Time Influence Maximization on Dynamic Social Streams
Real-Time Influence Maximization on Dynamic Social Streams Yanhao Wang, Qi Fan, Yuchen Li, Kian-Lee Tan VLDB 2017 概要クエリ Stream Influence Maximization sliding windowモデルで考える Influential Checkpoints 途中途中で結果をとっておいて ε-近似 Sparse Influential Checkpoints チェックポイントの数が多すぎるので、対数個くらいにまで減らす (log N)/β個で、ε(1-β)/2-近似問題定式化行動 $$ a_t = \langle...
The Effect of the Back Button in a Random Walk: Application for PageRank
The Effect of the Back Button in a Random Walk Application for PageRank Fabien Mathieu, Mohamed Bouklit WWW 2004 概要バックボタンを入れたPageRankを考える計算も早そう Back Button Model Reversible Back バックボタンの効果 1つ前の「状態」に戻る 2回バックすると，今いるページに戻る記憶領域は前の状態一つだけどれかの辺をたどる確率とバックボタンの確率は同じ t+1ステップ目でwからvに遷移する確率 Pr[今wにいてwvを辿る]+Pr[vからwに移動した状態でバック] (wv∈E) Pr...
On k-Path Covers and their Applications
On k-Path Covers and their Applications Stefan Funke, André Nusser, Sabine Storandt VLDB 2014 best paper award スライド見てね概要 Minimum k-All Path Cover (k-APC) 入力有向グラフ G=(V,E) 整数k 出力 C⊆V であって，任意の長さkの単純経路πについて，C∩π≠∅ minimize |C| Minimum k-Shortest-Path Cover (k-SPC) [Tao, Sheng, Pei. SIGMOD 11] 入力有向グラフ G=(V,E,c) ...
Finding Influential Nodes in a Social Network from Information Diffusion Data
Finding Influential Nodes in a Social Network from Information Diffusion Data Masahiro Kimura, Kazumi Saito, Ryohei Nakano, Hiroshi Motoda SBP 2009 Social Computing and Behavioral Modeling 概要ノードの影響力をカスケード情報からランキングしたい ICモデルで確率を見積もるよ！ただし，確率の値は一様実際のネットワークで実験してみるヒューリスティクスより精度良い手法 Prediction of Information Diffusion Probabilities ...
Influence of the Dynamic Social Network Timeframe Type and Size on the Group ...
Influence of the Dynamic Social Network Timeframe Type and Size on the Group Evolution Discovery Stanisław Saganowski, Piotr Bródka, Przemysław Kazienko ASONAM 2012 概要 GED (Group Evolution Discovery) 法のパラメータチューニングの解析グループ発展時間発展でコミュニティは変化していくが，それを下記に分類 Continuing(停滞) サイズに変化なし．頂点がちょっと変わるくらいならOK Shrinking サイズが小さくなる Growing サイズが大きくなる...
Structure-Preserving Sparsification of Social Networks
Structure-Preserving Sparsification of Social Networks Gerd Lindner, Christian L. Staudt, Michael Hamann, Henning Meyerhenke, Dorothea Wagner ASONAM 2015 概要疎化したグラフの性質を調べたよ "Local Degree"なる単純な手法で実は充分良いよ辺数が元の20%でも、大体保存される比較手法 Random Edge (RE) 一様ランダムに辺を選んでいく Triangles 属する△の個数の多い辺を順に残す Local Similarity (LS) 得点 = J(N(u...
Distance Constraint Reachability Computation in Uncertain Graphs
Distance Constraint Reachability Computation in Uncertain Graphs Ruoming Jin, Lin Liu, Bolin Ding, Haixun Wang VLDB 2011 概要経路長に制限を入れた到達可能性確率計算クエリ普通のシミュレーションは遅いので，頭のいい方法を考案最大で1M辺規模で動く問題定義と動機付け距離制約到達可能性クエリ (Distance-constraint reachability) 入力 2頂点s,t, uncertainグラフ G, 閾値d 出力 s-t間の最短経路長がd以下である確率$$ R_{s,t}^d(\mathcal{G}) $$は？ P2Pネットワークでの応用例...
Negative Influence Minimizing by Blocking Nodes in Social Networks
Negative Influence Minimizing by Blocking Nodes in Social Networks Senzhang Wang, Xiaojian Zhao, Yan Chen, Zhoujun Li, Kai Zhang, Jiali Xia AAAI Workshop 2013 概要ノードを取り除いて影響の拡散を抑えたい！ウイルス，誤情報等 Negative Influence Minimization 感染シード(given) I ブロック S(|S|=k) 目標 minimize σ(I; V-S) Sは感染しない手法 σが小さくなるノードを貪欲に選んでいく実験比較...
FAST-PPR: Scaling Personalized PageRank Estimation for Large Graphs
FAST-PPR Scaling Personalized PageRank Estimation for Large Graphs Peter Lofgren, Siddhartha Banerjee, Ashish Goel, C. Seshadhri KDD 2014 概要 $$ \pi_s(t) \delta $$を$$ \mathcal{O}(\sqrt{d/\delta}) $$時間で近似計算相対誤差が小さい既存は$$ \Omega(1/\delta) $$時間で、$$ \delta=\Theta(1/n) $$にしたいので遅い□ 計算時間の下限$$ \Omega(1/\sqrt{\delta}) $$を示した実験的には数十倍速い提案手法設定 ...
Learning Stochastic Models of Information Flow
Learning Stochastic Models of Information Flow Luke Dickens, Ian Molloy, Jorge Lobo, Pau-Chen Cheng, Alessandra Russo ICDE 2012 概要 ICモデルの確率予測 Metropolis-Hastingsアルゴリズム attributed 影響の親が分かる unattributed 親が分からん両方について実験 Attributedの場合シード集合，活性頂点集合，拡散の履歴が分かる βICモデル各辺の確率ベータ分布(α_e,β_e)に従う平均α/(α+β) 拡散の履歴から各α，βをインクリメントするだ...
Revenue Maximization in Incentivized Social Advertising
Revenue Maximization in Incentivized Social Advertising Çigdem Aslay, Francesco Bonchi, Laks V. S. Lakshmanan, Wei Lu VLDB 2017 概要 Viral Marketing Meets Social Advertising Ad Allocation with Minimum Regret VLDB 15の続き感 Incentivized ユーザにも収入の分け前を上げる設定単調劣モジュラ最大化 with 分割マトロイド制約 (for 広告-シード配置) and 劣モジュラナップサック制約 (for 各広告主の予算) 2つの近似手法を提案して、RISで高速化定式化 ...
Community-based Greedy Algorithm for Mining Top-K Influential Nodes in ...
Community-based Greedy Algorithm for Mining Top-K Influential Nodes in Mobile Social Networks Yu Wang, Gao Cong, Guojie Song, Kunqing Xie 焼きなましベースの人々と大体同じ KDD 2010 概要 NewGreedyIC（MixedGreedy）がstate-of-the-artだったころの話どうしても時間がかかっちゃうので、コミュニティに分割することにした Community-based Greedy algorithm ちょっとおもしろい点コミュニティ分割がICモデルのシミュレートで行われる ↑の後はDPする予備知識みたいな...
k-Nearest Neighbors in Uncertain Graphs
k-Nearest Neighbors in Uncertain Graphs Michalis Potamias, Francesco Bonchi, Aristides Gionis, George Kollios VLDB 2010 概要 k近傍クエリ最短距離や酔歩に基づく尺度を拡張厳密計算は難しい Monte-Carloサンプリングで近似アルゴリズムグラフ変形，枝刈り実験：数十M辺メモ VLDB版ではない原稿を読んだのでちょっと違うイントロ確率的グラフ辺に確率が割り振られているセンサや実験による雑音不安定な通信リンク予測秘匿性のための摂動気になること...
Scalable and Parallelizable Processing of Influence Maximization for ...
Scalable and Parallelizable Processing of Influence Maximization for Large-Scale Social Networks Jinha Kim, Seung-Keol Kim, Hwanjo Yu 浦項（ぽはん）工科大学校 In ICDE 2013 概要並列化可能なアルゴリズム競技相手はPMIA 質はCELF並、PMIAより良い速度はPMIAより速い提案手法 Independent Path Algorithm(IPA) 経路を指定したらそれを使う確率は全部かければ良いグラフがもらえた有るノードからtraverseして木っぽくパスを広げる（同じ頂点がいくつかある...
New Models for Competitive Contagion
New Models for Competitive Contagion Moez Draief, Hoda Heidari, Michael Kearns AAAI 2014 概要競合モデル2つ Connectivity と Endogenous 過去にはGoyal and Kearnsモデルがある影響度だけでは不十分なので連結度を考える Price of Anarchy と Budget Multiplierが大事な要素（らしい）これに結果を与えた Switching-selection framework Goyal and Kearnsのモデル Connectivity 活性頂点内の辺の数も考慮する E...
The Power of Random Neighbors in Social Networks
The Power of Random Neighbors in Social Networks Silvio Lattanzi, Yaron Singer WSDM 2015 概要友達は友達が多い (friendship paradox) power-lawが十分条件では無いどういうモデルがparadoxを説明できるのか？というわけで，モデルを提案辺を張り替える影響最大化への適用例友達の逆説 friendship paradox 友達は友達が多い（頂点の近傍次数）（その頂点の近傍の平均次数） power-lawが十分条件では無い実際にそういう例を作れる極端な例正則グラフ ☆ ...
The Role of Social Networks in Information Diffusion
The Role of Social Networks in Information Diffusion Eytan Bakshy, Itamar Rosenn, Cameron Marlow, Lada Adamic In WWW 2012 メモ Y.Yoshidaさん新しい情報は「疎遠」な友人から not 親しい友人仕事とか思ったよりは多いねというくらい親しい友人の方が多いなんで？強い情報がすぐ拡散公募はすぐ埋まる得る情報 Feed 友人のURLをシェア外界メールとか Feedを遮断 |V|=253M Feed有：6...
@wiki全体から「k-means++: The Advantages of Careful Seeding」で調べる

更新順にページ一覧表示 | 作成順にページ一覧表示 | ページ名順にページ一覧表示 | wiki内検索

todo314 @ ウィキ

メニュー

更新履歴

k-means++: The Advantages of Careful Seeding

todo314 @ ウィキ内検索 / 「k-means++: The Advantages of Careful Seeding」で検索した結果