Social network dataset github. This folder contains network data for relationships between President Donald Trump and other people, which was originally compiled by John Templon, Anthony Cormier, Alex Campbell, and Jeremy Singer-Vine as part of a larger project of mapping "TrumpWorld" for BuzzFeed News. We also provide interactive visual graph mining. Short Mutually liked facebook pages. The latest and most popular social events will be disclosed and discussed on Weibo as soon as possible. The confusion matrix with SVM (linear kernel) shows that our model predicts 90 This project focuses on Social Network Analysis (SNA) using Hierarchical Clustering. [1] It characterizes networked structures in terms of nodes (individual actors, people, or things within the network) and the ties, edges, or links (relationships or interactions) that connect them. Utilizing libraries to analyse network measures like degree, centrality, diameter , descriptive statistics and linear regression modeling to predict user engagement based on followers. This comprehensive guide will explore the Social Network Ads Dataset available on GitHub, its significance, and how you can use it for analysis and modeling. csv Cannot retrieve latest commit at this time. xlsx at master · Hevenicio/Network-Data-Science-with-NetworkX-and-Python Community detection in complex networks is crucial for understanding the structure and dynamics of various systems, including social networks. Social network of LastFM users from Asia. Publicly available datasets for downstream tasks in social network analysis. - GitHub - lum A collection of multiple social media dataset samples. GitHub is where people build software. Data collected about Facebook pages (November 2017). edu, and Reddit Communities; time series datasets; and the largest public network evolution dataset with over 20,000 networks and over a million real-world graphs. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR). The Structure and Dynamics of Networks, edited by Mark E. This large comprehensive collection of graphs are useful in machine learning and network science. Amazon product metadata: product info and all reviews on around 548,552 products. Contribute to socialx-analytics/dataset-sna development by creating an account on GitHub. The Stanford SNAP Logo Specifically, this dataset catalogues hyperlinks between subreddits over the course of 2. - benedekrozemberczki/datasets Google+ is a social networking service and website offered by Google. This repository contains the social networks course notes, network data sets and python programs for network analysis. Contribute to SophiaVei/Community-Detection-in-Social-Networks development by creating an account on GitHub. py The Dataset: The bottlenose dolphin is a very intelligent social creature. All datasets are in igraph format. Each entry represents a user and includes attributes like User ID, Gender, Age, Estimated Salary, and Purchased (indicating a purchase with 1 and no purchase with 0). Nodes represent the pages and edges are mutual likes among them. " Learn more This folder contains network data for character relationships within the Marvel comic book universe (beginning in 1961 and ending around 1999/2000?), which was originally compiled by Cesc Rosselló, Ricardo Alberich, and Joe Miro from Russ Chappell's Marvel Chronology Project *, a database that catalogues every appearance by every significant character in the Marvel comic book universe. The dataset consists of anonymized social network data from Facebook, where nodes represent users and edges About This project involves analyzing a social network dataset using the NetworkX library in Python. The datasets combine the raw data of various Gaoseng zhuan 高僧傳 projects with the Buddhist Person Name Authority. 6 million Twitter tweets. Jun 2, 2025 · Social Network Analysis with Facebook Dataset Overview This project analyzes the SNAP Social Circles: Facebook Dataset using Python, focusing on network analysis and visualization techniques. The Dataset contains information about users on a Social Networking site and using that information as features for our ML model, the model predicts whether a particular user after clicking on an ad on the Social networking site goes on to buy a particular product or not. The dataset you are referring to is the Facebook Social Circles Dataset, which is part of a collection of social network datasets. All data sets are easily downloaded into a standard consistent format. We explore several aspects, including social graphs, user mobility patterns and malicious account detection. txt and soc-pokec-relationships. This includes social networks, animal networks and movie networks. Kaggle-Datasets / Social_Network_Ads. Social network of Twitch users. Nodes are Twitch users and edges are mutual follower relationships between them. edu/data/github-social. The last column of the dataset is a vector of boolean This is a public dataset for network things. The SNS Data Clustering project explores a dataset of 30,000 social media users, capturing various behavioral and demographic features. Foursquare: This dataset contains check-ins in NYC and Tokyo collected for about 10 month. NetworkX is a Python package for the creation, manipulation, and study of the structure, dynamics, and functions of complex networks. Sina Weibo is Chinese largest public social media platform. ML20m: The ML-20M dataset is a larger movie rating dataset consisting of 20 million ratings from users on a vast collection of movies. This repository contains a series of machine learning experiments for link prediction within social networks. These datasets represent blue verified Facebook page networks of different categories. Nodes are developers who have starred at least 10 repositories and edges are mutual follower relationships between them. Load a sample dataset and start to play with the data. Nodes represent the pages and edges are mutual likes among First the pre-processing of data is done and then the prediction is done using Support Vector Machine (SVM) and kernel SVM. Follow these steps to work with the dataset: Download the dataset from Pokec Social Network Dataset provided by Jure Leskovec of Stanford University. The project includes network analysis techniques, community det Our dataset contains some information about all of our users in the social network, including their User ID, Gender, Age, and Estimated Salary. Social network analysis is the process of investigating social structures through the use of networks and graph theory. 25). , 2022) and have a GitHub account, either clone or fork and clone the repository to your computer using the usual Git Contribute to SatadruMukherjee/Dataset development by creating an account on GitHub. The dataset includes anonymized user profiles (gender, age, hobbies, education, etc. This project explores the use of Graph Neural Networks (GNNs) to enhance community detection, transforming the problem into a node classification task. Contribute to dnllvrvz/Social-Network-Dataset development by creating an account on GitHub. Prediction system to predict which user is going to buy a product displayed on a social media advertisement using random forest classification. The confusion matrix and visualization clearly shows the prediction made by both models and the difference. The goal is to preprocess the data, handle missing values, and prepare it for clustering analysis. The dataset is split into 75/25 ratio (training set = 0. [Project done on Coursera Project Network] - Network-Data-Science-with-NetworkX-and-Python/Social Network Dataset. Therefore, it is of great significance to build a real-time and full-scale Weibo public opinion dataset. This GitHub repository is intended to create social network datasets based on data pulled from Twitter and provide some useful tools for analysis. Download and Install Gephi on your computer. Here you'll find various public large-scale datasets that include online social network datasets, such as Facebook, Google+, Academia. Kumpulan dataset untuk Social Network Analysis. 2009 Research on Location-Based Social Networks (LBSNs) at the Mobile Systems and Networking Group at Fudan University Background We are interested in understanding the user behavior under the context of mobile social apps. stanford. These datasets are ideal for brand awareness, consumer sentiment analysis, and for tracking social me A repository of pretty cool datasets that I collected for network science and machine learning research. Using our organization social network crawler, we collected data from six companies on three different scales: Small (S), Medium (M), and Large (L) scale companies currently employing 500 to 2,000, 4,000 to 20,000, and more than 50,000 employees, respectively. data-science data machine-learning awesome twitter sentiment-analysis social-networks social-network dataset awesome-list datasets social-network-analysis Updated on Nov 27, 2023 Social-Networks-Ads One of the most basic data sets to learn and implement some of the most easy and basic algorithms of machine learning and visualization Social Network Ads A categorical dataset to determine whether a user purchased a particular product This project is an implementation of the paper entitled "An automata algorithm for generating trusted graphs in online social networks" which combines graph-based and artificial intelligence methodologies to develop a hybrid model for enhancing OSN coverage and accuracy Publicly available datasets for downstream tasks in social network analysis. Leveraging NLP techniques, including traditional ML and BERT models, it conducts sentiment analysis on a dataset of 1. We first implement and apply a variety of link prediction methods to each of the ego networks contained within the SNAP Facebook dataset and SNAP Twitter dataset, as well as to various random networks generated using networkx, and then calculate and compare the ROC AUC, Average Contribute to yashbaisoya/Social-Network-Dataset development by creating an account on GitHub. Please note that this is a work in progress and much of the information related to the dataset statistics and citations needs to be updated. ⠀ Signed network datasets collected for network science, deep learning, and social network analysis research. Predict connections in a social network using a random forest classifier. . The graph forms a single strongly connected component without missing attributes. Nodes are developers who have starred at least 10 repositories and edges are follower relationships between them. melaniewalsh / sample-social-network-datasets Public Notifications You must be signed in to change notification settings Fork 200 Star 126 The training data from “Influencers in Social Networks” dataset from Kaggle was used to identify key predictors of social influence in Twitter. Here, we specifically study and build our model over Facebook's social network, with the following areas of motivation: General application of friends recommendation to a particular user. Add this topic to your repo To associate your repository with the social-networking-dataset topic, visit your repo's landing page and select "manage topics. Graph for "Get On With" Dataset Graph for "Work With" Dataset Background Social network anlaysis (SNA) has found utility is institutional, classroom and analyses of networked data in socially-based educational games. Gowalla: This dataset is from a location-based social networking website where users share their locations by checking-in, and contains a total of 6,442,890 check-ins of these users over the period of Feb. Newman, Albert-László Barabási and The Data In this project, I worked with the Stanford Social Network: Reddit Hyperlink Network dataset made available through SNAP, the Stanford Network Analysis Platform. Jul 6, 2024 · We collect the publicly available dataset repository of information diffusion tasks with the available links and compare them based on six attributes affiliated to users and content: user information, social network, bot label, propagation content, propagation network, and veracity label. At present, given specified Dataset Features: In our study, we use the dataset TwiBot-20*, a comprehensive Twitter bot detection benchmark that presents one of the largest Twitter datasets to date. Social Network Analysis, by John Scott (2017). Social Network Analysis. A graph and network repository containing hundreds of real-world networks and benchmark datasets. A large social network of GitHub developers which was collected from the public API in June 2019. Community Detection on a Twitter Dataset. - shivang98/Social-Network-ads-Boost GitHub, a platform widely used for version control and collaboration, features a plethora of datasets, including the Social Network Ads Dataset. Sampson (unpublished PhD dissertation, 1968). An analysis on the github social network dataset. ) and friendships The "Social Network Ads Dataset" contains information on users' demographic details and purchase behavior. This A collection of social network datasets for teaching with tools like Gephi - melaniewalsh/sample-social-network-datasets The Dataset contains information about users on a Social Networking site and using that information as features for our ML model, the model predicts whether a particular user after clicking on an ad on the Social networking site goes on to buy a particular product or not. Just like humans, dolphins group have their own social connection with each member. 75 & test set = 0. Methods and Applications, by Stanley Wasserman and Katherine Faust (1994). This project involves analyzing the Pokec Social Network dataset using concepts from discrete mathematics. It focuses on measuring the degree of centrality in a graph at different time intervals, identifying influential nodes, and visualizing the results. The Dataset contains information about users on a Social Networking site and using that information as features for our ML model, the model predicts whether a particular user after clicking on an ad on the Social networking site goes on to buy a particular product or not. This dataset was collected by analyzing ego networks on Facebook, where an ego network is defined as a focal node (the ego) and all the nodes (friends) connected to it, along with the links (friendships) between MuMiN: A large-scale multilingual multimodal fact-checked misinformation social network dataset. We used the Reddit dataset, leveraging subreddit interactions to classify communities and detect GitHub is where people build software. If you run into any trouble or have questions consult our discussions. The machine learning tasks related to the graph are count data The edges described in the problem statement could be of any form: friendship, collaboration, following or mutual interests. https://snap. Place the files in your corresponding directory. This repository provides social network data for the study of Chinese Buddhist history. com/c/predict-who-is-more-influential-in-a-social-network/overview. Add a description, image, and links to the social-network-dataset topic page so that developers can more easily learn about it A social network analysis project on the Facebook dataset from SNAP Stanford, focusing on community detection, centrality measures, and social behaviors. The first interactive network data repository with visual analytic tools The largest network data repository with thousands of network data sets Interactive network visualization and mining Download thousands of real-world network datasets: from biological to social networks Animal Social Networks Repository. html - CocoNautty/Github-Social-Network-Analysis This project demonstrates the application of spectral clustering (a graph-based clustering method) to identify communities in social networks using the Facebook Social Circles Dataset. These datasets are ideal for brand awareness, consumer sentiment analysis, and for tracking social me About Friends Recommendation and Link Prediction in Social Netowork machine-learning facebook social-network dataset networkx recommender-system social-network-analysis network-embedding datamining link-prediction graphalgorithm networkx-graph linkprediction networkx-drawing-utilities friends-recommender Readme EECE 5645 Project: Performing community detection on Reddit Hyperlink network dataset and leverage the power of Spark and GraphFrames - kedarghule/Community-Detection-in-Social-Networks A collection of social network datasets for teaching with tools like Gephi - melaniewalsh/sample-social-network-datasets GitHub is where people build software. The dataset includes features such as age, gender, participation In this talk I will present network theory and application of building and analyzing social networks for practical use-cases in Python with NetworkX. Generating social networks with LLMs This repo contains code and results for the paper "LLMs generate structurally realistic social networks but overestimate political homophily", by Serina Chang*, Alicja Chaszczewicz*, Emma Wang, Maya Josifovska, Emma Pierson, and Jure Leskovec (ICWSM 2025). The dataset represents a social network as an undirected graph, where nodes are users and edges represent friendships. About Collection of graphs with communities and ground truth partition clustering graphs community-detection dataset classification ground-truth-partition Readme MIT license The Dataset contains information about users on a Social Networking site and using that information as features for our ML model, the model predicts whether a particular user after clicking on an ad on the Social networking site goes on to buy a particular product or not. Add a description, image, and links to the multilayer-social-network-dataset topic page so that developers can more easily learn about it multimodal social media content (text, image) classification - firojalam/multimodal_social_media This repository is related to my final year project which explores sentiment and social network analysis in the context of social media platforms. Apr 8, 2024 · This project analyzes CTU-13 dataset network traffic by creating visual graphs and calculating key graph attributes, such as degree and centrality, to explore network behavior and interactions. Accessing the dataset via GitHub is straightforward, allowing for easy integration into various data analysis workflows. It provides diversified entities and relations on the Twitter network, and has considerably better annotation quality than most existing datasets (Feng et al, 2018). 6 million users. A Google+ member can add any other member to his circles, creating a directed social graph. - yzhouli/SocialNet A Novitiate in a Period of Change: An Experimental and Case Study of Social Relationships, by Samuel F. Nov 10, 2024 · A sample dataset of over 1000 Xing social network , extracted using the Bright Data API, ideal for lead generation, CRM enrichment, investment opportunities, and talent recruitment. You can see the full documentation of NetworkX HERE GitHub Social Network - graph based dataset consisting of Nodes and Edges. Update the file paths in the source code if needed. We would like to show you a description here but the site won’t allow us. Predicting hidden links in a social network group formed by terrorists along with Performing the analysis and visualization of the centrality network on educational data sets. However, the utility of the method largely rests on being able to ascribe meaning to the structure of the network. The dataset files (soc-pokec-profiles. Each check-in is associated with its time stamp, its GPS coordinates and its semantic meaning. txt) are too large to be uploaded to GitHub. A collection of multiple social media dataset samples. The dataset contains a list of all of links, where a link represents frequent associations between Contribute to Awadelrahman/GNN4SocialNWTutorial development by creating an account on GitHub. Contribute to bansallab/asnr development by creating an account on GitHub. Each sample contains over 1,000 records. The code also visualizes the dataset. AUTHORS: Justin Kim Syed Muhammad Sabih Louis Mitchell About the dataset: A social network of Twitch users which was collected from the public API in Spring 2018. This repository contains a comprehensive analysis and graph neural network-based classification of the GitHub Social Network dataset. You can access the dataset from: https://www. The package contains a large collection of network dataset with different context. ML1m: The ML-1M dataset is a movie rating dataset that contains one million ratings from users on various movies. Get started with the Quick Start and follow the Tutorials. - yzhouli/SocialNet The Social Network Ads Dataset contains user demographics, including gender, age, and purchase behavior. It is a CLASSIFICATION PROBLEM as the output says whether the user buys th… Social network of Github developers. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects. J. Social network of Deezer users from Europe. The present repository contains the datasets relating to trust networks of two social networking websites named BitCoin and Advogato - NM001007/Social-Trust-Network-Datasets A collection of social network datasets for teaching with tools like Gephi - melaniewalsh/sample-social-network-datasets Exploring a dataset from Kaggle containing social network data. The full dataset, which you can access as a Google Sheet or on GitHub also includes information about organizations and Implementation and exploration of some algorithms related to social opinion analysis and mining Mainly includes. The Dataset contains information about users on a Social Networking site and using that information as features for our ML model, the model predicts whether a particular user after clicking on an a Social Network Analysis In this prcatice we will use NetworkX. If you use Git (Torvalds et al. Pokec is Slovakia's most popular online social network, with over 1. We used a dedicated crawler to obtain this dataset. Oklahoma: Oklahoma is a dataset composed of social networks of the University of Oklahoma. Social network analysis (SNA) is the process of investigating social structures through the use of networks and graph theory. They communicate with their group by ultrasonic, which can help them exchange information and divide their work and make decisions. The dataset used includes information about individuals' interests, names, and social platform usage. Wikipedia page network with traffic information. This repository contains sample social network datasets specifically collected and formatted for teaching with Gephi. Utilizing graph traversal algorithms and visualizations to reveal influential nodes, community clusters, and connectivity patterns within the network. kaggle. About Social Network Analysis Project based on R programming to conduct an in-depth analysis of a social network dataset. ANIMAL SOCIAL NETWORK REPOSITORY A repository of interaction data from published studies of wild, captive, and domesticated animals K-Nearest Neighbors for the Social Network Ads dataset - knn. NPTEL (National Programme on Technology Enhanced Learning) Social Networks - This is a public dataset for network things. The analysis involves feature engineering, including encoding categorical data, and visualizing the results using Contribute to VINOTH1996568/SOCIAL-NETWORK-DATASET development by creating an account on GitHub. 5 years from January 2014 through April 2017. xfa cetogw fitcvh tukpo msmhz czup esux aee kmasta ronnbh