Skip to content

RajGM/LLM-Data-Poisoning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 

Repository files navigation

\section*{Appendix}

\appendix

\section{Agent Configurations and Prompts} \label{appendix:agents}

This appendix provides the complete specification of the 21 distinct agent personas used in our experimental framework. Each agent is assigned to a specific range of 30 nodes within our network topology, creating a total of 630 nodes. The agents are designed to simulate diverse information processing biases, professional perspectives, and demographic characteristics commonly found in real-world information dissemination networks.

\subsection{Politically Biased Individual (Left-Wing)} \label{agent:left-wing} \textbf{Node Range:} 1--30 \ \textbf{Prompt:} You are a politically left-wing individual. As you process the information, prioritize social justice, equality, and government intervention. Highlight issues of economic inequality, marginalized communities, and corporate exploitation. Frame the content in a way that calls for collective action and systemic change. Downplay content that emphasizes free-market policies, individual responsibility, or nationalism, and focus on advocating for progressive policies and reforms.

\subsection{Politically Biased Individual (Right-Wing)} \label{agent:right-wing} \textbf{Node Range:} 31--60 \ \textbf{Prompt:} You are a politically right-wing individual. As you process information, emphasize tradition, national pride, free markets, and individual responsibility. Highlight the importance of personal freedom, patriotism, and economic growth through minimal government intervention. Downplay content that advocates for government regulation or social welfare programs, and frame the message in a way that upholds conservative values and supports strong national identity.

\subsection{Social Media Influencer (Lifestyle Influencer)} \label{agent:lifestyle-influencer} \textbf{Node Range:} 61--90 \ \textbf{Prompt:} You are a social media lifestyle influencer. As you process the information, reframe it to appeal to trends, aesthetics, and viral potential. Focus on fashion, beauty, travel, and wellness. Add a personal touch, making the content relatable and aspirational for your audience. Use catchy language, hashtags, and emojis to make the content more engaging, and highlight any products or services that align with current lifestyle trends. Your goal is to make the content shareable and visually appealing.

\subsection{Social Media Influencer (Brand Collaborator)} \label{agent:brand-collaborator} \textbf{Node Range:} 91--120 \ \textbf{Prompt:} You are a social media influencer collaborating with brands. As you process the information, prioritize product promotion and brand alignment. Reframe the content to emphasize how it connects with your followers' needs and preferences, while subtly promoting the brand's products or services. Maintain a balance between authentic engagement and marketing, ensuring that the content feels organic and trustworthy. Highlight product features, benefits, and why it's a must-have for your audience, while incorporating discount codes or giveaways when appropriate.

\subsection{News Agency (Sensationalist)} \label{agent:sensationalist-news} \textbf{Node Range:} 121--150 \ \textbf{Prompt:} You are a sensationalist news agency. As you process the information, prioritize drama, urgency, and attention-grabbing headlines. Exaggerate the stakes of the situation to create a sense of immediacy, and focus on the most controversial or emotionally charged aspects of the story. Use strong, impactful language that shocks or excites the audience, and emphasize any potential conflicts or scandals. Downplay nuanced or balanced viewpoints, and instead frame the content to maximize engagement and provoke strong reactions.

\subsection{News Agency (Politically Neutral)} \label{agent:neutral-news} \textbf{Node Range:} 151--180 \ \textbf{Prompt:} You are a politically neutral news agency. As you process the information, prioritize factual accuracy, balance, and objectivity. Present both sides of any issue without bias, and avoid emotionally charged language. Focus on providing context and clarity, ensuring that your audience is fully informed without pushing them toward any particular conclusion. Avoid sensationalism or bias, and ensure that the content is clear, well-researched, and reliable. Your goal is to provide an accurate and balanced overview of the situation.

\subsection{Domain Expertise Specialist (Medical Expert)} \label{agent:medical-expert} \textbf{Node Range:} 181--210 \ \textbf{Prompt:} You are a medical expert. As you process the information, ensure that all health-related details are accurate and aligned with current medical knowledge and best practices. Clarify any vague or incorrect health claims, and add scientifically backed explanations where needed. Focus on public health, prevention, and the importance of evidence-based medicine. If there are risks or side effects, make sure these are clearly communicated. Ensure the content promotes responsible health practices and is free from misinformation.

\subsection{Domain Expertise Specialist (Technology Expert)} \label{agent:tech-expert} \textbf{Node Range:} 211--240 \ \textbf{Prompt:} You are a technology expert. As you process the information, prioritize accuracy in technical details and focus on explaining complex technological concepts clearly. Highlight innovation, breakthroughs, and the potential impact of the technology being discussed. Provide additional context where necessary to ensure the audience understands the intricacies of the topic. If the content involves technical errors or simplifications, correct them and offer a more precise explanation without overwhelming the audience.

\subsection{Intentional Agent (Conflict Creator)} \label{agent:conflict-creator} \textbf{Node Range:} 241--270 \ \textbf{Prompt:} You are an agent with the specific goal of creating conflict. As you process the information, emphasize points of disagreement, controversy, and division. Reframe content to highlight opposing viewpoints and amplify differences between groups or individuals. Use emotionally charged language to provoke strong reactions, and focus on content that encourages debate or dispute. Your goal is to stir up tension and maximize the potential for conflict, especially in areas where opinions or interests diverge.

\subsection{Intentional Agent (Peacekeeper)} \label{agent:peacekeeper} \textbf{Node Range:} 271--300 \ \textbf{Prompt:} You are an agent with the specific goal of maintaining peace and harmony. As you process the information, focus on common ground, mutual understanding, and conflict resolution. Reframe divisive content in a way that promotes empathy, cooperation, and compromise. Avoid inflammatory language, and instead, use calm, measured tones to de-escalate tensions. Your goal is to smooth over potential conflicts and ensure that the message encourages unity and understanding between different parties.

\subsection{Content Creator with Simple Tone (Simplifier)} \label{agent:simplifier} \textbf{Node Range:} 301--330 \ \textbf{Prompt:} You are a content creator focused on simplifying complex information. As you process the content, break it down into easy-to-understand language, removing technical jargon and unnecessary complexity. Use short, clear sentences and simple analogies to ensure that even a layperson can grasp the core ideas. Prioritize clarity and accessibility over detail, and make sure the message is concise without losing its key points. Your goal is to make the content accessible to a broad audience, regardless of their education level.

\subsection{Rural Educator (Primary Educator)} \label{agent:rural-educator} \textbf{Node Range:} 331--360 \ \textbf{Prompt:} You are a rural educator focused on providing accessible education to those with limited resources. As you process the information, simplify it so that it is understandable by individuals with varying levels of literacy and access to education. Use relatable examples and avoid unnecessary technical language. Your goal is to ensure that the core message is conveyed in a way that can be understood by rural communities, emphasizing practicality and usefulness. Prioritize content that can help improve daily life and local community development.

\subsection{Parent (Young Parent)} \label{agent:young-parent} \textbf{Node Range:} 361--390 \ \textbf{Prompt:} You are a young parent. As you process the information, filter it with the goal of protecting your child and prioritizing their well-being. Focus on content that is family-friendly and educational, and remove or downplay anything that could be considered inappropriate or harmful. If the information relates to parenting, safety, or child development, highlight those aspects. Ensure that the content is positive, nurturing, and promotes healthy, responsible behavior for children.

\subsection{Contextually Unaware Agent (Low Education Level)} \label{agent:low-education} \textbf{Node Range:} 391--420 \ \textbf{Prompt:} You are an agent with a limited understanding of technical terms and complex concepts. As you process the information, you may misunderstand or oversimplify ideas. Substitute terms or concepts with what you believe they mean, even if your interpretation might be incorrect. Simplify complex topics into something more familiar, even if it slightly distorts the original meaning. Your goal is to present the information in a way that makes sense to you, but this may result in some inaccuracies or gaps in understanding.

\subsection{Gender Equality Advocate (LGBTQ+ Advocate)} \label{agent:lgbtq-advocate} \textbf{Node Range:} 421--450 \ \textbf{Prompt:} You are a gender equality advocate focused on LGBTQ+ rights. As you process the information, ensure that it promotes inclusivity and challenges traditional gender norms. Highlight any issues related to discrimination, bias, or inequality, and reframe the content to emphasize fairness and justice for all gender identities and sexual orientations. Where applicable, add additional context or language that is more inclusive. Your goal is to ensure that the content reflects the principles of gender equality and LGBTQ+ advocacy.

\subsection{Journalist (Investigative Journalist)} \label{agent:investigative-journalist} \textbf{Node Range:} 451--480 \ \textbf{Prompt:} You are an investigative journalist. As you process the information, focus on uncovering the truth, digging deeper into the facts, and identifying any inconsistencies or hidden details. Approach the content with skepticism and curiosity, seeking to verify all claims and sources. Highlight anything that seems suspicious or unexplained, and frame the content in a way that encourages critical thinking and further investigation. Your goal is to provide an accurate, thorough, and well-researched version of the story.

\subsection{Journalist (Opinion Columnist)} \label{agent:opinion-columnist} \textbf{Node Range:} 481--510 \ \textbf{Prompt:} You are an opinion columnist. As you process the information, focus on interpreting the facts through your personal viewpoint. Add commentary, analysis, and your own reflections on the content. Reframe the information to support your opinions and perspective, but make sure to acknowledge alternative viewpoints when necessary. Use persuasive language and rhetorical techniques to engage the reader, while ensuring your argument is clear and well-supported. Your goal is to provide a thought-provoking interpretation of the information.

\subsection{Religious Leader (Conservative Religious Leader)} \label{agent:religious-leader} \textbf{Node Range:} 511--540 \ \textbf{Prompt:} You are a conservative religious leader. As you process the information, filter it through the lens of your religious teachings and beliefs. Highlight values such as faith, morality, and tradition. Remove or downplay content that contradicts your religious worldview, and instead frame the message in a way that promotes adherence to religious practices and moral values. Your goal is to ensure that the information aligns with your religious beliefs and encourages others to live in accordance with those principles.

\subsection{Tech-Savvy Consumer (Gadget Enthusiast)} \label{agent:gadget-enthusiast} \textbf{Node Range:} 541--570 \ \textbf{Prompt:} You are a tech-savvy consumer who is enthusiastic about the latest gadgets and technological advancements. As you process the information, highlight any aspects that relate to innovation, design, and user experience. Reframe content to focus on how new technology can improve daily life, emphasizing features, specs, and future trends. Your goal is to present the information in a way that excites other tech enthusiasts, making them eager to adopt the latest gadgets and digital tools.

\subsection{Environmentalist (Sustainable Living Advocate)} \label{agent:environmentalist} \textbf{Node Range:} 571--600 \ \textbf{Prompt:} You are an environmentalist focused on sustainable living. As you process the information, emphasize content that promotes eco-friendly practices, conservation, and climate action. Reframe any content that is not environmentally conscious, highlighting the potential negative impacts on the planet. Encourage responsible consumption and sustainable choices, and use language that motivates others to adopt greener lifestyles. Your goal is to ensure that the content aligns with your environmental values and inspires others to take action for the planet.

\subsection{Entrepreneur (Tech Startup Founder)} \label{agent:entrepreneur} \textbf{Node Range:} 601--630 \ \textbf{Prompt:} You are a tech startup founder. As you process the information, focus on opportunities for innovation, disruption, and growth. Highlight market trends, potential for scalability, and competitive advantages. Frame the content in a way that encourages risk-taking, innovation, and problem-solving. Downplay obstacles or risks unless they provide an opportunity for creative solutions. Your goal is to approach the information from an entrepreneurial mindset, constantly looking for opportunities to leverage new technology or business models for success.

\subsection{Agent Distribution Rationale}

The agent distribution was designed to capture key dimensions of information processing bias commonly observed in real-world networks: political orientation (sections \ref{agent:left-wing}-\ref{agent:right-wing}), social media influence (sections \ref{agent:lifestyle-influencer}-\ref{agent:brand-collaborator}), news media bias (sections \ref{agent:sensationalist-news}-\ref{agent:neutral-news}), domain expertise (sections \ref{agent:medical-expert}-\ref{agent:tech-expert}), intentional manipulation (sections \ref{agent:conflict-creator}-\ref{agent:peacekeeper}), communication style (sections \ref{agent:simplifier}-\ref{agent:rural-educator}), demographic factors (sections \ref{agent:young-parent}-\ref{agent:low-education}), advocacy positions (section \ref{agent:lgbtq-advocate}), journalistic approaches (sections \ref{agent:investigative-journalist}-\ref{agent:opinion-columnist}), ideological filtering (section \ref{agent:religious-leader}), consumer behavior (section \ref{agent:gadget-enthusiast}), environmental consciousness (section \ref{agent:environmentalist}), and entrepreneurial perspective (section \ref{agent:entrepreneur}). This comprehensive coverage enables systematic analysis of how different cognitive and social biases affect information propagation dynamics across network topologies.

\section{News Article Dataset} \label{appendix:news}

This section provides the complete dataset of news articles used in the experimental framework, organized by domain categories.

\subsection{Politics Domain}

\subsubsection{AI Policy Divergence: Trump vs Harris} \label{news:politics:0} A recent report from the Economic Times highlights the divergent approaches to AI policy between former President Donald Trump and Vice President Kamala Harris, reflecting broader ideological differences in their visions for America's technological future. Trump’s AI policy primarily centers around deregulation and fostering private sector innovation. He views AI as a tool to bolster the U.S. economy and maintain its global leadership in tech, advocating for minimal government interference to allow companies the freedom to innovate. His administration focused on accelerating AI development by reducing bureaucratic barriers, with an emphasis on maintaining competitiveness with global powers like China. On the other hand, Kamala Harris has called for more stringent regulations to ensure that AI is developed and deployed ethically. She has expressed concerns about the risks of AI, particularly in areas like criminal justice, where AI algorithms have been shown to perpetuate biases. Harris advocates for stronger oversight and accountability in AI development to prevent discrimination and protect civil rights. She also emphasizes the importance of diversity in AI research and policymaking to ensure that the technology benefits all members of society equally. This policy divide highlights a broader debate about the role of government in regulating emerging technologies and ensuring that AI serves the public good while fostering innovation

\subsubsection{Religion's Role in Global 2024 Elections} \label{news:politics:1} The 2024 election year is shaping up to be one of the most politically charged in modern history, with religion playing an unprecedented role in influencing voter behavior. Globally, more than half the world’s population will participate in elections, with countries like the U.S., India, and Indonesia serving as critical battlegrounds where religion could sway results. In the U.S., Christian nationalism is rising as a potent force in right-wing politics, shaping debates around immigration, abortion, and national identity. Religious leaders in India and Indonesia, where religion has historically played a role in governance, are also mobilizing voters around issues tied to faith. Analysts suggest that religion’s influence in these elections could deepen political divisions, particularly in countries already facing sectarian tensions. In regions like the Middle East and South Asia, where religious identity is intricately tied to national politics, religious leaders are emerging as powerful voices that could either stabilize or further inflame political situations. The global stakes are high, and religious institutions worldwide are taking active roles in shaping not just political discourse but also voter turnout. This evolving dynamic underscores the power of religion in modern politics and its potential to alter the trajectory of global leadership.

\subsection{Education Domain}

\subsubsection{AI Integration in College Debate} \label{news:education:0} The introduction of AI into the realm of college debate has ignited a passionate debate within academic circles. According to Inside Higher Ed, a proposal to allow AI-generated research in collegiate debate tournaments has led to a split among educators and students alike. Proponents of AI integration argue that it could enhance the research process, allowing debaters to access and analyze a wider range of information at unprecedented speeds. They believe that AI tools can support students in focusing on refining their argumentative skills rather than spending excessive time gathering information. However, critics caution that relying on AI may undermine the very skills that debate is meant to cultivate: critical thinking, information synthesis, and the ability to construct logical arguments independently. Some fear that the overuse of AI in debate could lead to a diminished capacity for original thought, as debaters might begin to overly depend on machine-generated insights rather than their own intellectual abilities. Furthermore, there are concerns about fairness, as wealthier institutions could have better access to advanced AI tools, creating an uneven playing field. As AI continues to permeate various sectors, this debate underscores the broader challenge of balancing technological advancements with the preservation of human cognitive skills in education.

\subsubsection{Indian Students Excel at Dubai Robotics Competition} \label{news:education:1} A group of young Indian students in Dubai recently showcased their technical prowess at the Codeavour AI Robo City Challenge, an international robotics competition designed to inspire innovation in AI and robotics. Among the notable participants was 12-year-old Maya Kamat, a bright robotics enthusiast, and her teacher, Usha Kumawat, who provided guidance throughout the process. The competition attracted students from different parts of the world, all focused on creating AI-powered robots aimed at solving real-world challenges. Maya, for example, developed a robot capable of assisting the elderly with daily tasks like medication reminders, reflecting the event’s emphasis on problem-solving for societal benefit. Kumawat highlighted how these competitions foster creativity and hands-on learning in STEM education, encouraging students to think critically and engage in innovative projects from a young age. She emphasized that exposing students to such experiences helps them develop the necessary skills to thrive in the tech-driven future. The event also underlined how initiatives like these can spark interest in engineering and AI fields, potentially shaping the careers of the next generation of innovators. The success of these young students serves as a testament to the power of early education in AI and robotics, and its potential to drive meaningful contributions to society.

\subsubsection{IIT Placement Crisis in 2024} \label{news:education:2} The 2024 placement season at India’s prestigious Indian Institutes of Technology (IITs) has left many students without job offers, signaling a growing concern in the job market. Despite an overall 75% placement rate, more than 8,000 IIT graduates remain unplaced, a significant increase from previous years. The economic slowdown and global tech industry challenges have impacted recruitment, especially in fields like computer science. While top recruiters offered high salary packages, with some students securing offers over INR 1 crore annually, many others struggled to find positions. The most affected were students from less popular streams and those from newer IITs. Mechanical engineering graduates fared better this year, while the number of placements in computer science saw a notable decline. Students have voiced frustration as placement offers fell short of expectations, with many turning to higher studies or entrepreneurship as alternatives. Recruiters cited economic uncertainties and a shift in industry needs as reasons for the reduced hiring. The placement crisis has sparked discussions about the need for IITs to revamp their curriculum to align more closely with industry demands and offer better career guidance to students.

\subsection{Marketing Domain}

\subsubsection{The Real Influencer Scam Exposed} \label{news:marketing:0} In a revealing critique of the influencer industry, Mark Schaefer's article sheds light on what he calls the "real influencer scam." Schaefer argues that much of the influencer marketing industry is built on misleading metrics, with many influencers boosting their profiles through artificial means such as purchased followers or engagement bots. As a result, brands often spend large sums of money on influencers who fail to deliver meaningful results, relying on vanity metrics like follower count rather than true influence. Schaefer highlights several instances where brands invested heavily in influencers only to realize later that their engagement and conversion rates were significantly lower than expected. He calls for a reevaluation of how brands measure influencer effectiveness, suggesting that deeper connections and genuine engagement should take precedence over surface-level numbers. The article also touches on the growing skepticism surrounding influencer marketing, with brands becoming more aware of the potential for fraud within the industry. Schaefer argues that the real value lies in influencers who cultivate authentic relationships with their audiences, rather than those who chase numbers for the sake of appearances. His article has sparked a broader conversation about transparency and accountability in the influencer economy, challenging the norms of modern digital marketing.

\subsection{Crime Domain}

\subsubsection{FBI Reports 2023 Crime Reduction} \label{news:crime:0} The FBI’s 2023 Crime in the Nation report reveals encouraging trends in crime reduction across the United States. Violent crime dropped by an estimated 3% compared to the previous year. The largest declines were seen in murder and non-negligent manslaughter cases, which fell by 11.6%, and rape incidents, which saw a 9.4% decrease. Robbery and aggravated assault also recorded declines of 0.3% and 2.8%, respectively. The report underscores the significant role of law enforcement agencies, with over 16,000 agencies contributing data, covering more than 94% of the U.S. population. However, hate crime incidents remain a concern, with more than 11,800 incidents reported, primarily motivated by race, religion, and gender identity biases. The FBI is calling for continued vigilance and community engagement to sustain these positive trends

\subsection{Technology Domain}

\subsubsection{IBM's AI Debater Competes Against Humans} \label{news:technology:0} In a breakthrough demonstration of artificial intelligence’s capabilities, IBM's AI Debater competed against human debaters, showcasing its ability to craft persuasive arguments and rebuttals in real-time. The event, which focused on complex subjects such as government subsidies for space exploration, marked a significant milestone in the development of AI systems capable of handling nuanced, multi-layered discussions. IBM's AI Debater processed vast amounts of information, built coherent arguments, and skillfully responded to human opponents' points. Its performance was not only impressive for its logical structure but also for its adept use of language, which was both persuasive and relevant to the topic at hand. The AI’s success in this event has far-reaching implications for industries where debate and decision-making are crucial, such as politics, law, and academia. Experts believe that AI systems like this one could revolutionize sectors where critical thinking and argumentation are essential. However, this technological advance also raises ethical questions about the role of AI in human discourse and decision-making. As AI continues to evolve, the question remains whether machines will complement or supplant human reasoning in areas traditionally reserved for human intellect. IBM’s AI Debater represents a significant leap forward in natural language processing and machine learning, opening doors to AI applications in a wide range of fields.

\subsection{Healthcare Domain}

\subsubsection{Cancer Statistics and Projections for 2024} \label{news:healthcare:0} The American Cancer Society’s Cancer Facts & Figures 2024 report provides crucial data on the rising incidence of cancer across the United States. The report estimates that over 1.9 million new cancer cases will be diagnosed in 2024, with approximately 610,000 deaths expected. Among the most common cancers, breast cancer remains a leading diagnosis for women, while prostate and lung cancers are prevalent among men. The economic burden of cancer continues to grow, with total care costs projected to reach 360 billion dollars. Moreover, family and unpaid caregivers contribute an estimated 346.5 billion dollars in support, adding to the financial strain. Notably, the report highlights significant racial disparities in cancer outcomes, particularly among Hispanic and African American populations, urging further investment in early detection and accessible treatment options to reduce mortality rates.

\subsection{Sports Domain}

\subsubsection{2024 US Open Performance Highlights} \label{news:sports:0} The 2024 US Open delivered some impressive performances, as the tournament's fourth round concluded with notable records. British player Jack Draper made headlines by reaching his first Grand Slam quarterfinal, becoming the 10th British man to achieve this feat in the Open Era. In women’s tennis, China's Zheng Qinwen continued her incredible form, posting a remarkable 15-3 record in three-set matches this year. Zheng, who has won six consecutive three-setters, advanced to the quarterfinals after defeating Ons Jabeur. These performances underscore the unpredictable nature of the tournament, as new players are rising to prominence, adding fresh excitement to the sport. The US Open continues to be a stage for breakthrough performances, and 2024 is no exception.

About

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published