# Introduction

In this work, we take on the fascinating task of analyzing the *One Piece* network.


# Filtering

One of the main challenges in understanding the universe of *One Piece* is the sheer number of characters that appear throughout the story. Many of these characters are truly significant, with their influence and appearances spanning the entire series. However, there are many others who serve merely as spectators to the events, contributing little to the narrative or the broader network of relationships.

To better understand the network, a method was designed to filter out less important characters based on a combination of four metrics: `Out-References`, `Word Count`, `In-References`, and `Intersection References`. These metrics respectively represent:

1. The number of other characters mentioned in the character's wiki entry.
2. The length of the text of the character's wiki entry.
3. The number of other characters whose wiki entries mention this character.
4. The number of characters that this character mentions and that, in turn, also mention this character.

![image.png](attachment:image.png)
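As a rough illustration of the four definitions above, the sketch below computes them from a mention map. The function name `compute_metrics`, the input shapes, and the toy data are all assumptions made for this example; they are not taken from the actual pipeline.

```python
from collections import defaultdict

def compute_metrics(mentions, word_counts):
    """Return the four metrics for every character.

    mentions:    {character: set of characters named in that character's wiki entry}
    word_counts: {character: length of that character's wiki entry}
    Both input shapes are assumptions made for this sketch.
    """
    # Invert the mention map to find who mentions each character.
    mentioned_by = defaultdict(set)
    for source, targets in mentions.items():
        for target in targets:
            if target != source:  # ignore self-mentions
                mentioned_by[target].add(source)

    metrics = {}
    for name, targets in mentions.items():
        out_refs = targets - {name}
        in_refs = mentioned_by[name]
        metrics[name] = {
            "Out-References": len(out_refs),
            "Word Count": word_counts.get(name, 0),
            "In-References": len(in_refs),
            # Characters this one mentions that also mention it back.
            "Intersection References": len(out_refs & in_refs),
        }
    return metrics

# Toy data, invented purely for illustration.
toy_mentions = {
    "Luffy": {"Zoro", "Nami"},
    "Zoro": {"Luffy"},
    "Nami": {"Luffy", "Zoro"},
    "Gaimon": {"Luffy"},
}
toy_word_counts = {"Luffy": 9000, "Zoro": 7000, "Nami": 6000, "Gaimon": 400}
metrics = compute_metrics(toy_mentions, toy_word_counts)
```

In this toy example, Gaimon mentions Luffy but nobody mentions Gaimon, so his `In-References` and `Intersection References` are both zero.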

For instance:

1. **Monkey D. Luffy** might score exceptionally high in the `In-References` metric due to his central role in the narrative and being mentioned by numerous other characters. His importance in driving the story and connecting with key characters naturally places him at the top of this metric.

2. A less central character, like **Bellamy**, might have a moderately high `Out-References` count due to his interactions with many characters, such as during the Jaya and Dressrosa arcs. However, Bellamy ranks significantly lower on metrics like `In-References` or `Intersection References` because he is not frequently referenced by others and takes part in few reciprocal relationships.

3. **Dracule Mihawk** could rank exceptionally high in `Word Count` due to the depth of his backstory, detailed descriptions of his abilities, and his significance as a Shichibukai. However, his `In-References` and `Out-References` might not be as high because he interacts with fewer characters directly, reflecting his solitary nature.

4. A character like **Buggy the Clown** might excel in `Intersection References` because of his recurring role and mutual relationships with characters from different arcs (e.g., the Straw Hat Pirates, Shichibukai members, and Impel Down inmates). However, his `Word Count` might be lower than that of characters with long, detailed wikis, showing how the metrics highlight different aspects of character importance.

---

### Weighted Combination
The metrics are combined into a weighted sum, with certain attributes receiving greater weight due to their relevance. For example, **In-References** might be considered more indicative of importance, as being mentioned by others highlights a character's influence in the story. This weighted approach ensures that key dimensions of character relevance are prioritized.

As a result, a **score value** is calculated for each character. For each of the four metrics, the characters are sorted in ascending order, and each character's position (index) in that ordering is its rank for that metric. The score is the weighted sum of these four ranks. For example:

- The character with the highest `In-References` is placed last (highest index) in the ordering for that attribute.
- That index is multiplied by the weight of the attribute (e.g., weight = 1 for `In-References`) and added to the weighted indices obtained from the other three metrics.

The character with the **highest overall score** is deemed the most important in the *One Piece* universe.
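The rank-based score described above can be sketched as follows. The weights shown are placeholders, since only the weight of 1 for `In-References` is given as an example, and the tie-breaking rule (input order, via a stable sort) is an assumption of this sketch.

```python
def rank_scores(metrics, weights):
    """Weighted sum of each character's ascending rank in every metric."""
    names = list(metrics)
    scores = {name: 0.0 for name in names}
    for metric, weight in weights.items():
        # Ascending order: the largest value gets the highest index.
        # Ties keep their input order (assumption of this sketch).
        ordered = sorted(names, key=lambda n: metrics[n][metric])
        for index, name in enumerate(ordered):
            scores[name] += weight * index
    return scores

# Toy metric values and placeholder weights, invented for illustration.
toy_metrics = {
    "Luffy":  {"Out-References": 40, "Word Count": 9000,
               "In-References": 120, "Intersection References": 35},
    "Zoro":   {"Out-References": 25, "Word Count": 7000,
               "In-References": 60, "Intersection References": 20},
    "Nami":   {"Out-References": 30, "Word Count": 6000,
               "In-References": 45, "Intersection References": 22},
    "Gaimon": {"Out-References": 3, "Word Count": 400,
               "In-References": 2, "Intersection References": 1},
}
placeholder_weights = {
    "Out-References": 0.5,
    "Word Count": 0.5,
    "In-References": 1.0,
    "Intersection References": 0.5,
}
scores = rank_scores(toy_metrics, placeholder_weights)
most_important = max(scores, key=scores.get)
```

Using ranks rather than raw values keeps a metric with a large numeric range, such as `Word Count`, from dominating the sum.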

![image-2.png](attachment:image-2.png)

---

### Filtering with a Threshold
To determine the optimal threshold for filtering characters, input was sought from recognized experts in the field. The goal was to remove as many minor characters as possible while ensuring no significant character was excluded. 

A specific **threshold value** was selected that balances these criteria. As a result:

- 639 important characters were identified, constituting **43% of the original total**. 
- This result aligns with the nature of *One Piece*, a series known for introducing numerous tertiary and quaternary characters who contribute little to the main narrative.

The filtering process produced smoother distributions, emphasizing key characters over the vast background cast.
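A minimal sketch of the thresholding step, assuming scores are stored in a dictionary and that a character is kept when its score is at least the threshold. The threshold and scores below are invented for the example; the value actually used is not stated here.

```python
def filter_by_threshold(scores, threshold):
    """Return the set of kept characters and the fraction of the cast they represent."""
    kept = {name for name, score in scores.items() if score >= threshold}
    return kept, len(kept) / len(scores)

# Toy scores and a hypothetical threshold, for illustration only.
toy_scores = {"Luffy": 7.5, "Zoro": 4.0, "Nami": 3.5, "Gaimon": 0.0}
kept, kept_fraction = filter_by_threshold(toy_scores, threshold=3.0)
```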

![image-3.png](attachment:image-3.png)

---

### Updating the Network
Subsequently, the character data was updated. Mentions of discarded characters were no longer considered in future iterations of the network creation process. This ensures a cleaner, more accurate representation of character relationships in the refined network.
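The update can be sketched as a pruning of the mention map, so that discarded characters disappear both as sources and as targets. The function name `prune_mentions` and the data layout are assumptions made for this example.

```python
def prune_mentions(mentions, kept):
    """Drop discarded characters and every mention that points at them."""
    return {
        name: {target for target in targets if target in kept}
        for name, targets in mentions.items()
        if name in kept
    }

# Toy data, invented for illustration.
toy_mentions = {
    "Luffy": {"Zoro", "Nami", "Gaimon"},
    "Zoro": {"Luffy"},
    "Nami": {"Luffy", "Zoro"},
    "Gaimon": {"Luffy"},
}
kept_characters = {"Luffy", "Zoro", "Nami"}
pruned = prune_mentions(toy_mentions, kept_characters)
```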



# Inner Network: Rapid Growth and Localized Connections

As mentioned in the introduction, the structure of *One Piece* is built around various arcs, each of which introduces a new world of characters and relationships. From a network theory perspective, these arcs create an intricate and dynamic system of connections that evolves rapidly, often in seemingly random ways. 

Typically, when the protagonists—the Straw Hat Pirates—arrive at a new island where the arc unfolds, they split into subgroups to carry out specific tasks. Each subgroup interacts with different sets of characters, forming localized networks within the larger narrative structure. These interactions are often unique to each subgroup, with limited overlap at first, as they encounter allies, enemies, and neutral parties in their respective journeys.

By the end of the arc, however, the narrative usually culminates in a convergence of the subgroups, reuniting the Straw Hats and integrating the characters they’ve individually encountered into a single, interconnected web. This results in a burst of new connections within the network, as previously disparate nodes (characters) become linked through shared events, battles, or alliances.

The **inner network** of each arc is constructed by creating a link between two characters if either one mentions the other. This permissive criterion captures one-sided mentions as well as reciprocal ones, giving a comprehensive representation of the interactions within the arc. It also lets the network grow rapidly and encompass the diverse cast of characters and relationships introduced in each arc.
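The "either one mentions the other" rule can be sketched with plain sets. The sketch assumes the mention map has already been restricted to the characters of one arc; the function name and toy data are hypothetical.

```python
def inner_edges(mentions):
    """Undirected edges: two characters are linked if either mentions the other."""
    edges = set()
    for source, targets in mentions.items():
        for target in targets:
            if target != source:  # skip self-mentions
                # frozenset makes (a, b) and (b, a) the same edge.
                edges.add(frozenset((source, target)))
    return edges

# Toy mention map for a single arc, invented for illustration.
toy_arc_mentions = {
    "Luffy": {"Zoro"},
    "Zoro": {"Luffy"},
    "Nami": {"Luffy"},
    "Gaimon": set(),
}
arc_edges = inner_edges(toy_arc_mentions)
```

Here the one-sided mention from Nami to Luffy is enough to create an edge, while Gaimon, who neither mentions nor is mentioned, stays isolated.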

This arc-driven expansion and integration of relationships result in a network that grows rapidly in complexity with each new arc. However, this growth is not uniform. Certain characters and relationships become central to the network, while others remain peripheral or fleeting. This disparity reflects the varying roles characters play in the story—some become pivotal to the main narrative, while others exist as temporary connections, enriching the local network of a particular arc but having limited influence beyond it.

Understanding the structure and dynamics of this inner network is key to identifying which characters hold lasting significance within the story’s broader universe and which ones serve more localized, transient roles. This layered and hierarchical growth of the network highlights the narrative's complexity and emphasizes the importance of filtering techniques to distinguish between core and peripheral nodes.




# External Network: Gradual Growth and Persistent Relationships

In contrast to the rapidly expanding inner network within individual arcs, the external network represents the overarching storyline that connects the arcs together. This network is more stable and grows at a slower pace because it focuses on the relationships and interactions among the main characters and recurring elements of the series. 

From a network theory perspective, the external network primarily consists of the core nodes—main characters such as the Straw Hat Pirates—and the persistent connections between them that endure across arcs. These connections reflect the long-term dynamics of the story, such as the relationships among the crew members, their evolving bond with allies like Trafalgar Law or Vivi, and the recurring adversaries or thematic antagonists like the World Government, Marines, or Yonko.

Unlike the inner network, where many new nodes and edges (characters and relationships) are introduced rapidly and often discarded after the arc, the external network retains only a subset of these relationships. Specifically, connections that remain relevant to the larger narrative are incorporated, while arc-specific interactions are typically excluded once the arc concludes. 

The **external network** is constructed more conservatively: a link is created between two characters only if they mention each other. This stricter criterion ensures that only the most meaningful, reciprocal relationships are included, reflecting the enduring significance of these connections in the broader storyline. For example:
 
- Alliances and rivalries between key factions like the Straw Hats, the Revolutionary Army, and the Yonko remain part of the external network.
- Relationships with characters who play a temporary role in a single arc, such as the inhabitants of Drum Island or Skypiea, are typically omitted unless they have a broader impact on the main story.
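The stricter reciprocal criterion can be sketched in the same style: an edge is kept only when each character mentions the other. The function name and toy data are assumptions made for this example.

```python
def external_edges(mentions):
    """Undirected edges: two characters are linked only if they mention each other."""
    edges = set()
    for source, targets in mentions.items():
        for target in targets:
            mentioned_back = source in mentions.get(target, set())
            if target != source and mentioned_back:
                edges.add(frozenset((source, target)))
    return edges

# Toy mention map, invented for illustration.
toy_mentions = {
    "Luffy": {"Zoro"},
    "Zoro": {"Luffy"},
    "Nami": {"Luffy"},
    "Gaimon": set(),
}
mutual_edges = external_edges(toy_mentions)
```

Because Luffy does not mention Nami back in this toy data, the Nami and Luffy pair is excluded, so the result is a subset of what the "either one mentions the other" rule would produce on the same map.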

This selective incorporation results in a more gradual growth of the external network. Each arc adds a limited number of new nodes and edges, representing significant plot developments, major alliances, or enduring rivalries. For instance, while the Dressrosa arc introduces numerous new characters in its inner network, only a few, like Bartolomeo and the alliance with the Tontatta tribe, become lasting elements of the external network.

Overall, the external network reflects the skeleton of the *One Piece* narrative, showing the enduring relationships and overarching themes that connect the arcs. Its slower growth rate and stability contrast sharply with the volatile and expansive nature of the inner network, highlighting the layered complexity of the series' storytelling.

# Louvain Communities

# NLP

# Network distribution