GitHub - geektrust/python-property-recommendation

This is the starter kit for the problem statement property recommendation

Problem Statement

Property Recommendation Scoring

You are building a content-based recommendation engine for a real estate platform. The platform records user interaction events (views, saves, enquiries) on property listings. Your engine must infer each user's preferences from their interaction history and score every unseen property to surface the most relevant recommendations.

Each property has the following fields:

{
    "property_id": str,
    "suburb": str,
    "property_type": str,   # one of: "house", "apartment", "townhouse"
    "bedrooms": int,
    "price": float,
    "has_parking": bool,
}

Each interaction event has the following fields:

{
    "user_id": str,
    "property_id": str,
    "event_type": str,      # one of: "view", "save", "enquiry"
    "event_time": str,      # ISO-8601 UTC, e.g. "2025-06-01T09:00:00Z"
}

Requirements

Interaction weights — each event type carries a different signal strength:
- view = 1.0
- save = 2.0
- enquiry = 3.0
Build a per-user preference profile by aggregating the features of every property the user has interacted with, weighted by event type:
- Weighted suburb counts
- Weighted property type counts
- Weighted average number of bedrooms
- Weighted average price
- Weighted parking preference (sum of weights for has_parking=True vs False)
Exclude seen properties — never recommend a property the user has already interacted with (any event type).
Score each unseen property using five sub-scores, then average them:
- suburb_score: weighted_suburb_count / total_weighted_interactions (0 if suburb not in history)
- type_score: weighted_type_count / total_weighted_interactions (0 if type not in history)
- bedroom_score: max(0.0, 1.0 - |property_bedrooms - user_weighted_avg_bedrooms| / max_bedrooms_in_dataset)
- price_score: max(0.0, 1.0 - |property_price - user_weighted_avg_price| / max_price_in_dataset)
- parking_score: 1.0 if has_parking matches the user's weighted majority preference, else 0.0
```
final_score = round((suburb_score + type_score + bedroom_score + price_score + parking_score) / 5.0, 6)
```
Return top-N recommendations per user, sorted by score descending, then property_id ascending as a tiebreaker.

Return a list of dictionaries with exactly these keys:

{
    "user_id": str,
    "recommendations": [
        {
            "property_id": str,
            "score": float,   # rounded to 6 decimal places
        },
        ...
    ]
}

Sort output by user_id ascending.
Users with no interaction history must be excluded from the output.
If fewer than top_n unseen properties exist for a user, return only what is available.

How to execute

./run.sh "{\"properties\": [{\"property_id\": \"p1\", \"suburb\": \"Richmond\", \"property_type\": \"apartment\", \"bedrooms\": 2, \"price\": 650000, \"has_parking\": true}], \"events\": [{\"user_id\": \"u1\", \"property_id\": \"p1\", \"event_type\": \"view\", \"event_time\": \"2025-06-01T09:00:00Z\"}], \"top_n\": 2}"

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.vscode		.vscode
.gitignore		.gitignore
.gtignore		.gtignore
README.md		README.md
gt_main_wrapper.py		gt_main_wrapper.py
main.py		main.py
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Problem Statement

Property Recommendation Scoring

Requirements

How to execute

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Problem Statement

Property Recommendation Scoring

Requirements

How to execute

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages