In [1]:
faqs = """About the Program

What is the course fee for the Data Science & AI Mentorship Program (DSAIM 2025)?
The program follows a monthly subscription model where learners pay Rs 850 per month. This allows students from all over Pakistan to join at an affordable cost without large upfront payments.

What is the total duration of the course?
The total duration is 7 months, with a complete learning roadmap covering Python, Machine Learning, and real-world case studies. Hence, the total course fee is around Rs 5950 for the entire program.

What is included in the mentorship program?
We cover both theoretical and practical modules to ensure industry readiness:

Python Fundamentals

Data Science Libraries (NumPy, Pandas, Matplotlib, Seaborn)

Data Cleaning and Analysis

SQL for Data Science

Statistics & Mathematics for ML

Machine Learning Algorithms

MLOps and Deployment

Portfolio & Case Studies

You can check the detailed outline on our website: https://datasciencepakistan.org/programs

Will Deep Learning and NLP be covered?
Yes, in the Advanced Module (optional), Deep Learning and Natural Language Processing will be introduced with TensorFlow and Hugging Face.

What if I miss a live session? Will I get recordings?
Yes, all live sessions are recorded and uploaded to the Learning Management System (LMS). You can access them anytime within your active subscription.

Where can I find the class schedule?
You can view the monthly schedule in the shared Google Sheet here:
https://docs.google.com/spreadsheets/d/PKDSAIM2025

What is the average time duration of each live class?
Each live class runs for 2 hours, including Q&A and hands-on practice sessions.

Which language will be used during sessions?
The sessions will be conducted in Urdu and English (mixed), making them easy to follow for all learners.

How will students be informed about upcoming sessions?
Registered learners receive an email and WhatsApp message before every session with the joining link and resources.

Can students from non-technical backgrounds join?
Absolutely. The program starts from scratch and assumes no prior programming background.

Can I join the program in the middle?
Yes, students can join anytime. The LMS access gives you recordings of all previous sessions instantly after payment.

Will I get access to old content if I join late?
Yes, you will get full access to all previously recorded modules and assignments once you activate your subscription.

Do we have to submit tasks and projects?
Yes. Weekly assignments and a final capstone project are part of the evaluation process. Submissions are done through the LMS dashboard.

Will we work on real case studies?
Yes, every module includes a case study — such as analyzing Pakistan Stock Exchange data, weather data, or COVID-19 trends in Pakistan.

Where can we reach for technical or course-related support?
You can contact us at support@datasciencepakistan.org
 or through the “Help” section on your LMS dashboard.

Payment & Registration

Where do we make payments?
Payments can be made through our official website:
https://datasciencepakistan.org

We accept JazzCash, Easypaisa, and debit/credit card payments.

Can I pay the full amount at once?
Yes, you can choose to pay all seven months together to receive a 10% discount.

What is the monthly validity period?
Each subscription is valid for 30 days from the date of payment. If you join on 10th January, your next payment is due by 10th February.

What if I am not satisfied with the course after payment?
We offer a 7-day refund policy. You can email us within seven days of payment to request a refund, no questions asked.

I live outside Pakistan and cannot pay via local gateways. What should I do?
You can contact our team at support@datasciencepakistan.org
 to receive an international payment link via PayPal or Stripe.

Will I still get updates if I haven’t made the latest payment?
You’ll continue receiving newsletters, but your LMS access will be paused until the next payment is made.

After Registration

How long can I watch paid videos after purchase?
You can view all videos for the duration of your subscription period. Once the entire course fee is paid, you’ll retain access to the content until August 2026.

Why don’t you provide lifetime access?
We keep costs low and update our curriculum every year. Lifetime access is not feasible at this price.

How do I ask questions after the session?
A Google Form is available in your LMS dashboard for submitting doubts. Our mentors respond via one-on-one Zoom sessions every Friday.

If I join late, can I still ask previous session doubts?
Yes, just select the module and week number in the doubt form.

What if I lose access to the LMS?
You can recover your account anytime using your registered email. Technical support is available 24/7.

Certificate & Career Support

What is the certificate eligibility criteria?
There are two main requirements:

You must complete all 7 months of payment.

You must attempt and submit all course projects.

Can I pay for missed months later?
Yes. A special “Make-Up Fee” link will be available on your dashboard for pending months.

Do you provide placement assistance?
Yes, but placement assistance ≠ placement guarantee. We help you with:

Portfolio-building workshops

Resume & LinkedIn profile sessions

Mock interviews with industry mentors

Internship and freelancing guidance

Will I get guaranteed job placement?
No. The program provides skill-building and career support, not guaranteed jobs.

What type of projects are part of the portfolio?
Students complete 3 mini-projects and one capstone project, such as:

Predicting electricity demand in Pakistan

Analyzing PSL player statistics

Building a chatbot for customer queries

COVID-19 vaccination trend visualization

What is the format of the certificate?
Certificates are digital (PDF) and verifiable via QR code. A printed version can be mailed for Rs 400 extra.

Can I continue learning after completing the program?
Yes. You can join our Advanced AI Fellowship (AAF) for Deep Learning, NLP, and Computer Vision.

Contact & Communication

Where can I ask general questions?
You can email us at info@datasciencepakistan.org
 or join our official Telegram community for discussions.

Will I get mentorship support after course completion?
Yes, alumni receive access to monthly mentor meetups and career clinics for one year after graduation.

Do you organize hackathons or challenges?
Yes. We conduct bi-monthly hackathons with cash prizes and internship opportunities for top performers.

Where are the sessions conducted?
Sessions are hosted live on Zoom and managed through our LMS platform. On-site classes are available in Karachi, Lahore, and Islamabad.

In short:
The Data Science & AI Mentorship Program (Pakistan) is a 7-month guided journey blending affordability, mentorship, and project-based learning for aspiring data scientists. Whether you’re from a tech or non-tech background, this program is structured to make you industry-ready through live sessions, local datasets, and one-on-one mentorship.
"""

In [21]:
import tensorflow as tf
from tensorflow.keras.preprocessing.text import Tokenizer

In [3]:
tokenizer = Tokenizer()

In [4]:
tokenizer.fit_on_texts([faqs])

In [5]:
len(tokenizer.word_index)

458

In [6]:
input_sequences = []
for sentence in faqs.split('\n'):
  tokenized_sentence = tokenizer.texts_to_sequences([sentence])[0]

  for i in range(1,len(tokenized_sentence)):
    input_sequences.append(tokenized_sentence[:i+1])
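
The loop above expands each line into growing prefixes, so that every prefix can later act as one training example whose last token is the label. A minimal sketch on a toy sentence, assuming a hand-made vocabulary (`toy_index` and `prefixes` are hypothetical names, not part of the fitted tokenizer):

```python
# Hypothetical toy vocabulary; the notebook itself uses tokenizer.word_index.
toy_index = {"what": 1, "is": 2, "the": 3, "course": 4, "fee": 5}


def prefixes(token_ids):
    """Return every prefix of length >= 2, mirroring the loop in the cell above."""
    return [token_ids[: i + 1] for i in range(1, len(token_ids))]


sentence = "what is the course fee"
ids = [toy_index[word] for word in sentence.split()]
toy_sequences = prefixes(ids)

for seq in toy_sequences:
    print(seq)
```

A line with n tokens yields n - 1 prefixes, and a line with a single token (or none) yields no training examples at all.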

In [None]:
input_sequences

[[100, 1],
 [100, 1, 13],
 [12, 7],
 [12, 7, 1],
 [12, 7, 1, 27],
 [12, 7, 1, 27, 67],
 [12, 7, 1, 27, 67, 6],
 [12, 7, 1, 27, 67, 6, 1],
 [12, 7, 1, 27, 67, 6, 1, 28],
 [12, 7, 1, 27, 67, 6, 1, 28, 54],
 [12, 7, 1, 27, 67, 6, 1, 28, 54, 68],
 [12, 7, 1, 27, 67, 6, 1, 28, 54, 68, 37],
 [12, 7, 1, 27, 67, 6, 1, 28, 54, 68, 37, 13],
 [12, 7, 1, 27, 67, 6, 1, 28, 54, 68, 37, 13, 165],
 [12, 7, 1, 27, 67, 6, 1, 28, 54, 68, 37, 13, 165, 166],
 [1, 13],
 [1, 13, 167],
 [1, 13, 167, 8],
 [1, 13, 167, 8, 44],
 [1, 13, 167, 8, 44, 45],
 [1, 13, 167, 8, 44, 45, 168],
 [1, 13, 167, 8, 44, 45, 168, 38],
 [1, 13, 167, 8, 44, 45, 168, 38, 69],
 [1, 13, 167, 8, 44, 45, 168, 38, 69, 46],
 [1, 13, 167, 8, 44, 45, 168, 38, 69, 46, 70],
 [1, 13, 167, 8, 44, 45, 168, 38, 69, 46, 70, 169],
 [1, 13, 167, 8, 44, 45, 168, 38, 69, 46, 70, 169, 170],
 [1, 13, 167, 8, 44, 45, 168, 38, 69, 46, 70, 169, 170, 101],
 [1, 13, 167, 8, 44, 45, 168, 38, 69, 46, 70, 169, 170, 101, 71],
 [1, 13, 167, 8, 44, 45, 168, 38, 6

In [7]:
max_len = max([len(x) for x in input_sequences])

In [8]:
from tensorflow.keras.preprocessing.sequence import pad_sequences
padded_input_sequences = pad_sequences(input_sequences, maxlen = max_len, padding='pre')
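
To make the effect of `padding='pre'` concrete, here is a rough pure-Python sketch of left-padding. `pad_pre` is a hypothetical helper written for illustration; it is a simplification of what `pad_sequences` does, not its implementation.

```python
def pad_pre(sequences, maxlen, value=0):
    """Left-pad each sequence with `value` until it has `maxlen` items.

    Simplification: sequences longer than `maxlen` are cut from the front,
    which matches the default truncation side of Keras' pad_sequences.
    """
    padded = []
    for seq in sequences:
        seq = list(seq)[-maxlen:]
        padded.append([value] * (maxlen - len(seq)) + seq)
    return padded


toy = [[1, 2], [1, 2, 3], [1, 2, 3, 4]]
toy_padded = pad_pre(toy, maxlen=4)

for row in toy_padded:
    print(row)
```

Padding on the left keeps the most recent token in the final column of every row, which is what makes the later split into inputs and labels a simple column slice.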

In [9]:
padded_input_sequences

array([[  0,   0,   0, ...,   0, 100,   1],
       [  0,   0,   0, ..., 100,   1,  13],
       [  0,   0,   0, ...,   0,  12,   7],
       ...,
       [  0,   0,   1, ...,   2,  43,  18],
       [  0,   1,  28, ...,  43,  18,  43],
       [  1,  28,  54, ...,  18,  43,  37]], dtype=int32)

In [10]:
X = padded_input_sequences[:,:-1]

In [11]:
y = padded_input_sequences[:,-1]

In [12]:
X.shape

(1028, 51)

In [13]:
y.shape

(1028,)
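
The shapes above follow from the slicing: each padded row has `max_len` entries, the last one becomes the label, and the remaining `max_len - 1` (here 51) become the input. A small sketch of the same split on toy data, using plain lists instead of NumPy arrays (the `toy_*` names are illustrative only):

```python
# Toy pre-padded sequences: 3 rows, each of length 4.
toy_padded = [
    [0, 0, 1, 2],
    [0, 1, 2, 3],
    [1, 2, 3, 4],
]

# Everything except the last column is the input context ...
toy_X = [row[:-1] for row in toy_padded]
# ... and the last column is the next word to predict.
toy_y = [row[-1] for row in toy_padded]

print(toy_X)
print(toy_y)
```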

In [14]:
from tensorflow.keras.utils import to_categorical
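# Index 0 is reserved for padding, so there are vocab size + 1 classes (458 + 1 = 459).
y = to_categorical(y, num_classes=len(tokenizer.word_index) + 1)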

In [15]:
y.shape

(1028, 459)
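
The 459 columns come from the 458 words in `tokenizer.word_index` plus one extra slot: word indices start at 1, and index 0 is the padding value. A minimal pure-Python sketch of the one-hot step under that convention (`one_hot` is a hypothetical helper, not the Keras function):

```python
def one_hot(labels, num_classes):
    """Turn integer labels into rows with a single 1.0 at the label's index."""
    rows = []
    for label in labels:
        row = [0.0] * num_classes
        row[label] = 1.0
        rows.append(row)
    return rows


vocab_size = 3                 # toy vocabulary with word indices 1..3
num_classes = vocab_size + 1   # +1 because index 0 is reserved for padding
toy_onehot = one_hot([2, 3, 1], num_classes)

for row in toy_onehot:
    print(row)
```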

In [16]:
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Embedding, LSTM, Dense

In [18]:
vocab_size = len(tokenizer.word_index) + 1  # 459: 458 words plus the padding index 0

# Two stacked LSTM layers: the first must return its full output sequence
# so that the second LSTM receives a sequence rather than a single vector.
model = Sequential()
model.add(Embedding(vocab_size, 100))
model.add(LSTM(150, return_sequences=True))
model.add(LSTM(150))
model.add(Dense(vocab_size, activation='softmax'))


In [19]:
model.compile(loss='categorical_crossentropy', optimizer='adam',metrics=['accuracy'])

In [None]:
model.summary()

In [20]:
model.fit(X,y,epochs=100)

Epoch 1/100
33/33 ━━━━━━━━━━━━━━━━━━━━ 10s 158ms/step - accuracy: 0.0284 - loss: 6.0572
Epoch 2/100
33/33 ━━━━━━━━━━━━━━━━━━━━ 10s 160ms/step - accuracy: 0.0359 - loss: 5.6817
Epoch 3/100
33/33 ━━━━━━━━━━━━━━━━━━━━ 7s 201ms/step - accuracy: 0.0404 - loss: 5.6275
Epoch 4/100
33/33 ━━━━━━━━━━━━━━━━━━━━ 5s 158ms/step - accuracy: 0.0418 - loss: 5.5951
Epoch 5/100
33/33 ━━━━━━━━━━━━━━━━━━━━ 6s 181ms/step - accuracy: 0.0417 - loss: 5.5636
Epoch 6/100
33/33 ━━━━━━━━━━━━━━━━━━━━ 6s 170ms/step - accuracy: 0.0384 - loss: 5.5380
Epoch 7/100
33/33 ━━━━━━━━━━━━━━━━━━━━ 5s 160ms/step - accuracy: 0.0405 - loss: 5.5320
Epoch 8/100
33/33 ━━━━━━━━━━━━━━━━━━━━ 7s 194ms/step - accuracy: 0.0574 - loss: 5.4149
Epoch 9/100
33/33 ...

<keras.src.callbacks.history.History at 0x7ccdcc1014f0>

In [22]:
import time
import numpy as np

text = "The Data Science & AI Mentorship"

for i in range(10):
  # tokenize the text generated so far
  token_text = tokenizer.texts_to_sequences([text])[0]
  # pad to the same length as the training inputs (X has max_len - 1 columns)
  padded_token_text = pad_sequences([token_text], maxlen=max_len - 1, padding='pre')
  # pick the index of the most likely next word
  pos = int(np.argmax(model.predict(padded_token_text)))

  # map the index back to its word; index 0 is padding and has no word
  word = tokenizer.index_word.get(pos)
  if word is not None:
    text = text + " " + word
    print(text)
    time.sleep(2)

1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 405ms/step
The Data Science & AI Mentorship program
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 55ms/step
The Data Science & AI Mentorship program pakistan
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 56ms/step
The Data Science & AI Mentorship program pakistan is
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 55ms/step
The Data Science & AI Mentorship program pakistan is a
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 60ms/step
The Data Science & AI Mentorship program pakistan is a 7
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 136ms/step
The Data Science & AI Mentorship program pakistan is a 7 month
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 54ms/step
The Data Science & AI Mentorship program pakistan is a 7 month guided
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 69ms/step
The Data Science & AI Mentorship 
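
The generation cell is a greedy decoding loop: score every word, append the highest-scoring one, repeat. The sketch below isolates that loop from Keras by swapping the trained model for a tiny stand-in. `fake_scores`, `toy_index_word`, and `greedy_generate` are all hypothetical and exist only for this illustration.

```python
# Hypothetical reverse vocabulary (index -> word); index 0 is padding.
toy_index_word = {1: "the", 2: "program", 3: "is"}


def fake_scores(token_ids):
    """Stand-in for model.predict: one score per class, based on the last token."""
    next_id = {1: 2, 2: 3, 3: 1}[token_ids[-1]]
    return [1.0 if i == next_id else 0.0 for i in range(4)]


def greedy_generate(token_ids, steps):
    """Repeatedly append the highest-scoring next token (greedy decoding)."""
    ids = list(token_ids)
    for _ in range(steps):
        scores = fake_scores(ids)
        next_id = max(range(len(scores)), key=scores.__getitem__)
        if next_id == 0:  # padding index: no word to emit, so stop
            break
        ids.append(next_id)
    return " ".join(toy_index_word[i] for i in ids)


generated = greedy_generate([1], steps=3)
print(generated)
```

Greedy decoding always takes the single most likely word, so a model that has memorised a small corpus will tend to reproduce training sentences verbatim, as the output above suggests.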

In [None]:
tokenizer.word_index

{'the': 1,
 'and': 2,
 'can': 3,
 'you': 4,
 'i': 5,
 'for': 6,
 'is': 7,
 'a': 8,
 'to': 9,
 'will': 10,
 'yes': 11,
 'what': 12,
 'program': 13,
 'of': 14,
 'in': 15,
 'sessions': 16,
 'we': 17,
 'on': 18,
 'access': 19,
 'your': 20,
 'payment': 21,
 'all': 22,
 'join': 23,
 'learning': 24,
 'be': 25,
 'are': 26,
 'course': 27,
 'data': 28,
 'our': 29,
 'lms': 30,
 'after': 31,
 'with': 32,
 'if': 33,
 'live': 34,
 'or': 35,
 'support': 36,
 'mentorship': 37,
 'where': 38,
 'pakistan': 39,
 'at': 40,
 'get': 41,
 'do': 42,
 'one': 43,
 'monthly': 44,
 'subscription': 45,
 'pay': 46,
 'students': 47,
 'from': 48,
 '7': 49,
 'months': 50,
 'datasciencepakistan': 51,
 'org': 52,
 'through': 53,
 'science': 54,
 'payments': 55,
 'duration': 56,
 'case': 57,
 'session': 58,
 'receive': 59,
 'email': 60,
 'every': 61,
 'projects': 62,
 'dashboard': 63,
 'via': 64,
 'available': 65,
 'placement': 66,
 'fee': 67,
 'ai': 68,
 'learners': 69,
 'rs': 70,
 'this': 71,
 'an': 72,
 'total': 73,
 '