-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Data initialization and validation tracking issue #2
Comments
Thanks a lot, Ewout. Working with Niteesh on this. |
Can multiple types of irrigation be present on the same farm land? If so, which can and which can not? Does having irrigation take any space that can't be used to produce crops? Does irrigation have a maximum capacity, or can it handle any drought? https://www.deccanherald.com/state/cdurga-villages-answer-drought-713763.html |
@nit1995 Thanks a lot for doing the research on this!
Can you make an yes/no table which of these crops can use which of these irrigation types? ['Maize', 'Pigeonpea', 'Sorghum', 'Chickpea', 'Groundnut', 'Finger millet']
Perfect, let’s assume irrigation doesn’t take significant space.
@Kaveri3012 and @nit1995 can you discuss how this translates to yield? I think we could quantify it as precipitation deficit, of which each irrigation system can handle some amount. Then we need a functions:
|
I've added a lookup table for these crops. However, @Kaveri3012 and I are looking at better defining the geographical scope based on the sample size of the respondents belonging to these districts in the CMIE data. Based on the districts chosen, the major crops might change. I will update it soon.
Using ICRISAT data to run a regression is tricky because it doesn't control for the type of irrigation or any other factors that might influence yield. I have been looking at literature to see if I can find some other estimate. This paper shows strong correlation between rainfall from June to September and yield for different crops. Another paper however show that exposure to extreme temperatures impact yield more than rainfall. And that impact of rainfall is more significant (for rice grown) in rainfed than in irrigated conditions. If I don't find any secondary literature on these crops, I might have to use ICRISAT data itself.
This is tough to quantify. Borewells generally provide respite during droughts, but how long they run depends on groundwater levels and also what type of irrigation the farmers use. Irrigation types using water from canals again depend on if the rivers are flowing and enough water has been released into the canal. |
@nit1995 Could you upload the data on the look up tables (and data on farm-land etc ) by the end of today? We already discussed this two days ago. The more data we have sooner, the better, so @EwoutH can build a better model to start with. @EwoutH the link between rainfall deficit and drought seems more complicated to quantify. But we'll try and have something by the end of this week. |
@nit1995 please also list out which of the five districts we will be specifically looking at in Karnataka, and the reasons for choosing those districts, and any related limitations of making that choice. We will then have to find both demography-related (income, consumption trends) and crop-related data, across the three farmer groups for those five districts, and send them to Ewout |
Great work, it’s appreciated! |
@nit1995 Could you try to deliver all data as tidy data? That looks like this:
For example, for the Farmland data, that would go from this
to this:
I now did this in this notebook. If you have more than two axis (for example if you also would like to keep the different years, having 1) the year, 2) farmer size and 3) attribute as axis), please use multi-indexing. In that case, also feel free to save as a Pickle (DataFrame.to_pickle) instead of a CSV. I hope this is possible, if you have any questions please let me know! |
@Kaveri3012 I now just assume the Area per farmer value +- 25% for each farmer type. If you would like another approach please let me know. Edit: Are there any other properties we like to link to farmer type? Like initial wealth or living costs? |
@EwoutH the classification of farmers based on land holdings is as follows: Source: Agricultural Census 2015-16 |
Thanks! Can you find/calculate an distribution function or histogram (bins) by any chance? |
I am unable to find a distribution function or a histogram with bins. The data available only gives the total number of farmers and the aggregate area under each category of farmers. I plotted a histogram with this data, but the classification of land holdings does not provide equal intervals. |
I did some experimentation and found out that the farm sizes quite closely resemble a lognormal distribution! After a bit of experimentation I landed on a lognormal distribution with And results in the following metrics for each classification bin:
Which quite well resembles the table above! Once we have the farm size data per district, we can fit a lognormal distribution for district by estimating the shape and scale parameters. See the 4_India-ABM-farmland-size-distribution-function.ipynb |
@nit1995 please let me know if anything is unclear, you want to discuss something or you’re stuck on something! |
@EwoutH I had the district-wise farm size data in a format similar to this:
I was not sure how to estimate the distribution though |
Hi @EwoutH, A couple of clarifications from Niteesh and me:
Define the shape of the distribution# The parameters can be adjusted based on the characteristics of your specific datashape, loc, scale = 0.92, 0, 1.25
|
Thanks, I can take care of the parameter estimation. It's nothing more than playing around a bit and check if the error values go down. |
@nit1995 Any chance you can price data on 'Castor', 'Linseed', 'Pearl millet' and 'Wheat'? Because currently we only have pricing data on Chickpea, Finger millet, Groundnut, Maize, Paddy, Pigeonpea and Sorghum. Otherwise we will have six crops in rotation: Chickpea, Finger millet, Groundnut, Maize, Pigeonpea and Sorghum (no Paddy, because no area data). If so, please add them to the CSVs without changing the data structure. If not, also no problem, because those 6 listed above are also the most grown by area in Karnataka. |
best, |
Without considering paddy, these six cover about 95% of area. Without paddy, no idea. Edit: I just noticed, we do have area data for rice, but not paddy, and do have price data for paddy, not rice. Probably we can just say rice = paddy, and all our problems are solved. |
We do have area data for rice, but not paddy, and do have price data for paddy, not rice. Probably we can just say rice = paddy, and all our problems are solved. See #2 (comment)
We do have area data for rice, but not paddy, and do have price data for paddy, not rice. Probably we can just say rice = paddy, and all our problems are solved. See #2 (comment)
Paddy = rice solved a lot of problems, we now have 7 crops! @Kaveri3012 I was thinking about how farmers estimate the expected return of switching crops. My initial idea is taking the market price of the past 5 years for their current crop and the crop they want to switch those, and comparing those. |
Sounds like a good start to me. I'm wondering whether a farmer agent also
make price projections into the future?
…On Wed, Jun 14, 2023, 17:01 Ewout ter Hoeven ***@***.***> wrote:
Paddy = rice solved a lot of problems, we now have 7 crops!
@Kaveri3012 <https://github.com/Kaveri3012> I was thinking about how
farmers estimate the expected return of switching crops. My initial idea is
taking the market price of the past 5 years for their current crop and the
crop they want to switch those, and comparing those.
—
Reply to this email directly, view it on GitHub
<#2 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAWFOXYFJSEJD3NZAB5ZE33XLGOJRANCNFSM6AAAAAAYNDJPJY>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
Ewout,
I do think it maybe better to focus on the lending models and setting up
the networks this week, until the details of the investment model (and risk
aversion) etc become clear
On Wed, Jun 14, 2023, 17:25 Kaveri Iychettira ***@***.***>
wrote:
… Sounds like a good start to me. I'm wondering whether a farmer agent also
make price projections into the future?
On Wed, Jun 14, 2023, 17:01 Ewout ter Hoeven ***@***.***>
wrote:
> Paddy = rice solved a lot of problems, we now have 7 crops!
>
> @Kaveri3012 <https://github.com/Kaveri3012> I was thinking about how
> farmers estimate the expected return of switching crops. My initial idea is
> taking the market price of the past 5 years for their current crop and the
> crop they want to switch those, and comparing those.
>
> —
> Reply to this email directly, view it on GitHub
> <#2 (comment)>,
> or unsubscribe
> <https://github.com/notifications/unsubscribe-auth/AAWFOXYFJSEJD3NZAB5ZE33XLGOJRANCNFSM6AAAAAAYNDJPJY>
> .
> You are receiving this because you were mentioned.Message ID:
> ***@***.***>
>
|
Changes in code, since Tuesday:
New assumptions:JLG:
Neigbours:
Lending:
Yield function:
Crop diversification:
Expenditure:
Normalize for inflation:
Farmers
|
@nit1995 For initialisation (and validation), I need to know how much different crops (i.e. 30% has 1, 50% has 2, 20% has 3) a farmer on average has. This can be a lookup or probability function. It might depend on the farm size/class. Do you think you can find some data on that? |
@EwoutH I can't seem to get any data on how many farmers do multiple cropping or on how many crops they sow. I just found this statistic , but this is not India specific. Only 5% of global rainfed cropland is under multiple cropping, whereas 40% of global irrigated cropland is under multiple cropping |
Thanks for looking anyway. I also need a formula for the maximum amount that (a member of) a joint liability group can finance. It would also be nice to have an indication of the typical duration of loans. @Kaveri3012 in the pseudocode I encountered both notes of that a JGL can loan from a bank and microfinance institutions. Is it both, or just one of the two? Edit: Also need to know how to translate income to an amount to lend at a nationalised bank. Last years income, or do you need to show a trend or something? 5 year average or minimum? Simple regression? |
Hi Ewout,
Best, |
This is a tracking issue for all the data we might need, for either initialization or validation.
@nit1995 Thanks for taking this on! Can you take the lead on the Drought and Irrigation parts? Please discuss these things also directly with Kaveri (since she is the subject matter expert).
See also the pseudocode: https://docs.google.com/document/d/1Fay3uEJRzJnAenQnd0Pa9e5AOzRPa5O7vOzR0pKnI5M/edit
Farming
Financial
Network
conceptual-mindmap-India-drought-2023-05-22.zip
The text was updated successfully, but these errors were encountered: