Team:NCTU Formosa/Disease Occurrence Prediction

navigation

NCTU_Formosa: Disease Occurrence Prediction
Overview

     For curing the fungal diseases in reality, we built the disease prediction system. According to the research data, weather conditions are related to fungal diseases happening incidence. Therefore, we used eleven weather data which extended by 4 weather conditions -- relative humidity, temperature, air pressure, and rainfall for the past 14 days. With the use of the convolutional neural network (CNN), we could get the daily probability of the disease happening. Besides using CNN, we also added the spore germination rate to know the more accurate time of spore germination as an optimization.Later, we used IoTtalk to connected with the sensors and the spraying system.

Disease Occurrence model

     With the combined of disease occurrence model and the spore germination rate modeling, we could predict the most likely outbreaking time of fungal diseases.

Content:

- Convolutional Neural Network

- Result of disease occurrence model

- Spore germination rate modeling

- Result of spore germination rate modeling

Convolutional Neural Network

     The method we used was called neural network, one of the popular machine learning solutions, or another well-known name called deep learning. The method that can improve the quality of the model is Convolutional Neural Network (CNN)

Datasets:

     To complete this model, we needed two kinds of data in the beginning. One was weather data, the other was the weather data that respond to the time when fungal diseases happened.

- Collection:

     The fungal disease data were from the government agency, and the weather information was from the Central Meteorological Bureau's website.

- Pre-processing:

     We combined the fungal diseases data and the weather information then deleted the diseases data that had no weather data to match with. Later, these data were standardized for machine training and testing.

Convolutional Neural Network(CNN):

     CNN can recognize the patterns of the feature automatically. Therefore, we used it to catch the pattern of the favorite weather change to the fungal diseases in the life cycle. In other words, if CNN identified the weather change that was suitable for fungal diseases in the specific time, it could correspond to the life cycle of the fungal diseases.

- Max pooling:

     After parameters went through the CNN layers, the amount of data were increased, therefore we also added max pooling layer, which is a commanding way that can help reduce the computational complexity of the model and help to find the best tendency of these data.

- Full connection layer:

     It converted the max pooling output into high dim space, and classified them into two classes, negative(diseases that had not happened) and positive(diseases that had happened).

- Softmax function:

     The network output was hard to be realized by humans, so we used softmax function to transform it into the diseases happening probability that could be understood by both machines and humans.

     After that, we used cross-entropy to evaluate the differences between the predicted values and actual values in the training stage. At last, we used independent data to test our final model.

Result

Figure 1: After a bunch of train and test steps, we got our model with about 83% of accuracy.

(For the result and technology in detail, please check out Disease Occurrence Modeling.)

Spore Germination Rate Modeling

     Although we could get the daily disease occurrence with CNN method, however, we wanted to get the more accurate timing to let our prediction more effective. Due to the fact that the spore germination must happen before disease occurrence, we tried to get the conditions that cause spores germinate. As a result, we made the spore germination rate with the research data and confirmed it by experiment.

Formula:

     According to the researchers, humidity and temperature affect the spore germination the most between weather conditions. As the result, we focused on the relationships between the humidity with the spore germination and temperature with the spore germination.

- Spore germination rate based on temperature: f1(x)=ax3+bx2+cx+d, can be fitted by a cubic equation.

- Spore germination rate based on humidity: f2(x)=ax+b, can be fitted with a linear equation

- The general spore germination rate: f1(x) x f2(x)

Experiment:

- Fixing temperature and change humidity:

     Put the spore suspension solution(2x105particles/mL) and 2% glucose solution mixing with equal volume, in concave glass slide and then place it into temperature and the humidity control box. The humidity was fixed at 100%, the temperature was tested every 5 degrees from 10 to 30 Celsius degree.

- Fixing humidity and change temperature:

     Put the spore suspension solution(2x105particles/mL) and 2% glucose solution mixing with equal volume, in concave glass slide and placed into temperature and humidity control box. The temperature was fixed at 25 Celsius degree, and the humidity was tested every 5 percents from 80% to 100%.

- Independent event validation:

     Compared the result of formula with the value of the spore germination in the reality to verify temperature and relative humidity are independent event to spore germination. We randomly choose the conditions of temperature 23 and 13 Celsius, relative humidity 97% and 80%.

Result:

- Botrytis cinerea’s spore germination rate based on temperature:

$$ f_1 = ( -0.0625x_1^3 + 2.9974x_1^2 - 37.865x_1 + 141.68) $$

Figure 2: Botrytis cinerea’s spore germination rate based on temperature.(9 hours)

- Botrytis cinerea’s spore germination rate based on relative humidity:

$$ f_2 = 316.88x - 216.88 $$

Figure 3: Botrytis cinerea’s spore germination rate based on relative humidity(9 hours

Table 1: Botrytis cinerea’s spore germination rate based on relative humidity(9 hours)

- Independent event validation:

      The results of comparing the spore germination rate of our experiment and formula.

      The result can proof that our formula is quite precise.

- Conclusion:

      Our final formula is:

$$ f_1 \times f_2 = [(-0.0625x_1^3 + 2.9974x_1^2 - 37.865x_1 + 141.8)\div 100]\times [(316.88x_2 - 216.88)\div 100] $$

     We divided it by 100 respectively because we change % to possibility, where x1 is temperature and x2 is relative humidity.

     According our experiment result, we prove that temperature and relative humidity is an independent event and our formula is quite precise.

(For the result and technology in detail, please check out Spore Germination Rate Modeling)

Application

     To put the disease occurrence prediction in reality, we combined it with IoT system and we used the IoTtalk platform to make the IoT contact easier and more convenient and link the entire system.The IoTtalk platform was developed by the lab of Dr.Lin in National Chiao Tung University, Taiwan.

IoTtalk

     To put the disease occurrence prediction, in reality, we combined it with IoT system and we used the IoTtalk platform to make the IoT contact easier and more convenient and link the entire system.The IoTtalk platform was developed by the lab of Dr.Lin in National Chiao Tung University, Taiwan.

     First, we used the sensors to grab the prompt weather conditions and sent to get the spore germination incidence or accumulated till 14 days then sent into the predict system to get the daily possibility of disease occurrence. If both overcame the threshold, the spraying system would be opened and sprayed the peptide that was chosen from our database. The figure below is the main architecture of our project.

Figure 4: Main architecture of IoT

(For more in detail, please check out Demonstration--Plant box.)

Reference:

[1]Mofan Zhou, https://morvanzhou.github.io/tutorials/machine-learning/, 2017

[2]c1mone Tensorflow Day Generative Adversarial Network with MNIST, https://ithelp.ithome.com.tw/articles/10187424, 2016

Untitled Document