我遇到问题,我无法重现Keras和ThensorFlow的结果。结果不能用Keras和TensorFlow在Python中重现
好像最近也一直在Keras documentation site发布针对此问题的解决方法,但不知何故,没有为我工作。
我做错了什么?
我使用一个MBP视网膜Jupyter笔记本(不Nvidia的GPU)。
# ** Workaround from Keras Documentation **
import numpy as np
import tensorflow as tf
import random as rn
# The below is necessary in Python 3.2.3 onwards to
# have reproducible behavior for certain hash-based operations.
# See these references for further details:
# https://docs.python.org/3.4/using/cmdline.html#envvar-PYTHONHASHSEED
# https://github.com/fchollet/keras/issues/2280#issuecomment-306959926
import os
os.environ['PYTHONHASHSEED'] = '0'
# The below is necessary for starting Numpy generated random numbers
# in a well-defined initial state.
np.random.seed(42)
# The below is necessary for starting core Python generated random numbers
# in a well-defined state.
rn.seed(12345)
# Force TensorFlow to use single thread.
# Multiple threads are a potential source of
# non-reproducible results.
# For further details, see: https://stackoverflow.com/questions/42022950/which-seeds-have-to-be-set-where-to-realize-100-reproducibility-of-training-res
session_conf = tf.ConfigProto(intra_op_parallelism_threads=1, inter_op_parallelism_threads=1)
from keras import backend as K
# The below tf.set_random_seed() will make random number generation
# in the TensorFlow backend have a well-defined initial state.
# For further details, see: https://www.tensorflow.org/api_docs/python/tf/set_random_seed
tf.set_random_seed(1234)
sess = tf.Session(graph=tf.get_default_graph(), config=session_conf)
K.set_session(sess)
# ** Workaround end **
# ** Start of my code **
# LSTM and CNN for sequence classification in the IMDB dataset
from keras.models import Sequential
from keras.layers import Dense
from keras.layers import LSTM
from keras.layers.embeddings import Embedding
from keras.preprocessing import sequence
from sklearn import metrics
# fix random seed for reproducibility
#np.random.seed(7)
# ... importing data and so on ...
# create the model
embedding_vecor_length = 32
neurons = 91
epochs = 1
model = Sequential()
model.add(Embedding(top_words, embedding_vecor_length, input_length=max_review_length))
model.add(LSTM(neurons))
model.add(Dense(1, activation='sigmoid'))
model.compile(loss='mean_squared_logarithmic_error', optimizer='adam', metrics=['accuracy'])
print(model.summary())
model.fit(X_train, y_train, epochs=epochs, batch_size=64)
# Final evaluation of the model
scores = model.evaluate(X_test, y_test, verbose=0)
print("Accuracy: %.2f%%" % (scores[1]*100))
Python 3.6.3 | Anaconda custom(x86_64)| (默认,2017年10月6日,12:04:38) [GCC 4.2.1 Compatible Clang 4.0.1(tags/RELEASE_401/final)]
解决方法已包含在代码中(无效)。
随着每次我做培训部分我得到不同的结果。
当复位Jupyter笔记本电脑的内核,第1次与相对应的第一次和第二次与第2次。
所以复位我会永远在第一次运行时获得,例如0.7782,0.7732在第二次运行等
但没有经过内核复位结果是我每次运行它总是不同的。
我会有所帮助的任何建议!
你可以添加'np.random.get_state()'和'rn.getstate()'到输出吗?你使用GPU还是CPU?你可以在'python'中尝试脚本吗? – Maxim