2016-10-13 27 views
-2

我工作的“如何看待一个计算机科学家”,当然Python中的散点图,和我被困在这个问题:开发利用龟

解释数据文件labdata.txt使得每行包含一个x,y坐标对。编写一个名为plotRegression函数从该文件中读取数据,并且使用一个龟绘制这些点,并根据下面的公式的最佳拟合线:

Y = Y + M(X-X)

m =Σxiyi-nx¯y¯Σx2i-nx¯2

您的程序应该分析点并使用setworldcoordinates正确缩放窗口,以便每个点都可以绘制。然后你应该用不同的颜色画出最合适的线。

这是我到目前为止,但我不断收到'int'不支持索引错误。我一直在使用各种在线资源和一些解决方案,但似乎无法正常工作。

任何人都可以帮我找出纠正什么?

import turtle 


def plotRegression(data): 
    win = turtle.Screen() 
    win.bgcolor('pink') 

    t = turtle.Turtle() 
    t.shape('circle') 
    # t.turtlesize(0.2) 

    x_list, y_list = [int(i[0]) for i in plot_data], [int(i[1]) for i in plot_data] 
    x_list, y_list = [float(i) for i in x_list], [float(i) for i in y_list] 
    x_sum, y_sum = sum(x_list), sum(y_list) 
    x_bar, y_bar = x_sum/len(x_list), y_sum/len(y_list) 
    x_list_square = [i ** 2 for i in x_list] 
    x_list_square_sum = sum(x_list_square) 
    xy_list = [x_list[i] * y_list[i] for i in range(len(x_list))] 
    xy_list_sum = sum(xy_list) 

    m = (xy_list_sum - len(x_list) * x_bar * y_bar)/(x_list_square_sum - len(x_list) * x_bar ** 2) 
    # best y 
    y_best = [(y_bar + m * (x_list[i] - x_bar)) for i in range(len(x_list))] 

    # plot points 

    max_x = max(x_list) 
    max_y = max(y_list) 
    win.setworldcoordinates(0, 0, max_x, max_y) 
    for i in range(len(x_list)): 
     t.penup() 
     t.setposition(x_list[i], y_list[i]) 
     t.stamp() 

    # plot best y 
    t.penup() 
    t.setposition(0, 0) 
    t.color('blue') 
    for i in range(len(x_list)): 
     t.setposition(x_list[i], y_best[i]) 
     t.pendown() 

    win.exitonclick() 


f = open("labdata.txt", "r") 
for aline in f: 
    plot_data = map(int, aline.split()) 
plotRegression(plot_data) 

回答

0

我觉得你的龟图形是一个次要的问题 - 你是不是在正确的阅读您的数据。除了最后一个x,y对之外,你都在扔东西。而map()不是你的朋友,因为你需要将结果编入plotRegression()。你也直接访问plot_data而不是正式参数data和其他细节。

这里是我的代码的返工,看它是否让你在一个更好的方向前进:

from turtle import Turtle, Screen 

def plotRegression(data): 

    x_list, y_list = [int(i[0]) for i in data], [int(i[1]) for i in data] 
    x_list, y_list = [float(i) for i in x_list], [float(i) for i in y_list] 
    x_sum, y_sum = sum(x_list), sum(y_list) 
    x_bar, y_bar = x_sum/len(x_list), y_sum/len(y_list) 
    x_list_square = [i ** 2 for i in x_list] 
    x_list_square_sum = sum(x_list_square) 
    xy_list = [x_list[i] * y_list[i] for i in range(len(x_list))] 
    xy_list_sum = sum(xy_list) 

    m = (xy_list_sum - len(x_list) * x_bar * y_bar)/(x_list_square_sum - len(x_list) * x_bar ** 2) 
    # best y 
    y_best = [(y_bar + m * (x_list[i] - x_bar)) for i in range(len(x_list))] 

    # plot points 

    turtle = Turtle(shape = 'circle') 

    for i in range(len(x_list)): 
     turtle.penup() 
     turtle.setposition(x_list[i], y_list[i]) 
     turtle.stamp() 

    # plot best y 
    turtle.penup() 
    turtle.setposition(0, 0) 
    turtle.color('blue') 
    for i in range(len(x_list)): 
     turtle.setposition(x_list[i], y_best[i]) 
     turtle.pendown() 

    return (min(x_list), min(y_list), max(x_list), max(y_list)) 

screen = Screen() 

screen.bgcolor('pink') 

f = open("labdata.txt", "r") 

plot_data = [] 

for aline in f: 
    x, y = aline.split() 
    plot_data.append((x, y)) 

# This next line should be something like: 

# screen.setworldcoordinates(*plotRegression(plot_data)) 

# but setworldcoordinates() is so tricky to work with 
# that I'm going to leave it at: 

print(*plotRegression(plot_data)) 

# and suggest you trace a rectangle with the return 
# values to get an idea what's going to happen to 
# your coordinate system 

screen.exitonclick()