Author Topic: Testing BoincRescheduler v1.0  (Read 3707 times)

Offline Pepo

  • Tester
  • Hero Member
  • Posts: 875
Testing BoincRescheduler v1.0
« on: July 19, 2010, 06:03:09 pm »
I've got a red ERROR: check the log!

Code: [Select]
---------- Debug begin
log_all_tasks = 1
rsc_fpops = 1
---------- Debug end
19 July 2010 - 19:40:22 BoincRescheduler V: 1.0

19 July 2010 - 19:47:37  true_angle_range: 0.44330727921088 beam_width: 0.0500000007 sample_rate: 9765.625 nsamples: 1048576 pot_min_slew: 0.00209999993 pot_max_slew: 0.0104999999 chirp_resolution: 0.1665
19 July 2010 - 19:47:37 wu12my10aa.26352.2112.14.10.25 - Read: 158430379595272.620000, Calculated: 158430379595272.000000, Ratio: 1.000000
19 July 2010 - 19:47:37  true_angle_range: 0.4431461863789 beam_width: 0.0500000007 sample_rate: 9765.625 nsamples: 1048576 pot_min_slew: 0.00209999993 pot_max_slew: 0.0104999999 chirp_resolution: 0.1665
19 July 2010 - 19:47:37 wu14my10aa.12824.14791.11.10.125 - Read: 158454586579697.500000, Calculated: 158454586579698.000000, Ratio: 1.000000
19 July 2010 - 19:47:37  true_angle_range: 0.44302007965294 beam_width: 0.0500000007 sample_rate: 9765.625 nsamples: 1048576 pot_min_slew: 0.00209999993 pot_max_slew: 0.0104999999 chirp_resolution: 0.1665
19 July 2010 - 19:47:37 wu15my10ab.28922.2112.6.10.177 - Read: 158473548581197.120000, Calculated: 158473548581197.000000, Ratio: 1.000000
19 July 2010 - 19:47:37  true_angle_range: 0.0097986224112693 beam_width: 0.0500000007 sample_rate: 9765.625 nsamples: 1048576 pot_min_slew: 0.00209999993 pot_max_slew: 0.0104999999 chirp_resolution: 0.1665
19 July 2010 - 19:47:37 wu01jn10aa.25560.80221.16.10.77 - Read: 160720000000000.000000, Calculated: 160720000000000.000000, Ratio: 1.000000
19 July 2010 - 19:47:37  true_angle_range: 0.42121945709895 beam_width: 0.0500000007 sample_rate: 9765.625 nsamples: 1048576 pot_min_slew: 0.00209999993 pot_max_slew: 0.0104999999 chirp_resolution: 0.1665
19 July 2010 - 19:47:37 wu15my10ab.7870.49530.8.10.103 - Read: 161922232675840.910000, Calculated: 161922232675841.000000, Ratio: 1.000000
19 July 2010 - 19:47:37 wu18no09aj.30466.7843.3.13.138 - Read: 161922232675840.910000, Calculated: 157794699338661.000000, Ratio: 1.026158
19 July 2010 - 19:47:37 wu18no09aj.30466.8661.3.13.235 - Read: 161922232675840.910000, Calculated: 157829454578104.000000, Ratio: 1.025932
19 July 2010 - 19:47:37 wu18no09aj.30466.9479.3.13.35 - Read: 161922232675840.910000, Calculated: 157835637345957.000000, Ratio: 1.025891
19 July 2010 - 19:47:37 wu18no09aj.30466.9479.3.13.208 - Read: 161922232675840.910000, Calculated: 157835637345957.000000, Ratio: 1.025891
19 July 2010 - 19:47:37 wu18no09aj.30466.10297.3.13.76 - Read: 161922232675840.910000, Calculated: 157829423360584.000000, Ratio: 1.025932
19 July 2010 - 19:47:37 wu18no09aj.30466.11524.3.13.76 - Read: 161922232675840.910000, Calculated: 157829557458494.000000, Ratio: 1.025931
19 July 2010 - 19:47:37 ERROR: Cpu and GPU countMore than one Plan Class: cuda23 ,cuda
19 July 2010 - 19:47:37 ERROR: Scan completed with an error.

I have 6 Seti Beta tasks (18no09aj.30466.* - 5 cuda + 1 CPU), which are listed in the log, and additionally 6 Seti tasks (**my10**.* and 01jn10**.* - 5 cuda23 + 1 CPU), which are not (yet?) mentioned in the log.
Was the log output stopped prematurely? Did BR incorrectly mix tasks belonging to different projects?
Peter
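
For anyone who wants to check their own log the same way, here is a minimal sketch that pulls the "Read / Calculated / Ratio" lines out of a log and lists the tasks whose ratio is not 1. The regular expression is inferred only from the lines posted above, and the function names and the tolerance are made up for illustration; this is not how BR itself processes the log.

```python
import re

# Pattern inferred from the log lines posted above (an assumption, not BR's own format spec).
LINE_RE = re.compile(
    r"(?P<task>wu\S+) - Read: (?P<read>[\d.]+), "
    r"Calculated: (?P<calc>[\d.]+), Ratio: (?P<ratio>[\d.]+)"
)

def parse_fpops_lines(log_text):
    """Return one dict per task line, with the ratio recomputed from the two values."""
    rows = []
    for match in LINE_RE.finditer(log_text):
        read = float(match.group("read"))
        calc = float(match.group("calc"))
        rows.append({
            "task": match.group("task"),
            "read": read,
            "calculated": calc,
            "ratio": read / calc,
        })
    return rows

def tasks_off_by(rows, tolerance=0.001):
    """Names of tasks whose read/calculated ratio differs from 1 by more than tolerance."""
    return [r["task"] for r in rows if abs(r["ratio"] - 1.0) > tolerance]

# Two lines copied from the log above: one SETI task and one SETI Beta task.
SAMPLE = """\
19 July 2010 - 19:47:37 wu15my10ab.7870.49530.8.10.103 - Read: 161922232675840.910000, Calculated: 161922232675841.000000, Ratio: 1.000000
19 July 2010 - 19:47:37 wu18no09aj.30466.7843.3.13.138 - Read: 161922232675840.910000, Calculated: 157794699338661.000000, Ratio: 1.026158
"""

rows = parse_fpops_lines(SAMPLE)
print(tasks_off_by(rows))
```

On the two sample lines, only the 18no09aj task is reported, which matches the ratios printed in the log.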

Offline jjwhalen

  • Active Member
  • Sr. Member
  • Posts: 263
Re: Testing BoincRescheduler v1.0
« Reply #1 on: July 19, 2010, 06:37:58 pm »
I've got a red ERROR: check the log!

[...]

I have 6 Seti Beta tasks (18no09aj.30466.* - 5 cuda + 1 CPU), which are listed in the log, and additionally 6 Seti tasks (**my10**.* and 01jn10**.* - 5 cuda23 + 1 CPU), which are not (yet?) mentioned in the log.
Was the log output stopped prematurely? Did BR incorrectly mix tasks belonging to different projects?

Ahh-ha!  This may be another consequence of 2 different "projects" (SETI & SETI Beta) running the same application/revision/plan class at the same time.  We had a situation a few months ago where the task-summing filter was lumping SETI & SETI Beta tasks together.  I recall that Fred had to add the project name as a filter criterion.  Just a thought :)
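
To make the idea concrete, here is a small sketch of what "project name as a filter criterion" could look like. It is only an illustration of the hypothesis: the Task record, its field names, the app name and the function are all invented for this example and are not taken from BR's code.

```python
from collections import namedtuple

# Hypothetical task record; the field names are assumptions, not BR's data model.
Task = namedtuple("Task", "project app plan_class")

def plan_classes_per_app(tasks, include_project=True):
    """Map each app (optionally per project) to the set of GPU plan classes seen."""
    seen = {}
    for t in tasks:
        if not t.plan_class:  # CPU tasks carry no plan class in this sketch
            continue
        key = (t.project, t.app) if include_project else (t.app,)
        seen.setdefault(key, set()).add(t.plan_class)
    return seen

# Task mix as described in the report: 5 cuda23 + 1 CPU on SETI, 5 cuda + 1 CPU on SETI Beta.
# "setiathome_enhanced" is a placeholder app name used for both projects.
tasks = (
    [Task("SETI", "setiathome_enhanced", "cuda23")] * 5
    + [Task("SETI", "setiathome_enhanced", "")]
    + [Task("SETI Beta", "setiathome_enhanced", "cuda")] * 5
    + [Task("SETI Beta", "setiathome_enhanced", "")]
)

# Without the project in the key, both plan classes land in one bucket.
print(plan_classes_per_app(tasks, include_project=False))
# With the project in the key, each bucket holds a single plan class.
print(plan_classes_per_app(tasks, include_project=True))
```

If the check behind "More than one Plan Class" works on buckets like the first one, it would trip on exactly this task mix; keyed by project it would not. Whether that is what actually happens inside BR is only a guess.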

I've been with SETI almost since the beginning (Jul 99) and I've long advocated suspending Beta when they're not actually beta testing something, either on the client or the server side.  They have enough load on the server farm as it is.

 

Offline fred

  • Global Moderator
  • Hero Member
  • Posts: 3501
  • eFMer
Re: Testing BoincRescheduler v1.0
« Reply #2 on: July 20, 2010, 05:51:48 am »
I've got a red ERROR: check the log!

[...]

I have 6 Seti Beta tasks (18no09aj.30466.* - 5 cuda + 1 CPU), which are listed in the log, and additionally 6 Seti tasks (**my10**.* and 01jn10**.* - 5 cuda23 + 1 CPU), which are not (yet?) mentioned in the log.
Was the log output stopped prematurely? Did BR incorrectly mix tasks belonging to different projects?
Can you send me the BOINC data folder?
I need the client_state file and everything in the regular SETI folder.
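
As a rough way to see which plan classes each project has in a state file, something like the sketch below could be used. It assumes a flat layout in which each project element is followed by the app_version elements that belong to it, and it runs on a simplified, hand-written fragment with placeholder URLs and version numbers; a real client_state file contains far more, and the element names used here should be checked against your own file.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Simplified, hand-written fragment; URLs and version numbers are placeholders.
SAMPLE_STATE = """\
<client_state>
  <project>
    <master_url>http://example.org/seti/</master_url>
  </project>
  <app_version>
    <app_name>setiathome_enhanced</app_name>
    <version_num>1</version_num>
    <plan_class>cuda23</plan_class>
  </app_version>
  <project>
    <master_url>http://example.org/seti_beta/</master_url>
  </project>
  <app_version>
    <app_name>setiathome_enhanced</app_name>
    <version_num>2</version_num>
    <plan_class>cuda</plan_class>
  </app_version>
  <app_version>
    <app_name>setiathome_enhanced</app_name>
    <version_num>2</version_num>
  </app_version>
</client_state>
"""

def plan_classes_by_project(xml_text):
    """Collect the non-empty plan classes listed after each project element."""
    root = ET.fromstring(xml_text)
    current = None
    result = defaultdict(set)
    for elem in root:  # walk the top-level children in document order
        if elem.tag == "project":
            current = elem.findtext("master_url", default="").strip()
        elif elem.tag == "app_version" and current is not None:
            plan_class = elem.findtext("plan_class", default="").strip()
            if plan_class:  # CPU app versions have no plan class in this sketch
                result[current].add(plan_class)
    return dict(result)

print(plan_classes_by_project(SAMPLE_STATE))
```

Keeping the result keyed by project makes it easy to see when two projects use the same app name with different plan classes.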

Offline Pepo

  • Tester
  • Hero Member
  • Posts: 875
Re: Testing BoincRescheduler v1.0
« Reply #3 on: July 20, 2010, 07:21:32 am »
Ahh-ha!  This may be another consequence of 2 different "projects" (SETI & SETI Beta) running the same application/revision/plan class at the same time.  We had a situation a few months ago where the task-summing filter was lumping SETI & SETI Beta tasks together.  I recall that Fred had to add the project name as a filter criterion.  Just a thought :)

I've been with SETI almost since the beginning (Jul 99) and I've long advocated suspending Beta when they're not actually beta testing something, either on the client or the server side.  They have enough load on the server farm as it is.
Over the years I also remember various SW problems caused by different projects using similar or identical applications/revisions(/plan classes) at the same time. But I'm rather a "catch & fix the bug while it reproducibly manifests itself" advocate, since I think these were always SW problems, like comparing app numbers without taking the project name and/or app name into account...
Peter

Offline Pepo

  • Tester
  • Hero Member
  • Posts: 875
Re: Testing BoincRescheduler v1.0
« Reply #4 on: July 20, 2010, 07:22:35 am »
I've got a red ERROR: check the log!
[...]
I have 6 Seti Beta tasks (18no09aj.30466.* - 5 cuda + 1 CPU), which are listed in the log, and additionally 6 Seti tasks (**my10**.* and 01jn10**.* - 5 cuda23 + 1 CPU), which are not (yet?) mentioned in the log.
Was the log output stopped prematurely? Did BR incorrectly mix tasks belonging to different projects?
Can you send me the BOINC data folder?
I need the client_state file and everything in the regular SETI folder.
Sent off-list.
Peter

Offline fred

  • Global Moderator
  • Hero Member
  • Posts: 3501
  • eFMer
Re: Testing BoincRescheduler v1.0
« Reply #5 on: July 22, 2010, 11:45:32 am »
Sent off-list.
V 1.2 allows multiple cuda applications.
And giving me a state file with a not exactly synchronized project folder helped me find some other small bugs.
« Last Edit: July 22, 2010, 12:22:10 pm by fred »

Offline Pepo

  • Tester
  • Hero Member
  • Posts: 875
Re: Testing BoincRescheduler v1.0
« Reply #6 on: July 23, 2010, 12:19:20 pm »
Sent off-list.
V 1.2 allows multiple cuda applications.
And giving me a state file with a not exactly synchronized project folder helped me find some other small bugs.
V 1.2 no longer complains about multiple cuda applications.
Peter