SharkBatch

Install

$ cd src
$ make

Run:
$ ./sharkbatch
or specify options and flags:
$ ./sharkbatch -cq <baseQuantum> <numPriorities>

Uninstall:
$ make clean

Options

-c: Chain Weighting Mode (see below)
-q: Varying Quanta Mode (see below)
baseQuantum: Size of the baseline priority's quantum, in jiffies
numPriorities: Number of levels in the multilevel feedback queue (see below)

The Multilevel Feedback Queue Scheduling Algorithm

In SharkBatch, when jobs are created, the client does not specify a priority. All jobs start in the highest-priority queue. The intent is that short processes get a chance to run quickly and interrupt longer, batch-like processes, which end up in the lower-level queues for round-robin processing. Overall, the MLFQ is often described as a "relatively fair scheduler." The sketch after this paragraph illustrates the basic demotion policy.
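As a rough illustration of that policy only, the sketch below uses a hypothetical Job type and a vector of queues rather than SharkBatch's actual classes: a job that exhausts a full slice without finishing is demoted one level.

```cpp
// Minimal MLFQ sketch: new jobs enter the highest-priority queue (index 0)
// and are demoted one level each time they use a full slice without finishing.
// Job, the queue layout, and runFor() are illustrative, not SharkBatch's API.
#include <algorithm>
#include <cstddef>
#include <queue>
#include <vector>

struct Job {
    int pid;
    int remaining;              // jiffies of CPU burst left
    bool runFor(int slice) {    // returns true if the job completed
        remaining -= slice;
        return remaining <= 0;
    }
};

void scheduleOnce(std::vector<std::queue<Job>>& levels, int quantum) {
    // Run one slice from the highest non-empty priority level.
    for (std::size_t p = 0; p < levels.size(); ++p) {
        if (levels[p].empty()) continue;
        Job job = levels[p].front();
        levels[p].pop();
        if (!job.runFor(quantum)) {
            // Unfinished: demote one level, or stay at the bottom for
            // round-robin processing.
            std::size_t next = std::min(p + 1, levels.size() - 1);
            levels[next].push(job);
        }
        return;
    }
}
```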

More MLFQ reading:

Solaris' scheduler is an example of an early adopter of the MLFQ that was met with success.

Theoretical proofs of the effectiveness of varying quanta, of the benefits over round-robin, and of which types of jobs the MLFQ works best with.

Slice computation via quanta

SharkBatch can compute the length of a slice per job using a number of different methods, or "modes," which can be specified on process invocation. (A baseline quantum must always be specified as the first numerical argument.)

If no modes are selected, all slices are equal to the baseline quantum. Both modes may be activated simultaneously by specifying -cq:

Varying Quanta Mode

If "varying quanta mode" is specified with -q, then higher priority jobs will receive shorter slices. In this case every priority has a unique, decreasing quantum:

BaseQuantum - priorityLevel (baseQuantum / numPriorities)

The difference between each priority level is constant. This is common in the MLFQ literature as a way to further increase the MLFQ's effectiveness as a time discriminator, as longer jobs get pushed to lower priorities. This is useful when you need more insurance that the scheduler can't be "gamed" (see below) to starve I/O processes.
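A minimal sketch of the formula above, assuming integer arithmetic and that a larger priorityLevel value denotes a higher priority (so that higher priorities receive shorter slices); this is an illustration, not SharkBatch's code:

```cpp
// slice = baseQuantum - priorityLevel * (baseQuantum / numPriorities)
// Assumption: larger priorityLevel = higher priority, so the slice shrinks
// by one constant step per level of priority.
int sliceForPriority(int baseQuantum, int numPriorities, int priorityLevel) {
    int step = baseQuantum / numPriorities;   // constant difference between levels
    return baseQuantum - priorityLevel * step;
}
```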

Scheduling statistics reference

For a job, the following statistics represent the time elapsed between a job's key events:

Turnaround = Complete - Begin
Latency = Begin - Insert
Response = Complete - Insert

Throughput = # complete jobs / total time elapsed

In SharkBatch, the clock only records time during which a jiffy is being processed; wall-clock time is never recorded, even if an empty MLFQ is left running.

Avg latency
Avg response time
Avg turnaround time
Avg turnaround per burst time = mean(turnaround/burst for each job)
Avg latency per burst time = mean(latency/burst for each job)
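These aggregates could be computed from per-job records along the following lines; JobRecord, the field names, and computeStats are assumptions for illustration, not SharkBatch's internals.

```cpp
// Sketch of the per-job and aggregate statistics listed above.
#include <vector>

struct JobRecord {
    long insert;    // time the job was added
    long begin;     // time the job first ran
    long complete;  // time the job finished
    long burst;     // total CPU burst consumed
};

struct Stats {
    double avgLatency = 0, avgResponse = 0, avgTurnaround = 0;
    double avgTurnaroundPerBurst = 0, avgLatencyPerBurst = 0;
    double throughput = 0;
};

Stats computeStats(const std::vector<JobRecord>& jobs, long totalTimeElapsed) {
    Stats s;
    if (jobs.empty() || totalTimeElapsed <= 0) return s;
    for (const JobRecord& j : jobs) {
        long turnaround = j.complete - j.begin;   // Turnaround = Complete - Begin
        long latency    = j.begin - j.insert;     // Latency    = Begin - Insert
        long response   = j.complete - j.insert;  // Response   = Complete - Insert
        s.avgTurnaround         += turnaround;
        s.avgLatency            += latency;
        s.avgResponse           += response;
        s.avgTurnaroundPerBurst += static_cast<double>(turnaround) / j.burst;
        s.avgLatencyPerBurst    += static_cast<double>(latency) / j.burst;
    }
    double n = static_cast<double>(jobs.size());
    s.avgTurnaround /= n;  s.avgLatency /= n;  s.avgResponse /= n;
    s.avgTurnaroundPerBurst /= n;  s.avgLatencyPerBurst /= n;
    s.throughput = n / totalTimeElapsed;  // # complete jobs / total time elapsed
    return s;
}
```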

Dependency resolution

SharkBatch also supports dependency resolution of jobs. A topological sort is applied when a client specifies job dependencies as a DAG. If Chain Weighting Mode is specified, jobs with longer total DAG time are prioritized in a way consistent with optimizing the entire batch of jobs; however, the latency of each individual job is balanced against the ability to unblock jobs that may be more recent and thus have a lower latency expectation. One of the core features of SharkBatch is its ability to combine traditional DAG scheduling with the MLFQ algorithm in how it recursively evaluates dependencies when making determinations about time allocation.
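The release rule can be pictured as standard DAG bookkeeping: a job becomes runnable (eligible for the top queue) only once all of its dependencies have completed. The sketch below shows that idea in the style of Kahn's algorithm; the DependencyTracker type and its methods are illustrative, not SharkBatch's code.

```cpp
// Each waiting job tracks how many of its dependencies are still unfinished;
// when a job completes, its dependents' counts are decremented and any job
// reaching zero becomes runnable. Names are illustrative only.
#include <unordered_map>
#include <unordered_set>
#include <vector>

struct DependencyTracker {
    std::unordered_map<int, int> pendingDeps;              // pid -> unfinished deps
    std::unordered_map<int, std::vector<int>> dependents;  // dep pid -> waiting pids
    std::unordered_set<int> finished;                      // pids already complete

    // Register a new job; returns true if it can run immediately.
    bool addJob(int pid, const std::vector<int>& deps) {
        int pending = 0;
        for (int d : deps) {
            if (finished.count(d)) continue;  // dependency already done
            dependents[d].push_back(pid);     // may be a PID never seen before
            ++pending;
        }
        pendingDeps[pid] = pending;
        return pending == 0;
    }

    // Mark a job complete; returns the jobs that just became runnable.
    std::vector<int> complete(int pid) {
        finished.insert(pid);
        std::vector<int> unblocked;
        for (int waiter : dependents[pid]) {
            if (--pendingDeps[waiter] == 0) unblocked.push_back(waiter);
        }
        dependents.erase(pid);
        return unblocked;
    }
};
```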

About inputting jobs and job dependencies

New jobs are added synchronously between slices of processes. When adding a job, specify a new PID, expected execution time, memory required, and dependent jobs. If a job depends on jobs that have not yet finished, SharkBatch holds that job before pushing it to the top queue. It is possible to enter a dependent PID that the scheduler has never seen before.

Note: because SharkBatch is simulating process execution, the user must input an execution time that represents the total CPU burst the job requires. The scheduler does not use this number to make any decisions regarding time slices or prioritization, making it possible to remove this variable and instead make jobs block or quit whenever they are done with the CPU.
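A job entry therefore carries just these fields; the struct name and types below are assumptions for illustration, not SharkBatch's actual representation.

```cpp
// Fields a client supplies when adding a job, per the description above.
#include <vector>

struct JobSpec {
    int pid;                        // new, unique PID
    int expectedExecTime;           // total CPU burst in jiffies (simulation only;
                                    // not used for slice or priority decisions)
    int memoryRequired;             // memory the job needs
    std::vector<int> dependencies;  // PIDs this job waits on (may be unseen PIDs)
};
```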

Installation Troubleshooting

  • With earlier versions of NCurses, a Valgrind memory profile lists some memory blocks as "still reachable"; NCurses >= 6.0 is recommended. Update: <thread> and <chrono> cannot be included in Scheduler with the latest clang 3.9 (tested on Fedora 24); libstdc++ needs to be updated.

References

http://www.scs.stanford.edu/07au-cs140/notes/l5.pdf
http://inst.eecs.berkeley.edu/~cs162/sp11/sections/cs162-sp11-section5-answers.pdf
http://www.cs.tufts.edu/comp/111/
https://www.cis.upenn.edu/~sudipto/mypapers/throughput.pdf
https://www.cs.uic.edu/~jbell/CourseNotes/OperatingSystems/5_CPU_Scheduling.html
http://www.personal.kent.edu/~rmuhamma/OpSystems/Myos/cpuScheduling.htm
https://en.wikipedia.org/wiki/Scheduling_(computing)
https://en.wikipedia.org/wiki/Multilevel_feedback_queue
https://en.wikipedia.org/wiki/Fixed-priority_pre-emptive_scheduling
http://pages.cs.wisc.edu/~remzi/solaris-notes.pdf
http://dl.acm.org/citation.cfm?doid=321707.321717
