Skip to content

PerilousApricot/lstore-gop

 
 

Repository files navigation

============================================================================================
MQ Messages
============================================================================================
Each message is comprised of one or more frames.  Each frame is an arbitrary blob of data.
There are curently 6 different messages supported. The general message format is given below
where each line corresponds to a frame.

--------Sending message format------
address 1
address 2
....
address N
empty frame
version
Command
id
arg 1
arg 2
....
arg M
empty frame


------Receiving message format for TRACE_ROUTER-------
empty frame
version
Command
id
arg 1
arg 2
....
arg M
empty frame
address 1
address 2
....
address N


address fames
----------------------------------------
Each address frame corresponds to the next hop in routing form the source to the destination.
The simplest case only has a single address frame.

TRACE_ROUTER devices append the source address of the message on receipt allowing one to reverse
path taken and send a reply back to the message originator.

empty frame
----------------------------------------
Empty frames are used to separate an address from the rest of the command and it's arguments.
It's also used as a sanity check on message integrity.  

Version frame
----------------------------------------
Determines the message protocol formant to use.  Currently this value is stored in the constant
MQF_VERSION_KEY which is set to LMQv100.

Command frame
----------------------------------------
How to interpret the message.  The crrent available commands are:

MQF_PING_KEY - Ping command
MQF_PONG_KEY - Pong response to a ping command
MQF_EXEC_KEY - Send the message to the remote address.  No tracking or response is allowed.
   Upon receipt at the destination the message is passed to the application for execution.
MQF_TRACKEXEC_KEY - Same as MQF_EXEC_KEY but the command is tracked at the source.  This allows
   for detection of closed or dead hosts.  Optionally a MQF_TRACKADDRESSS_KEY can be sent 
   as a response while the remote host processes the command.  The message ID frame is used in
   any response (MQF_TRACKADDRESS_KEY or MQF_RESPONSE_KEY).
   Since the command is tracked a response is exptected in the form of a MQF_RESPONSE_KEY command.
MQF_TRACKADDRES_KEY - Optionally sent in response to an MQF_TRACKEXEC_KEY command.  Only needed
   if processing the command is deferred or may take some time.  In this case sending a track address
   comand allows the sender to know the command was received and is being processed.  The MQ connection
   sender sends periodic heartbeats to detect lost or dead servers.
MQF_RESPONSE_KEY - Contains the response to the MQF_TRACKEXEC_KEY command.

ID frame
----------------------------------------
Arbitrary data used as a UUID for the command

Argument frames
----------------------------------------
One or more frames passed to the application.


============================================================================================
Typical Message send/recv sequences.
============================================================================================
Below are listed some typical exchanges using a TRACE_ROUTER.
<addresss> is used to signify one or more address frames.
'/' is used to delineate frames.

-----Simple Ping/Pong-----
Client Sends: <address>/<empty>/LMQv100/PING/id/<empty>
Server Recvs: <empty>/LMQv100/PING/id/<empty>/<reversed client path>
Server Sends: <client path>/<empty>/LMQv100/PONG/id/<empty>
Client Recvs: <empty>/LMQv100/PONG/id/<empty><reversed server path>

-------- Simple EXEC with no tracking --------
Client sends: <address>/<empty>/LMQv100/EXEC/id/<args>/<empty>
Server Recvs: <empty>/LMQv100/EXEC/id/<args><empty>/<reversed client address>

-------- TRACKEXEC without heartbeating -----------
Client Sends: <address>/<empty>/LMQv100/TRACKEXEC/id/<args>/<empty>
Server Recvs: <empty>/LMQv100/TRACKEXEC/id/<args><empty>/<reversed client address>
Server Sends: <client address>/<empty>/LMQv100/RESPONSE/id/<response args><empty>
Client Recvs: <empty>/LMQv100/RESPONSE/id/<response args><empty><reversed server address>

-------- TRACKEXEC with heartbeating -----------
Client Sends: <address>/<empty>/LMQv100/TRACKEXEC/id/<args>/<empty>
Server Recvs: <empty>/LMQv100/TRACKEXEC/id/<args><empty>/<reversed client address>

Server Sends: <client address>/<empty>/LMQv100/TRACKADDRESS/id/<server address><empty>
Client Recvs: <empty>/LMQv100/TRACKADDRESS/id/<server address>><empty><reversed server address>

Once the TRACKADDRESS record is reicved the MQ portal then enables heartbeating to the server until
the command timeout is reached or a RESPONSE is received.  TRACKADDRESS commands are internal to the 
MQ system.  The application never sees these commands.

Server Sends: <client address>/<empty>/LMQv100/RESPONSE/id/<response args><empty>
Client Recvs: <empty>/LMQv100/RESPONSE/id/<response args><empty><reversed server address>


============================================================================================
MQ devices/socket types
============================================================================================
The MQ layer supports several different devices.  Additional devices can be added.  These are based on the 
ZeroMQ devices so look to those for queue limits, etc.

---------MQ_PAIR, MQ_DEALER---------
These are all straight 0MQ socket types.

---------MQ_ROUTER--------
This is also a straight 0MQ socket type but it's useful to understand the differences between it and
the other MQ_*_ROUTER types
Send: pop next hop off message and forward
Recv: push sender address onto message for pass to application

---------MQ_SIMPLE_ROUTER--------
Send: pop next hop off message and forward
Recv: pass to application unchanged

---------MQ_TRACE_ROUTER--------
Send: pop next hop off message and forward
Recv: append sender address and pass to application



============================================================================================
Source files
============================================================================================
There are just a handful of source files.  The design uses a generic abstraction but in reality
the current implementation is a wrapper around ZeroMQ routines.  This is obvious when looking at
the mq_portal.h header.

mq_portal.c - Main implementation source code.
mq_msg.c - Routines to manipulate messages and frames
mq_zmq.c - 0MQ wrappers for functionality.
mq_test.c - Test program

============================================================================================
Usage
============================================================================================
The MQ layer is designed to work within the GOP framework for dispatching tasks to remote
servers and recveiving responses.  It has the ability to provide network overlay and do 
complex routing between hosts.  Each task has a timeout and can be tracked.  This allows 
detection of remote host failures through heartbeating.  The number of connections
to a remote host varies between a user supplied min and max number of connections.  The command
backlog is used to decide when to spawn additional connections.  Likewise if a connection
is not sustainiang a user supplied minimum command throughput then the connection is culled.

Additionally an MQ portal can be created to accept incoming connections like that used on a 
server.  Tasks can be submitted either by the GOP tools or via mq_submit.  In server mode
user supplied commands can be installed for execution.  These key off the first argumaent frame
on EXEC and TRACKEXEC commands.

Typical usage for a client is:
------------------------------------
inip_file_t *ifd = ....
mq_context_t *mqc;
op_generic_t *gop;

//** Create the context
mqc = mq_create_context(ifd, "mq_context");

//** Generate the task
gop = new_mq_op(mqc, msg, client_response_pong, td, free, td->dt);

//** Wait for it to complete
gop_waitall(gop);

//** Destroy when finished
mq_destroy_context(mqc);
--------------------------------------

Look at client_test_thread() in mq_test.c for a fully working example.  The format for the repsonse handler,
client_response_pong(), in the above example is:

	op_status_t (*fn_response)(void *arg, int id);

The arg passed in is actually a pointer of type mq_task_t.  This structure is used for both GOP submissions
and server side tasks.  Right now we'll just focus on the GOP or client side meanings.

typedef struct {
  mq_msg_t *msg;          //** Actual message sent with address
  mq_msg_t *response;     //** Response message
  op_generic_t *gop;      //** GOP corresponding to the task.
  mq_context_t *ctx;      //** Portal context for sending responses.
  void *arg;              //** Optional argument when calling new_mq_op().  This was free() in the above example.
  apr_time_t timeout;     //** Initially the DT in sec for the command to complete and converted to abs timeout when sent
  void (*my_arg_free)(void *arg);  //** Function for cleaning up the GOP arg on GOP destruction.
} mq_task_t;


Likewise for the server side its:
--------------------------------------
mq_context_t *mqc;
mq_command_table_t *table;
uint64_t n;

log_printf(0, "START\n");

//** Make the server context
mqc = mq_create_context(ifd, "mq_context");

//** Make the server portal
server_portal = mq_portal_create(mqc, host, MQ_CMODE_SERVER);

//** Get the command table from the newly created MQ portal
table = mq_portal_command_table(server_portal);

//** Install your commands
mq_command_add(table, MQF_PING_KEY, MQF_PING_SIZE, NULL, cb_ping);

//** Make available for use
mq_portal_install(mqc, server_portal);

//** Wait for a shutdown using an eventFD.  Not reuqired Just one way to trigger a shutdown.
//**  Could also do it via another command
read(control_efd, &n, sizeof(n));

//** Destroy the portal
mq_destroy_context(mqc);

--------------------------------------

This code is directly lifted from server_test_mq_loop() in mq_test.c.

The format for commands used by the command table is pretty simple:
	void command_fn(void *arg, mq_task_t *task)

Arg is the void * pointer provided when adding the operation to the command table.
In the above it corresponds to the NULL in mq_command_add() above.

The other argument, task, uses the same task structure as discussed above but with slightly different meanings:

typedef struct {
  mq_msg_t *msg;          //** Message received
  mq_msg_t *response;     //** NULL, Unused
  op_generic_t *gop;      //** NULL, Unused
  mq_context_t *ctx;      //** Portal context for sending responses. (Server+GOP)
  void *arg;              //** Optional argument when calling mq_command_add().
  apr_time_t timeout;     //** Initially the DT in sec for the command to complete and converted to abs timeout when sent
  void (*my_arg_free)(void *arg);  //** NULL, Unused
} mq_task_t;


Servers normally send responses directly in more of a fire and forget approach, bypassing the GOP framework. In this case one
uses mq_submit():

	int mq_submit(mq_portal_t *p, mq_task_t *task);

The easist way to generate the task is to use the helper function:

	mq_task_t *mq_task_new(mq_context_t *ctx, mq_msg_t *msg, op_generic_t *gop, void *arg, int dt);

In the cb_ping() example above translates to

        err = mq_submit(server_portal, mq_task_new(server_portal->mqc, response, NULL, NULL, 5));

in proc_trackexec_ping() which is called from cb_ping();

You can look at cb_ping() in mq_test.c for the full example.



About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • C 92.1%
  • C++ 4.1%
  • CMake 3.7%
  • Shell 0.1%