Want to protect your cyber security and still get fast solutions? Ask a secure question today.Go Premium

x
?
Solved

Sun Grid Installaion

Posted on 2005-03-09
9
Medium Priority
?
214 Views
Last Modified: 2013-12-27
Hi,
I am trying to install Sun grid on my linux machines. I have installed the software on a server. Its configured as the qmaster,administrative host, submit host and execution host.
I am trying to install another system as the execution host. When i try to run the install_execd script it  exits saying it cannot contact the queue master. Can anyone help me out in this

Regards
Walter
0
Comment
Question by:wfaleiro
  • 5
  • 4
9 Comments
 
LVL 9

Expert Comment

by:David Piniella
ID: 13503191
can you contact the other host via other means (telnet/ssh) and is the other host running any sort of firewall? (hosts.allow/hosts.deny correct etc etc)
0
 
LVL 1

Author Comment

by:wfaleiro
ID: 13505968
yes i can contact the hosts and there is no firewall running at the moment.
0
 
LVL 9

Expert Comment

by:David Piniella
ID: 13506436
what do the logs say?
0
VIDEO: THE CONCERTO CLOUD FOR HEALTHCARE

Modern healthcare requires a modern cloud. View this brief video to understand how the Concerto Cloud for Healthcare can help your organization.

 
LVL 1

Author Comment

by:wfaleiro
ID: 13508095
03/12/2005 12:52:03|qmaster|lablin|I|read job database with 0 entries in 0 seconds
03/12/2005 12:52:03|qmaster|lablin|I|qmaster will use max. 1004 file descriptors for communication
03/12/2005 12:52:03|qmaster|lablin|I|qmaster will accept max. 99 dynamic event clients
03/12/2005 12:52:03|qmaster|lablin|E|no execd known on host lablin.marfic.local to send conf notification
03/12/2005 12:52:03|qmaster|lablin|I|starting up 6.0u3
0
 
LVL 9

Expert Comment

by:David Piniella
ID: 13508155
which machine is "03/12/2005 12:52:03|qmaster|lablin|E|no execd known on host lablin.marfic.local to send conf notification" this error on? the master or the execution host? looks like it's from the master, which if that's the case, it's not running the necessary daemon. I am not familiar enough with sun grid installs to help you w/ much detail, but I would try running the execd daemon on that host.
0
 
LVL 1

Author Comment

by:wfaleiro
ID: 13509118
[root@lablin etc]# cat /etc/services | grep sge
sge_qmaster     536/tcp
sge_execd       537/tcp
[root@lablin etc]#
0
 
LVL 1

Author Comment

by:wfaleiro
ID: 13509560
root@lablin named]# ps -ef | grep sge
sgeadmin  1470     1  0 14:09 ?        00:00:00 /opt/sge/bin/lx24-x86/sge_execd
sgeadmin  1846     1  0 14:12 ?        00:00:00 /opt/sge/bin/lx24-x86/sge_qmaster
sgeadmin  1865     1  0 14:13 ?        00:00:00 /opt/sge/bin/lx24-x86/sge_schedd
root      2650  1185  0 15:18 pts/1    00:00:00 grep sge
[root@lablin named]#
0
 
LVL 1

Author Comment

by:wfaleiro
ID: 13551103
Hi,
I was doing the installation incorrectly. The sge_root directory should be accessible to all the hosts who are supposed to be on Grid Network. So set the sge_rot via nfs and it worked. But now am thinking of how to mount it automatically every time the host needs it. Is automount a good option

thanks
Walter
0
 
LVL 9

Accepted Solution

by:
David Piniella earned 150 total points
ID: 13552272
if you're on a trusted network, automount will work. You can also set a script (boot up or cron job it) to mount the NFS share on the specific hosts when needed (or at boot time etc). I would probably do this rather than automount it because I am not as familiar with Solaris' automount as I could be, but I don't see any reason why automount should not work (although I understand that it can be considered a security risk by the cautious....YMMV.)
0

Featured Post

Technology Partners: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

When you do backups in the Solaris Operating System, the file system must be inactive. Otherwise, the output may be inconsistent. A file system is inactive when it's unmounted or it's write-locked by the operating system. Although the fssnap utility…
In tuning file systems on the Solaris Operating System, changing some parameters of a file system usually destroys the data on it. For instance, changing the cache segment block size in the volume of a T3 requires that you delete the existing volu…
Learn how to find files with the shell using the find and locate commands. Use locate to find a needle in a haystack.: With locate, check if the file still exists.: Use find to get the actual location of the file.:
In a previous video, we went over how to export a DynamoDB table into Amazon S3.  In this video, we show how to load the export from S3 into a DynamoDB table.
Suggested Courses

575 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question