Solved

Sun Grid Installation

Posted on 2005-03-09
Last Modified: 2013-12-27
Hi,
I am trying to install Sun Grid on my Linux machines. I have installed the software on one server, which is configured as the qmaster, administrative host, submit host, and execution host.
I am now trying to set up a second system as an execution host. When I run the install_execd script, it exits saying it cannot contact the queue master. Can anyone help me with this?

Regards
Walter
Question by:wfaleiro
9 Comments
 
LVL 9

Expert Comment

by:David Piniella
ID: 13503191
Can you contact the other host by other means (telnet/ssh)? Is the other host running any sort of firewall, and are hosts.allow and hosts.deny set up correctly?
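A quick way to test reachability from the new execution host is to try opening a TCP connection to the qmaster port. This is only a sketch: it assumes bash (it relies on bash's `/dev/tcp` redirection) and the coreutils `timeout` command, and `check_port` is a hypothetical helper name. The host `lablin` and port 536 in the example are taken from output posted later in this thread and may differ on other installations.

```shell
# check_port HOST PORT: succeed if a TCP connection can be opened within 3 seconds.
check_port() {
    local host=$1 port=$2
    # bash treats /dev/tcp/HOST/PORT as a request to open a TCP connection
    timeout 3 bash -c "exec 3<>/dev/tcp/${host}/${port}" 2>/dev/null
}

# Example (host and port are assumptions from this thread):
# check_port lablin 536 && echo "qmaster port reachable" || echo "cannot reach qmaster port"
```

A failure here points to a network, firewall, or port mismatch problem rather than to the installer itself.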
 
LVL 1

Author Comment

by:wfaleiro
ID: 13505968
Yes, I can contact the hosts, and there is no firewall running at the moment.
 
LVL 9

Expert Comment

by:David Piniella
ID: 13506436
What do the logs say?
 
LVL 1

Author Comment

by:wfaleiro
ID: 13508095
03/12/2005 12:52:03|qmaster|lablin|I|read job database with 0 entries in 0 seconds
03/12/2005 12:52:03|qmaster|lablin|I|qmaster will use max. 1004 file descriptors for communication
03/12/2005 12:52:03|qmaster|lablin|I|qmaster will accept max. 99 dynamic event clients
03/12/2005 12:52:03|qmaster|lablin|E|no execd known on host lablin.marfic.local to send conf notification
03/12/2005 12:52:03|qmaster|lablin|I|starting up 6.0u3
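In a longer messages file, the error entries can be picked out by filtering on the level field. This is a minimal sketch, assuming the pipe-separated layout shown above (timestamp|component|host|level|message); `print_errors` is a hypothetical helper name, and the file path in the example is an assumption about a default installation, not something confirmed in this thread.

```shell
# print_errors FILE: print only error-level ("E") entries from a qmaster messages file.
# Fields are separated by "|": timestamp|component|host|level|message
print_errors() {
    awk -F'|' '$4 == "E"' "$1"
}

# Example (path is an assumption; adjust to where your qmaster spools its messages):
# print_errors "$SGE_ROOT/default/spool/qmaster/messages"
```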
 
LVL 9

Expert Comment

by:David Piniella
ID: 13508155
Which machine is the error "03/12/2005 12:52:03|qmaster|lablin|E|no execd known on host lablin.marfic.local to send conf notification" on: the master or the execution host? It looks like it is from the master, and if that is the case, the master is not running the necessary daemon. I am not familiar enough with Sun Grid installs to help you in much detail, but I would try running the execd daemon on that host.
 
LVL 1

Author Comment

by:wfaleiro
ID: 13509118
[root@lablin etc]# cat /etc/services | grep sge
sge_qmaster     536/tcp
sge_execd       537/tcp
[root@lablin etc]#
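The same check is worth repeating on the new execution host, since both machines need to agree on these port numbers. A small sketch for comparing them, assuming the hosts take their ports from /etc/services rather than from the SGE_QMASTER_PORT and SGE_EXECD_PORT environment variables; `service_port` is a hypothetical helper name.

```shell
# service_port NAME [FILE]: print the port number registered for NAME.
# FILE defaults to /etc/services.
service_port() {
    awk -v name="$1" '$1 == name { split($2, a, "/"); print a[1]; exit }' "${2:-/etc/services}"
}

# Run on each host and compare the results, e.g.:
# service_port sge_qmaster
# service_port sge_execd
```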
 
LVL 1

Author Comment

by:wfaleiro
ID: 13509560
[root@lablin named]# ps -ef | grep sge
sgeadmin  1470     1  0 14:09 ?        00:00:00 /opt/sge/bin/lx24-x86/sge_execd
sgeadmin  1846     1  0 14:12 ?        00:00:00 /opt/sge/bin/lx24-x86/sge_qmaster
sgeadmin  1865     1  0 14:13 ?        00:00:00 /opt/sge/bin/lx24-x86/sge_schedd
root      2650  1185  0 15:18 pts/1    00:00:00 grep sge
[root@lablin named]#
 
LVL 1

Author Comment

by:wfaleiro
ID: 13551103
Hi,
I was doing the installation incorrectly. The sge_root directory must be accessible to all the hosts that are supposed to be on the grid network. I shared sge_root via NFS and it worked. Now I am wondering how to mount it automatically every time a host needs it. Is automount a good option?

thanks
Walter
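For reference, an on-demand mount on Linux might be configured roughly as below. This is a sketch under assumptions, not a tested setup: it assumes Linux autofs, that the SGE root is /opt/sge (inferred from the binary paths in the ps output above), and that lablin exports it over NFS. The map file name /etc/auto.sge is hypothetical.

```text
# /etc/auto.master: add a direct map (the map file name is hypothetical)
/-    /etc/auto.sge

# /etc/auto.sge: mount lablin:/opt/sge on /opt/sge when it is first accessed
/opt/sge    -fstype=nfs,rw    lablin:/opt/sge

# Alternative without autofs: a static entry in /etc/fstab, mounted at boot
lablin:/opt/sge    /opt/sge    nfs    rw    0 0
```

Use either the autofs map or the fstab entry, not both, for the same mount point.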
 
LVL 9

Accepted Solution

by:David Piniella (earned 150 total points)
ID: 13552272
If you're on a trusted network, automount will work. You can also use a script (run at boot or from a cron job) to mount the NFS share on the specific hosts when needed. I would probably do that rather than use automount, because I am not as familiar with Solaris' automount as I could be, but I don't see any reason why automount should not work (although I understand that the cautious can consider it a security risk; YMMV).
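The script approach could look something like this. It is a minimal sketch, assuming a Linux host with the util-linux `mountpoint` command and root privileges; `ensure_mounted` is a hypothetical helper name, and the server and path in the example (lablin, /opt/sge) are assumptions based on output earlier in this thread.

```shell
# ensure_mounted SERVER EXPORT DIR: mount the NFS export on DIR
# unless DIR is already a mount point.
ensure_mounted() {
    local server=$1 export_path=$2 dir=$3
    if mountpoint -q "$dir"; then
        return 0    # already mounted, nothing to do
    fi
    mount -t nfs "${server}:${export_path}" "$dir"
}

# Example call from a boot script or cron job (must run as root):
# ensure_mounted lablin /opt/sge /opt/sge
```

Because the function does nothing when the directory is already mounted, it is safe to call repeatedly from cron.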

Question has a verified solution.
