Sun Grid Installation

Hi,
I am trying to install Sun Grid on my Linux machines. I have installed the software on a server. It is configured as the qmaster, administrative host, submit host and execution host.
I am trying to set up another system as an execution host. When I run the install_execd script, it exits saying it cannot contact the queue master. Can anyone help me out with this?

Regards
Walter
wfaleiro Asked:
David Piniella Commented:
Can you contact the other host by other means (telnet/ssh), and is the other host running any sort of firewall (are hosts.allow/hosts.deny correct, etc.)?

wfaleiro Author Commented:
Yes, I can contact the hosts, and there is no firewall running at the moment.

David Piniella Commented:
What do the logs say?
wfaleiro Author Commented:
03/12/2005 12:52:03|qmaster|lablin|I|read job database with 0 entries in 0 seconds
03/12/2005 12:52:03|qmaster|lablin|I|qmaster will use max. 1004 file descriptors for communication
03/12/2005 12:52:03|qmaster|lablin|I|qmaster will accept max. 99 dynamic event clients
03/12/2005 12:52:03|qmaster|lablin|E|no execd known on host lablin.marfic.local to send conf notification
03/12/2005 12:52:03|qmaster|lablin|I|starting up 6.0u3
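
As a minimal sketch, assuming the messages file keeps the pipe-delimited layout shown above (timestamp, daemon, host, severity, message), the error-level entries can be pulled out by matching the fourth field. The function name `sge_errors` is made up for illustration.

```shell
#!/usr/bin/env bash
# Print only the error-level ("E") entries from a pipe-delimited SGE messages file.
# Field 4 is the severity: I = info, E = error (as seen in the log excerpt above).
# Usage: sge_errors FILE
sge_errors() {
    awk -F'|' '$4 == "E"' "$1"
}
```

It could be called as `sge_errors /opt/sge/default/spool/qmaster/messages`, assuming SGE_ROOT is /opt/sge and the cell is named default; the real path depends on the installation.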

David Piniella Commented:
Which machine is this error on, the master or the execution host?
"03/12/2005 12:52:03|qmaster|lablin|E|no execd known on host lablin.marfic.local to send conf notification"
It looks like it is from the master; if that is the case, the master is not running the necessary daemon. I am not familiar enough with Sun Grid installs to help you in much detail, but I would try running the execd daemon on that host.

wfaleiro Author Commented:
[root@lablin etc]# cat /etc/services | grep sge
sge_qmaster     536/tcp
sge_execd       537/tcp
[root@lablin etc]#
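
A rough sketch of how the same lookup, plus a reachability check, could be run from the new execution host. It assumes that host has the same sge_qmaster entry in its /etc/services, that bash (for its /dev/tcp redirection) and the coreutils `timeout` command are available, and that the qmaster is lablin.marfic.local as in the log above. The function names `service_port` and `port_open` are hypothetical helpers, not part of Sun Grid.

```shell
#!/usr/bin/env bash
# Print the TCP port registered for a service name in a services-format file.
# Usage: service_port NAME [FILE]   (FILE defaults to /etc/services)
service_port() {
    local name="$1" file="${2:-/etc/services}"
    awk -v n="$name" '$1 == n && $2 ~ /\/tcp$/ { sub(/\/tcp$/, "", $2); print $2; exit }' "$file"
}

# Return 0 if a TCP connection to HOST:PORT succeeds within 5 seconds.
# Uses bash's /dev/tcp pseudo-device, so no telnet or nc is required.
# Usage: port_open HOST PORT
port_open() {
    local host="$1" port="$2"
    timeout 5 bash -c "exec 3<>/dev/tcp/$host/$port" 2>/dev/null
}
```

For example, `port_open lablin.marfic.local "$(service_port sge_qmaster)" && echo reachable` on the execution host. An empty result from `service_port` would suggest the services entry is missing on that machine.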

wfaleiro Author Commented:
[root@lablin named]# ps -ef | grep sge
sgeadmin  1470     1  0 14:09 ?        00:00:00 /opt/sge/bin/lx24-x86/sge_execd
sgeadmin  1846     1  0 14:12 ?        00:00:00 /opt/sge/bin/lx24-x86/sge_qmaster
sgeadmin  1865     1  0 14:13 ?        00:00:00 /opt/sge/bin/lx24-x86/sge_schedd
root      2650  1185  0 15:18 pts/1    00:00:00 grep sge
[root@lablin named]#

wfaleiro Author Commented:
Hi,
I was doing the installation incorrectly. The sge_root directory must be accessible to all the hosts that are supposed to be on the grid network. I shared sge_root via NFS and it worked. Now I am wondering how to mount it automatically every time the host needs it. Is automount a good option?

thanks
Walter

David Piniella Commented:
If you're on a trusted network, automount will work. You can also set up a script (run at boot or from a cron job) to mount the NFS share on the specific hosts when needed. I would probably do this rather than automount, because I am not as familiar with Solaris' automount as I could be, but I don't see any reason why automount should not work (although I understand that the cautious can consider it a security risk; YMMV).
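
The script approach might look roughly like the sketch below. The export `lablin.marfic.local:/opt/sge` and the mount point `/opt/sge` are assumptions based on the paths earlier in this thread, and the helper names `is_mounted` and `ensure_sge_root` are hypothetical. The check reads /proc/mounts, so it is Linux-specific.

```shell
#!/usr/bin/env bash
# Assumed values for illustration only; adjust to the real site.
NFS_EXPORT="${NFS_EXPORT:-lablin.marfic.local:/opt/sge}"  # hypothetical export on the qmaster host
SGE_MOUNT="${SGE_MOUNT:-/opt/sge}"                        # hypothetical mount point on the execution host

# Return 0 if DIR is listed as a mount point in a /proc/mounts-format file.
# Usage: is_mounted DIR [MOUNTS_FILE]
is_mounted() {
    local dir="$1" mounts="${2:-/proc/mounts}"
    awk -v d="$dir" '$2 == d { found = 1 } END { exit !found }' "$mounts"
}

# Mount the share only if it is not mounted yet, so the script is safe
# to run repeatedly (from a boot script or from cron). Must run as root.
ensure_sge_root() {
    is_mounted "$SGE_MOUNT" && return 0
    mkdir -p "$SGE_MOUNT" && mount -t nfs "$NFS_EXPORT" "$SGE_MOUNT"
}
```

Calling `ensure_sge_root` at the end of the script performs the mount. A static /etc/fstab entry for the same export is another option if the share should always be mounted at boot.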

Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.