Google logo
Google Search Appliance Documentation

Installing the Google Search Appliance
PDF Previous Next
Installing

Installation Guide

This guide contains the information you need to install the Google Search Appliance (models GB-7007, GB-9009, G100, and G500) on your network and perform the initial software configuration on the search appliance. This guide also tells you how to set up the first crawl of the content on a web site or intranet.

For information about specific feature limitations, see Specifications and Usage Limits.

For installation information for other software versions and search appliance models, see the Archive page, which contains links to previous versions of the search appliance documentation.

Back to top

About This Document

This document is for you if you are a network administrator or if you need to install the Google Search Appliance. You need some knowledge of networking and web concepts before you install. If you are not a network administrator, work with your network administrator to ensure that all the requirements described in Planning for Search Appliance Installation are met.

About the Google Search Appliance

The Google Search Appliance is a one-stop search and index solution for organizations of all sizes. Using a search appliance, you can quickly deploy search on a web site or intranet. The search appliance comes with Google software installed on powerful hardware, simplifying the planning process because you do not need to choose a hardware platform or go through a complicated software configuration process.

Mount the Google Search Appliance on a rack in a location that meets the temperature and electricity requirements. For complete information on those technical requirements, read the section Electrical and Other Technical Requirements in Planning for Search Appliance Installation.

Back to top

Before You Install

Before you install a search appliance, read the installation planning document, Planning for Search Appliance Installation. You may have received the planning document as an attachment to the Welcome email you received after purchasing the search appliance. If not, read Planning for Search Appliance Installation. If you have read the planning document and collected the information described in the preinstallation checklists, you can proceed to install the Google Search Appliance.

You need a computer that is physically close to the search appliance and can be attached to the search appliance with an Ethernet cable. The computer should run a supported web browser. For a list of supported browsers, see the release notes for the software version that is running on the search appliance.

About Technical Support

Google provides technical support through a web application at the for Work Technical Support portal. The Google for Work Support Information web page (https://support.google.com/googleforwork/answer/142244#search) has information on how to connect to the Support portal.

You can also find useful information in the user forums on Google Groups for the Google Search Appliance (http://groups.google.com/group/Google-Search-Appliance).

About the Google for Work Technical Support Web Site

The Google for Work Technical Support web site provides the following information and services for all models of the Google Search Appliance:

Technical Support Requirements

Under the terms of the Technical Support Agreements for the Google Search Appliance, Google for Work Technical Support requires direct access to your search appliance to provide some types of support. For example, technical support engineers require direct access to determine whether a search appliance that is experiencing problems is eligible to be returned to Google and exchanged for a new search appliance. Different access methods have different requirements. The requirements for remote access are discussed in Remote access methods for technical support (http://support.google.com/gsa/answer/2644822).

After you create a technical support account, review the terms of the Technical Support Guidelines for your search appliance (https://support.google.com/gsa/answer/3246290).

Back to top

Installing the Google Search Appliance

This section contains instructions for connecting the Google Search Appliance to the network and performing the initial software configuration. Before you connect the search appliance, complete the preinstallation checklists, which are in Planning for Search Appliance Installation.

Installing the Google Search Appliance has two phases:

When you configure the software, you set network parameters so that the search appliance can communicate with the network and other computers on the network. After you configure the software, set up the initial crawl of the content. When the crawl and index processes are completed, the search appliance can begin serving content to end users.

The Welcome letter that is in the box with the Google Search Appliance also contains instructions for connecting the search appliance to the network and starting the search appliance. If you have already connected the search appliance to the network, you can skip to Configuring the Network Settings and continue configuring the software.

Connecting the Search Appliance to the Network

The installation process begins by connecting the search appliance to your network. Ensure that you have a laptop computer or desktop computer available that is located near the search appliance.

A Google Search Appliance has four Ethernet ports and they might be labeled LAN1 through LAN4.

All search appliances are shipped with two Ethernet cables and two power cables localized for your region. One Ethernet cable is yellow and the other is orange. The orange cable is considerably shorter than the yellow cable.

The required network configuration parameters, such as the IP address assigned to the search appliance, can be changed only from a computer that has physical access to the search appliance. Network access is insufficient for changing these parameters.

The following diagram illustrates the connections between a search appliance, the network, and the computer used for configuring the search appliance:

Note that the Google Search Appliance is provided with two power supplies and two power cables. Ensure that the search appliance is connected to an uninterruptible power supply (UPS) to ensure that the power source is clean and the search appliance has continuing power during a power failure.

You can optionally attach a monitor directly to the search appliance. The search appliance displays messages on the monitor indicating the progress of the start-up process and when the process is complete.

To connect the Google Search Appliance to the network:

The search appliance assigns the IP address 192.168.255.254 and subnet mask 255.255.255.0 to the computer connected to the search appliance. Most computers use dynamic host configuration protocol (DHCP) to obtain IP addresses. If your computer does not use DHCP to obtain IP addresses, you must manually assign the computer’s static IP address and subnet mask to these values, or temporarily change your computer’s network setting to allow use of DHCP to get IP addresses. For more information on enabling DHCP or manually setting IP addresses, see the help system or other documentation for your computer.

Continue to the next section, Configuring the Network Settings, and proceed with the instructions there.

First-Time Startup of Older Models of the Google Search Appliance

This section only applies to older search appliance models GB-7007 (T2-T3 series) and GB-9009 (U1-U2 series), but not to newer models G100 (T4) or G500 (U3).

The first time you start a Google Search Appliance, you may see an error message saying

The battery has recovered, but cached data was lost.
Press any key to continue or 'C' to load the configuration utility.

The error message indicates that the power has drained from a rechargeable battery for the RAID controller. The Google Search Appliance has not lost data and will function normally. The battery recharges in the 24 hours after the Google Search Appliance is powered up. So allow GSA to run for a few more hours, and then press any key to continue the boot.

Configuring the Network Settings

This section provides instructions for configuring the search appliance software so that the Google Search Appliance can connect to your network and communicate with the other computers located on the network. To configure the search appliance, you need the values described in the preinstallation checklists that are in Planning for Search Appliance Installation.

When you perform these steps, you run the Network and System Settings wizard from the computer that you connected to the search appliance. If the web browser on the computer uses a proxy to connect to the Internet, disable the proxy, which prevents access to the search appliance.

You can use these instructions after you initially configure the search appliance to change the search appliance’s network settings. When you change any network settings, only those settings are affected. The index is not reset or otherwise affected by the changes. You do not need to have the search appliance recrawl the content after changing the network settings.

To configure the search appliance software:

http://192.168.255.1:1111/
6.
Click Accept & Continue to Step 1.
If you accept the license agreement, you see the Network Settings page of the Network and System Settings wizard.
a.
Define a particular network speed and duplex setting for the network interface card by clicking the appropriate radio button in the Network Speed section. By default, the search appliance autonegotiates network speed and the duplex setting with your network.
b.
In the IP Address field, enter the IP address assigned to the search appliance.
c.
Edit the Subnet Mask field so that it matches the value on the completed preinstallation checklist. If the field contains a value, the search appliance obtained the value using DHCP.
d.
Edit the Default Gateway field so that it matches the value on the completed preinstallation checklist. If the field contains a value, the search appliance obtained the value using DHCP.
For an IPv6 configuration, to enable the search appliance to retrieve IPv6 settings automatically, click Auto Configuration and leave the other IPv6 settings blank. To configure IPv6 settings manually:
e.
In the IP Address field, enter the IP address assigned to the search appliance.
f.
In the Prefix Length field, specify the length of the address prefix used for search appliance administration.
g.
Edit the Default Gateway field so that it matches the value on the completed preinstallation checklist. If the field contains a value, the search appliance obtained the value using DHCP.
8.
To use the dedicated port (white label) for search appliance administration, check Dedicated Admin NIC and complete the following steps. The Admin NIC is IPv4 only.
a.
Define a particular network speed and duplex setting for the dedicated network interface card by clicking the appropriate radio button in the Network Speed section. By default, the search appliance autonegotiates network speed and the duplex setting with your network.
b.
In the IP address field, enter the IP address to use for search appliance administration.
c.
In the Subnet mask field, enter the subnet mask for search appliance administration.
d.
In the Gateway field, enter the gateway to be used for outbound search appliance administration traffic.
9.
In the DNS Servers field, enter a comma-separated list of Domain Name System (DNS) servers.
10.
In the DNS Suffix field, enter a comma-separate list of suffixes for DNS resolution.
products.example.com, 
mydivision.example.com,
support.example.com
11.
In the SMTP Server field, enter the name of a Simple Mail Transfer Protocol (SMTP) server, which the search appliance needs for sending notifications and alerts.
12.
In the Sender of Outgoing Email field, enter a name for the account that sends email messages and alerts from the search appliance.
13.
In the Time Zone field, select the time zone to use for times recorded in logs, for host load scheduling, and for other time-based information.
14.
In the NTP Servers field, enter a comma-separated list of IP addresses of Network Time Protocol (NTP) servers.
15.
In the Password field, enter a new password for the admin account, then retype the password to confirm it.
16.
In the Email Address field, enter the email address to which the search appliance sends notices intended for the admin account.
If you select Local Authentication, administrators and managers are authenticated using credentials you enter directly on the Admin Console.
If you select LDAP Authentication, administrators and managers are authenticated against an LDAP server. To use this option, you must initially connect to the Admin Console using the credentials you entered in step 15, then configure the LDAP administrator group and server. After you save the LDAP configuration, LDAP authentication for administrators and managers takes effect.
If you select Local and LDAP authentication, the search appliance attempts to authenticate administrators and managers against both the local credentials and the LDAP server. If an account can be authenticated against either the local credentials or the LDAP server, the login attempt succeeds.
18.
Click Continue.
19.
In the Test URLs field, enter one URL per line, representing locations of content you want crawled and indexed.
http://corp.example.com/
http://www.example.com/products/
20.
Click Check Access to Web Servers.
Congratulations! You have configured the appliance. If no warnings or errors 
have been displayed, you can now disconnect your laptop and use the appliance.
Use the Admin Console application for day-to-day administration.

Determining Whether the Software Version is Current

The software preinstalled on your Google Search Appliance might not be the most recent version. After you configure the search appliance, Google recommends that you check the software version on the search appliance, then visit the Support site to check for updates.

To determine the version of the software current installed on your search appliance, click the About link on any Admin Console page. You see a new page that displays the search appliance version.

Visit the following URL for instructions on how to log in to the Support portal:

When you visit the Support portal, navigate to the page that lists supported software versions, then determine which is the most recent version and the proper update path to follow.

The next section, Setting Up the Crawl, contains instructions for connecting to the Admin Console and configuring the initial crawl of your content.

Back to top

Setting Up the Crawl

Crawl is the process by which the Google Search Appliance locates content to be indexed. You define the start URLs to be crawled, URLs that are crawled or excluded from the crawl, and file types to include or exclude.

When you complete the process described in Configuring the Network Settings, the crawl is not started. This section contains instructions for connecting to the Admin Console, entering start URLs and URL patterns, starting the crawl process, and confirming that the crawl is proceeding normally. For complete information on crawl, start URLs, and URL patterns, see Administering Crawl.

To obtain context-sensitive help from any page in the Admin Console, click the Help link. You can also view help pages when you click the Help Center link in the horizontal blue bar in the upper right of the screen.

The high-level steps for setting up the initial crawl are:

Logging in to the Admin Console

The Admin Console is an application accessed with a web browser that provides the user interface for administering the Google Search Appliance.

You can log in to the Admin Console using HTTP or HTTPS:

It’s best to use HTTPS on port 8443.

To retain changes you make on any Admin Console page, click the Save button. If you navigate to another page without clicking Save, your changes are lost.

To log in to the Admin Console:

For secure access, type https://hostname:8443/ or https://IP_address:8443/, where hostname is the host name assigned to the search appliance or IP_address is the IP address assigned to the search appliance.
For insecure access, type http://hostname:8000/ or https://IP_address:8000/, where hostname is the host name assigned to the search appliance or IP_address is the IP address assigned to the search appliance.
3.
When the Admin Console login page appears, type admin in the user name field and type the password you assigned to the admin account during configuration in the password field.

After you log in to the Admin Console, you can configure crawling or perform other administrative tasks. If you cannot log in to the Admin Console, refer to Troubleshooting, search the search appliance documentation on code.google.com, or contact Google for Work Technical Support (https://support.google.com/googleforwork/answer/142244#search) for additional troubleshooting information.

The next section, Setting Up and Starting the Crawl, discusses how to configure the initial crawl of your content files. For more information on crawl and configuring crawl, see Administering Crawl.

Setting Up and Starting the Crawl

To set up and start the crawl:

1.
In the left-hand menu, click Content Sources > Web Crawl > Start and Block URLs.
2.
In the Start URLs field, type one or more start URLs.
protocol://host[:port]/[path]/
For example, http://dracula:2346/content. The information in the square brackets is optional.
3.
In the Follow Patterns field, copy all start URLs from the Start URLs field.
4.
In the Do Not Follow Patterns field, scroll through the list of patterns that can be blocked from being crawled.
5.
Click Save.
6.
In the left-hand menu, click Content Sources > Diagnostics > Crawl Status.
7.
Click Resume Crawl.

Checking the Crawl Status

You can check the progress of the crawl from the Home page.

To check the crawl status:

Checking the Serving Status

The Search > Diagnostics > Search Status page reports on search results serving. The graph shows a summary of queries per second by the half hour. If the crawl has just begun, there is a delay before the Search Status page shows any activity.

Back to top

After the Installation Process is Completed

After the installation process is finished, perform the following tasks:

Recording the Appliance ID

After you complete the installation process, record the identification number of the search appliance, which is useful in troubleshooting if the search appliance experiences any problems. You can find the appliance ID in the following locations:

On the Admin Console on the Administration > License page

Running Test Queries Using the Test Center

Run test queries in these circumstances:

To run test queries:

1.
Click the Test Center link in the horizontal blue bar at the top right of the page.

Providing a Search Page to Your Users

After the index is created and tested, your users need a search page for access to the index. There are two ways to do this:

Both techniques are discussed in Customizing the User Interface in Creating the Search Experience.

Exporting the Search Appliance Configuration

The Google Search Appliance saves configuration information internally. To back up this information, you can export it to a file. After a failure, or to revert your configuration, you can import the file back into the search appliance. It’s best to export the search appliance configuration on a regular basis, as well as any time you change the configuration.

To export the search appliance configuration information:

1.
Click Administration > Import/Export.
4.
Click Export.

Back to top

Starting and Shutting Down the Google Search Appliance

Use the instructions below to start or shut down the search appliance.

Starting the Search Appliance

This section contains instructions for starting the Google Search Appliance.

To start the search appliance:

The search appliance typically takes about ten minutes to initialize and run system checks, but this can take as much as twenty to thirty minutes.

If the search appliance was shut down improperly, the next initialization takes longer because the process includes disk verification and repair.

See the instructions in Shutting Down the Search Appliance for information on shutting down the search appliance safely.

Shutting Down the Search Appliance

This section contains instructions for safely shutting down the Google Search Appliance.

You must perform a safe shut down before you unplug the power cord. Do not shut down a running search appliance by unplugging it or by depressing the power button for four seconds or more. Use one of the methods described in this section.

If the search appliance is in use, tell your users that search will be off line before you shut down the search appliance. For more details about shutdown, see the Admin Console’s online help system.

Shutting Down the Search Appliance Normally

The following two methods are the preferred ways to shut down the search appliance.

To shut the search appliance down normally when you have network access:

3.
Click Administration > Shutdown.
4.
Click the button Shut the System Down Now.

To shut the search appliance down normally when you have physical access:

Rebooting the Search Appliance

Use the following instructions to reboot the search appliance from the Admin Console or from a physical or virtual keyboard.

To reboot the search appliance from the Admin Console:

3.
Click Administration > Shutdown.
4.
Click the button Reboot the System Now. The system reboots.

To reboot the search appliance from a physical or virtual keyboard:

2.
On the keyboard, press the Ctrl, Alt, and Delete keys simultaneously. The search appliance reboots.

Back to top

Troubleshooting

This section contains information on the following topics:

If you encounter any difficulties or technical problems while installing the Google Search Appliance, follow these steps:

2.
If the problem occurred while you configured the initial crawl, read Administering Crawl for Web and File System Content and the help system in the Admin Console.
3.
If you cannot find the solution to your problem in this document, Administering Crawl for Web and File System Content, or the help system in the Admin Console, see Fix a problem (https://support.google.com/gsa/topic/2707842).

In some circumstances, your technical support provider may ask you to attach a USB keyboard and monitor directly to the search appliance so that you can manually restart the search appliance.

The appliance ID (also known as the serial number) is on a tag on the back of the chassis and can also be found on the Admin Console on the Administration > License page.

Resolving Installation and Configuration Problems

The following table contains information on how to fix problems you might encounter during installation or configuration.

 

The search appliance starts and you connect a computer to the search appliance. The computer is not assigned the IP address 192.168.255.254.

You start a browser on the computer connected to the search appliance. You cannot connect to the Network and System Settings wizard at http://192.168.255.1:1111/

ping 192.168.255.1
ping 127.0.0.1

You type a URL in the Test URLs box in the configuration wizard and you see the error message URLs to Test: connection refused appears.

The server located at the test URL is not running a web server.

You type a URL in the Test URLs box in the configuration wizard and you see the error message URLs to Test: not a valid URL appears.

The URL does not include the protocol, the domain name, or additional path information. For example, http://www.google.com/ is a valid URL, but http://www.google.com is not valid because it does not include path information (the final slash “/”).

You type a URL in the URLS to Test box and you see the error message URLs to Test: connection timed out appears.

The server that you identified might have an access control list (ACL) that is blocking access.

On the DNS Settings page, you see the message DNS: connection refused: server down when you click Continue and attempt to navigate to the next page.

The server that you identified does not run the DNS service. The server cannot respond to a DNS request from the search appliance.

Network-Related Error Messages

This table lists common errors in network configurations that might cause the setup to fail. You might need to ask a system administrator for help.

 

Gateway unpingable

This typically means that the gateway or IP or netmask values were entered incorrectly during the initial software configuration for the search appliance.

Ensure that the correct values are entered on the configuration screens. Consult your system administrators to verify the correct values.

Cannot ping the DNS, NTP, mail, or content server.

The message is displayed on the configuration screen and indicates which type of server is unpingable. The message can apply to a DNS server, NTP server, content server, or mail server. You are encountering one of these problems:

Ensure that the hostname or IP address is correct. Ensure that the DNS, NTP, mail, or content server is running and start the server if it is not running. Ensure that the network or router policies allow the search appliance access to the server.

DNS: connection timed out

The search appliance established a connection to the DNS server, but the connection timed out. The DNS server is busy or network traffic is interfering with the connection.

Increase the capacity of the DNS server and monitor network traffic to correct any traffic problems.

Cannot resolve server (SMTP, SYSLOG, NTP servers, test URLs)

The hostname you specified is not known to the DNS servers.

Ensure that you entered the server name correctly and correct any typographical errors.

Unable to contact NTP server

The search appliance cannot contact the NTP server.

Ensure that you correctly identify an NTP server on the configuration screens and that the NTP server is running. Review the network configuration and any firewalls to ensure that they do not block network traffic to and from the search appliance. Do not attempt to operate the search appliance without identifying at least one NTP server.

Mail: connection refused

The search appliance cannot contact the SMTP server.

Ensure that you correctly identify an SMTP mail server on the configuration screens and that the SMTP server is running. Review the network configuration and any firewalls to ensure that they do not block network traffic to and from the search appliance.

URLs to Test: returns code nnn instead of 200, where n=any numeral

The web server specified responded, but returned an error code.

Common error codes:

401—user authentication required
403—forbidden
404—document not found

The URLs to Test section is to establish that the connectivity functions of the search appliance are working correctly. Use URLs to content that the search appliance can access without presenting credentials for authentication or authorization.

Note: A complete list of HTTP error codes is available from http://www.ietf.org/rfc/rfc2616.txt, on pages 39 and 40.

Back to top

Notices

This section contains safety and other notices.

Safety Notice

Read and follow all instructions marked on the product and in the documentation before you operate the search appliance.

Retain all safety and operating instructions for future use.

The product should be operated only from the type of power source indicated on the rating label.

If your computer has a voltage selector switch, make sure that the switch is in the proper position for your area. The voltage selector switch is set at the factory to the correct voltage.

The plug-socket combination must be accessible at all times because it serves as the main disconnecting device.

This product is shipped with a three-wire electrical grounding-type plug that fits only into a grounding-type power outlet. This is a safety feature. The equipment grounding should be in accordance with local and national electrical codes. The equipment operates safely when it is used in accordance with its marked electrical ratings and product usage instructions.

Do not use this product near water or a heat source.

Set up the system on a stable work surface so as to ensure stability of the system.

Openings in the case are provided for ventilation. Do not block or cover these openings. Make sure you provide adequate space around the system for ventilation when you set up your work area.

Never insert objects of any kind into the ventilation openings.

To avoid electrical shock, never attempt to remove the covers of the Google Search Appliance.

Always shut down the Google Search Appliance from the Admin Console before powering down the hardware.

If you are working with the modem or local area network:

Do not connect or use a modem or telephone during a lightning storm. There may be a risk of electrical shock from lightning.

To reduce the risk of fire, only use No. 26 AQG or larger telecommunications line cord.

Do not plug a modem or telephone cable into the network interface controller receptacle (NIC).

The device should be plugged into an uninterruptible power supply to make sure that the power source is clean and that the Google Search Appliance is powered in case of a power outage.

WARNING: Never disconnect the power cord while the search appliance is running. Shut the search appliance down from the System Shutdown page (in the Admin Console’s Main page) before disconnecting the power cord. Last, to remove all power from the unit, disconnect the power cord.

Search Appliance Security and Resilience

Anti-Hacking Measures

All services within the Google Search Appliance sit behind a firewall. The firewall is the main source of security for the device against malicious hackers. No communication can connect to or connect from these servers without first going through the firewall. The firewall allows only a small set of ports to pass into the system. For a complete list of the port numbers and their functions, see What Ports Does the Search Appliance Use? in Planning for Search Appliance Installation.

Lockdown Security

The key to the bezel is supplied with search appliances GB-7007 and GB-9009, which you may unlock only while installing the appliance in a racking system or replacing a hard drive. Do not attempt to open the case of any other search appliance model.

Compliance Notice

Federal Communications Commission (FCC) Statement

Note: This equipment has been tested and found to comply with the limits for a Class A digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against harmful Interference when the equipment is operated in a commercial environment. This equipment generates, uses and can radiate radio frequency energy and, if not Installed and used in accordance with the instruction manual, may cause harmful interference to radio communications. Operation of this equipment in a residential area is likely to cause harmful interference in which case the user will be required to correct the interference at his own expense.

Copyrights

This product includes Python software version 1.5.2, developed by Stichting Mathematisch Centrum.

Copyright © 1991–1995 by Stichting Mathematisch Centrum, Amsterdam, The Netherlands. All Rights Reserved.

Permission to use, copy, modify, and distribute this software and its documentation for any purpose and without fee is hereby granted, provided that the above copyright notice appear in all copies and that both that copyright notice and this permission notice appear in supporting documentation, and that the names of Stichting Mathematisch Centrum or CWI or Corporation for National Research Initiatives or CNRI not be used in advertising or publicity pertaining to distribution of the software without specific, written prior permission.

This product includes timeoutsocket.py

Copyright 2000, 2001 by Timothy O’Malley <timo@alum.mit.edu> All Rights Reserved

Permission to use, copy, modify, and distribute this software and its documentation for any purpose and without fee is hereby granted, provided that the above copyright notice appear in all copies and that both that copyright notice and this permission notice appear in supporting documentation, and that the name of Timothy O’Malley not be used in advertising or publicity pertaining to distribution of the software without specific, written prior permission.

Timothy O’Malley DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL Timothy O’Malley BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION ITH THE USE OR PERFORMANCE OF THIS SOFTWARE.

This product includes Outside In® HTML Export

© 1992-2002 2003 Stellent Chicago, Inc. All rights reserved.