[openstack-dev] [kolla] ceph osd deploy fails
Florian Engelmann
florian.engelmann at everyware.ch
Wed Sep 26 12:31:57 UTC 2018
Hi,
I tried to deploy Rocky in a multinode setup but ceph-osd fails with:
failed: [xxxxxxxxxxx-poc2] (item=[0, {u'fs_uuid': u'', u'bs_wal_label':
u'', u'external_journal': False, u'bs_blk_label': u'',
u'bs_db_partition_num': u'', u'journal_device': u'', u'journal': u'',
u'partition': u'/dev/nvme0n1', u'bs_wal_partition_num': u'',
u'fs_label': u'', u'journal_num': 0, u'bs_wal_device': u'',
u'partition_num': u'1', u'bs_db_label': u'', u'bs_blk_partition_num':
u'', u'device': u'/dev/nvme0n1', u'bs_db_device': u'',
u'partition_label': u'KOLLA_CEPH_OSD_BOOTSTRAP_BS', u'bs_blk_device':
u''}]) => {
    "changed": true,
    "item": [
        0,
        {
            "bs_blk_device": "",
            "bs_blk_label": "",
            "bs_blk_partition_num": "",
            "bs_db_device": "",
            "bs_db_label": "",
            "bs_db_partition_num": "",
            "bs_wal_device": "",
            "bs_wal_label": "",
            "bs_wal_partition_num": "",
            "device": "/dev/nvme0n1",
            "external_journal": false,
            "fs_label": "",
            "fs_uuid": "",
            "journal": "",
            "journal_device": "",
            "journal_num": 0,
            "partition": "/dev/nvme0n1",
            "partition_label": "KOLLA_CEPH_OSD_BOOTSTRAP_BS",
            "partition_num": "1"
        }
    ]
}
MSG:
Container exited with non-zero return code 2
We tried to debug the error by starting the container with a
modified entrypoint, but we are stuck at the following point:
docker run \
  -e "HOSTNAME=10.0.153.11" \
  -e "JOURNAL_DEV=" \
  -e "JOURNAL_PARTITION=" \
  -e "JOURNAL_PARTITION_NUM=0" \
  -e "KOLLA_BOOTSTRAP=null" \
  -e "KOLLA_CONFIG_STRATEGY=COPY_ALWAYS" \
  -e "KOLLA_SERVICE_NAME=bootstrap-osd-0" \
  -e "OSD_BS_BLK_DEV=" \
  -e "OSD_BS_BLK_LABEL=" \
  -e "OSD_BS_BLK_PARTNUM=" \
  -e "OSD_BS_DB_DEV=" \
  -e "OSD_BS_DB_LABEL=" \
  -e "OSD_BS_DB_PARTNUM=" \
  -e "OSD_BS_DEV=/dev/nvme0n1" \
  -e "OSD_BS_LABEL=KOLLA_CEPH_OSD_BOOTSTRAP_BS" \
  -e "OSD_BS_PARTNUM=1" \
  -e "OSD_BS_WAL_DEV=" \
  -e "OSD_BS_WAL_LABEL=" \
  -e "OSD_BS_WAL_PARTNUM=" \
  -e "OSD_DEV=/dev/nvme0n1" \
  -e "OSD_FILESYSTEM=xfs" \
  -e "OSD_INITIAL_WEIGHT=1" \
  -e "OSD_PARTITION=/dev/nvme0n1" \
  -e "OSD_PARTITION_NUM=1" \
  -e "OSD_STORETYPE=bluestore" \
  -e "USE_EXTERNAL_JOURNAL=false" \
  -v "/etc/kolla//ceph-osd/:/var/lib/kolla/config_files/:ro" \
  -v "/etc/localtime:/etc/localtime:ro" \
  -v "/dev/:/dev/" \
  -v "kolla_logs:/var/log/kolla/" \
  -ti --privileged=true \
  --entrypoint /bin/bash \
  10.0.128.7:5000/openstack/openstack-kolla-cfg/ubuntu-source-ceph-osd:7.0.0.3
cat /var/lib/kolla/config_files/ceph.client.admin.keyring > /etc/ceph/ceph.client.admin.keyring
cat /var/lib/kolla/config_files/ceph.conf > /etc/ceph/ceph.conf
(bootstrap-osd-0)[root@985e2dee22bc /]# /usr/bin/ceph-osd -d --public-addr 10.0.153.11 --cluster-addr 10.0.153.11
usage: ceph-osd -i <ID> [flags]
  --osd-data PATH data directory
  --osd-journal PATH
                    journal file or block device
  --mkfs            create a [new] data directory
  --mkkey           generate a new secret key. This is normally used in
                    combination with --mkfs
  --convert-filestore
                    run any pending upgrade operations
  --flush-journal   flush all data out of journal
  --mkjournal       initialize a new journal
  --check-wants-journal
                    check whether a journal is desired
  --check-allows-journal
                    check whether a journal is allowed
  --check-needs-journal
                    check whether a journal is required
  --debug_osd <N>   set debug level (e.g. 10)
  --get-device-fsid PATH
                    get OSD fsid for the given block device
  --conf/-c FILE    read configuration from the given configuration file
  --id/-i ID        set ID portion of my name
  --name/-n TYPE.ID set name
  --cluster NAME    set cluster name (default: ceph)
  --setuser USER    set uid to user or uid (and gid to user's gid)
  --setgroup GROUP  set gid to group or gid
  --version         show version and quit
  -d                run in foreground, log to stderr.
  -f                run in foreground, log to usual location.
  --debug_ms N      set message debug level (e.g. 1)
2018-09-26 12:28:07.801066 7fbda64b4e40 0 ceph version 12.2.4 (52085d5249a80c5f5121a76d6288429f35e4e77b) luminous (stable), process (unknown), pid 46
2018-09-26 12:28:07.801078 7fbda64b4e40 -1 must specify '-i #' where # is the osd number
But it looks like "-i" is not set anywhere?
grep command /opt/stack/kolla-ansible/ansible/roles/ceph/templates/ceph-osd.json.j2
"command": "/usr/bin/ceph-osd -f --public-addr {{
hostvars[inventory_hostname]['ansible_' +
storage_interface]['ipv4']['address'] }} --cluster-addr {{
hostvars[inventory_hostname]['ansible_' +
cluster_interface]['ipv4']['address'] }}",
What's wrong with our setup?
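
For reference, the usage text and the "must specify '-i #'" message above suggest that a manual debug run needs an explicit OSD id. The following is only a minimal sketch under that assumption: build_osd_cmd is a hypothetical helper, the id 0 is a placeholder, and the address is the one used in the docker run above. It only prints the command line and does not show how kolla itself passes the id to ceph-osd.

```shell
#!/bin/sh
# Hypothetical helper (not part of kolla): print a ceph-osd command line
# for a manual debug run. The OSD id and address are supplied by the caller.
build_osd_cmd() {
    osd_id="$1"
    addr="$2"
    # ceph-osd refuses to start without '-i <ID>', so reject empty or
    # non-numeric ids before building the command.
    case "$osd_id" in
        ''|*[!0-9]*)
            echo "error: OSD id must be a number, got '${osd_id}'" >&2
            return 1
            ;;
    esac
    printf '/usr/bin/ceph-osd -d -i %s --public-addr %s --cluster-addr %s\n' \
        "$osd_id" "$addr" "$addr"
}

# Example: print the command for a placeholder OSD id 0.
build_osd_cmd 0 10.0.153.11
```

The helper only echoes the command so it can be reviewed before being run inside the container; the actual id would have to match an OSD that exists in the cluster.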
All the best,
Flo
--
EveryWare AG
Florian Engelmann
Systems Engineer
Zurlindenstrasse 52a
CH-8003 Zürich
tel: +41 44 466 60 00
fax: +41 44 466 60 10
mail: mailto:florian.engelmann at everyware.ch
web: http://www.everyware.ch