/[base]/head/sys/geom/geom_disk.c
ViewVC logotype

Log of /head/sys/geom/geom_disk.c

Parent Directory Parent Directory | Revision Log Revision Log


Links to HEAD: (view) (download) (annotate)
Sticky Revision:


Revision 367071 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 27 06:43:24 2020 UTC (3 years, 8 months ago) by imp
File length: 27138 byte(s)
Diff to previous 365226
Remove frontstuff

Nothing implements this in the tree. Remove the ioctl and the
conversion to the geom atttribute stuff.

This was introduced in r94287 in 2002 and was retired in r113390
2003. It appeared in FreeBSD 5.0, but no other releases. This is a
vestige that was missed at the time and overlooked until now. No
compat is provided for this reason.  And there's no implementation of
it today. And it was never part of a release from a stable branch.

Reviewed by: phk@
Differential Revision: https://reviews.freebsd.org/D26967


Revision 365226 - (view) (download) (annotate) - [select for diffs]
Modified Tue Sep 1 22:14:09 2020 UTC (3 years, 10 months ago) by mjg
File length: 27206 byte(s)
Diff to previous 364442
geom: clean up empty lines in .c and .h files


Revision 364442 - (view) (download) (annotate) - [select for diffs]
Modified Fri Aug 21 00:03:24 2020 UTC (3 years, 10 months ago) by imp
File length: 27207 byte(s)
Diff to previous 361015
Use devctl.h instead of bus.h to reduce newbus pollution.

There's no need for these parts of the kernel to know about newbus,
so narrow what is included to devctl.h for device_notify_*.

Suggested by: kib@


Revision 361015 - (view) (download) (annotate) - [select for diffs]
Modified Wed May 13 19:17:28 2020 UTC (4 years, 1 month ago) by imp
File length: 27204 byte(s)
Diff to previous 359718
Reimplement aliases in geom

The alias needs to be part of the provider instead of the geom to work
properly. To bind the DEV geom, we need to look at the provider's names and
aliases and create the dev entries from there. If this lives in the GEOM, then
it won't propigate down the tree properly. Remove it from geom, add it provider.

Update geli, gmountver, gnop, gpart, and guzip to use it, which handles the bulk
of the uses in FreeBSD. I think this is all the providers that create a new name
based on their parent's name.


Revision 359718 - (view) (download) (annotate) - [select for diffs]
Modified Tue Apr 7 22:23:22 2020 UTC (4 years, 2 months ago) by imp
File length: 27249 byte(s)
Diff to previous 358333
Now that we don't have special-case geom hacking defined in md_var.h, stop
including it. sparc64 was the last straggler here, but these weren't removed at
the time.


Revision 358333 - (view) (download) (annotate) - [select for diffs]
Modified Wed Feb 26 14:26:36 2020 UTC (4 years, 4 months ago) by kaktus
File length: 27277 byte(s)
Diff to previous 357363
Mark more nodes as CTLFLAG_MPSAFE or CTLFLAG_NEEDGIANT (17 of many)

r357614 added CTLFLAG_NEEDGIANT to make it easier to find nodes that are
still not MPSAFE (or already are but aren’t properly marked).
Use it in preparation for a general review of all nodes.

This is non-functional change that adds annotations to SYSCTL_NODE and
SYSCTL_PROC nodes using one of the soon-to-be-required flags.

Mark all obvious cases as MPSAFE.  All entries that haven't been marked
as MPSAFE before are by default marked as NEEDGIANT

Approved by:	kib (mentor, blanket)
Commented by:	kib, gallatin, melifaro
Differential Revision:	https://reviews.freebsd.org/D23718


Revision 357363 - (view) (download) (annotate) - [select for diffs]
Modified Sat Feb 1 10:15:23 2020 UTC (4 years, 5 months ago) by pjd
File length: 27223 byte(s)
Diff to previous 356818
The error variable is not really needed. Remove it.


Revision 356818 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jan 17 01:15:55 2020 UTC (4 years, 5 months ago) by imp
File length: 27251 byte(s)
Diff to previous 356214
Pass BIO_SPEEDUP through all the geom layers

While some geom layers pass unknown commands down, not all do. For the ones that
don't, pass BIO_SPEEDUP down to the providers that constittue the geom, as
applicable. No changes to vinum or virstor because I was unsure how to add this
support, and I'm also unsure how to test these. gvinum doesn't implement
BIO_FLUSH either, so it may just be poorly maintained. gvirstor is for testing
and not supportig BIO_SPEEDUP is fine.

Reviewed by: chs
Differential Revision: https://reviews.freebsd.org/D23183


Revision 356214 - (view) (download) (annotate) - [select for diffs]
Modified Tue Dec 31 03:43:13 2019 UTC (4 years, 6 months ago) by mav
File length: 27032 byte(s)
Diff to previous 356200
Avoid few memory accesses in g_disk_done().


Revision 356200 - (view) (download) (annotate) - [select for diffs]
Modified Mon Dec 30 03:13:38 2019 UTC (4 years, 6 months ago) by mav
File length: 27039 byte(s)
Diff to previous 355451
Use atomic for start_count in devstat_start_transaction().

Combined with earlier nstart/nend removal it allows to remove several locks
from request path of GEOM and few other places.  It would be cool if we had
more SMP-friendly statistics, but this helps too.

Sponsored by:	iXsystems, Inc.


Revision 355451 - (view) (download) (annotate) - [select for diffs]
Modified Fri Dec 6 16:48:36 2019 UTC (4 years, 6 months ago) by mav
File length: 27272 byte(s)
Diff to previous 354571
Remove some branching from GEOM_DISK hot path.

pp->private just can not be NULL in those places.

In g_disk_start() and g_disk_ioctl() both dp != NULL and !dp->d_destroyed
should always be true if disk_gone() and disk_destroy() are used properly,
since GEOM does not send requests to errored providers.  If the protocol is
not followed, then no amount of additional checks here give real safety.

In g_disk_access() though the checks are useful, since GEOM blocks only
new opens for errored providers, but allows closes.  It should not happen
if disk_gone() and disk_destroy() are used properly, but may otherwise.

To improve cases when disk_gone() is not used, call it from disk_destroy().
It does not give full guaranties, but it errors the provider and makes
GEOM block unwanted requests at least after some race.

MFC after:	2 weeks


Revision 354571 - (view) (download) (annotate) - [select for diffs]
Modified Sat Nov 9 17:30:19 2019 UTC (4 years, 7 months ago) by trasz
File length: 26942 byte(s)
Diff to previous 349195
Add GEOM attribute to report physical device name, and report it
via 'diskinfo -v'.  This avoids the need to track it down via CAM,
and should also work for disks that don't use CAM.  And since it's
inherited thru the GEOM hierarchy, in most cases one doesn't need
to walk the GEOM graph either, eg you can use it on a partition
instead of disk itself.

Reviewed by:	allanjude, imp
Sponsored by:	Klara Inc
Differential Revision:	https://reviews.freebsd.org/D22249


Revision 349195 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jun 19 15:36:02 2019 UTC (5 years ago) by mav
File length: 26856 byte(s)
Diff to previous 327996
Use sbuf_cat() in GEOM confxml generation.

When it comes to megabytes of text, difference between sbuf_printf() and
sbuf_cat() becomes substantial.

MFC after:	2 weeks
Sponsored by:	iXsystems, Inc.


Revision 327996 - (view) (download) (annotate) - [select for diffs]
Modified Mon Jan 15 11:20:00 2018 UTC (6 years, 5 months ago) by avg
File length: 26934 byte(s)
Diff to previous 326270
geom_disk / scsi_da: deny opening write-protected disks for writing

Ths change consists of two parts.

geom_disk: deny opening a disk for writing if it's marked as
write-protected.  A new disk(9) flag is added to mark write protected
disks.  A possible alternative could be to add another parameter to d_open,
so that the open mode could be passed to it and the disk drivers could
make the decision internally, but the flag required less churn.

scsi_da: add a new phase of disk probing to query the all pages mode
sense page.  We can determine if the disk is write protected using bit 7
of the device specific field in the mode parameter header returned by
MODE SENSE.

PR:		224037
Reviewed by:	mav
MFC after:	4 weeks
Differential Revision: https://reviews.freebsd.org/D13360


Revision 326270 - (view) (download) (annotate) - [select for diffs]
Modified Mon Nov 27 15:17:37 2017 UTC (6 years, 7 months ago) by pfg
File length: 26737 byte(s)
Diff to previous 322198
sys/geom: adoption of SPDX licensing ID tags.

Mainly focus on files that use BSD 2-Clause license, however the tool I
was using misidentified many licenses so this was mostly a manual - error
prone - task.

The Software Package Data Exchange (SPDX) group provides a specification
to make it easier for automated tools to detect and summarize well known
opensource licenses. We are gradually adopting the specification, noting
that the tags are considered only advisory and do not, in any way,
superceed or replace the license texts.


Revision 322198 - (view) (download) (annotate) - [select for diffs]
Modified Mon Aug 7 21:12:38 2017 UTC (6 years, 10 months ago) by imp
File length: 26693 byte(s)
Diff to previous 320729
Expose API to allow disks to ask for alias names in devfs.

Implement disk_add_alias to allow aliases to be added to disks. All
disk have a primary name (say "foo") can also have secondary names
(say "bar") such that all instances of "foo" also have a "bar"
alias. So if you have foo0, foo0p1, foo1, foo1s1 and foo1s1a nodes
created by the foo driver and gpart, device nodes bar0, bar0p1, bar1,
bar1s1 and bar1s1a will appear as symlinks back to the original nodes.
This generalizes to multiple aliases. However, since the unit number
follows the primary name, multiple device drivers can't create the
same aliases unless those drives coorinate the unit number space (eg
you couldn't add an alias 'disk' to both 'da' and 'ada' because it's
possible to have da0 and ada0, because 'disk0' is ambiguous).

Differential Revision: https://reviews.freebsd.org/D11873


Revision 320729 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jul 6 08:36:14 2017 UTC (6 years, 11 months ago) by mav
File length: 26046 byte(s)
Diff to previous 315195
Add GEOM::descr attribute for symmetry with GEOM::ident.

MFC after:	2 weeks


Revision 315195 - (view) (download) (annotate) - [select for diffs]
Modified Mon Mar 13 11:09:17 2017 UTC (7 years, 3 months ago) by scottl
File length: 25975 byte(s)
Diff to previous 311971
Report disk flags via the sysctl tree


Revision 311971 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jan 12 08:53:10 2017 UTC (7 years, 5 months ago) by mav
File length: 25313 byte(s)
Diff to previous 308155
Report random flash storage as non-rotating to GEOM_DISK.

While doing it, introduce respective constants in geom_disk.h.

MFC after:	1 week


Revision 308155 - (view) (download) (annotate) - [select for diffs]
Modified Mon Oct 31 23:09:52 2016 UTC (7 years, 8 months ago) by cem
File length: 25318 byte(s)
Diff to previous 303637
Add BUF_TRACKING and FULL_BUF_TRACKING buffer debugging

Upstream the BUF_TRACKING and FULL_BUF_TRACKING buffer debugging code.
This can be handy in tracking down what code touched hung bios and bufs
last. The full history is especially useful, but adds enough bloat that
it shouldn't be enabled in release builds.

Function names (or arbitrary string constants) are tracked in a
fixed-size ring in bufs. Bios gain a pointer to the upper buf for
tracking. SCSI CCBs gain a pointer to the upper bio for tracking.

Reviewed by:	markj
Sponsored by:	Dell EMC Isilon
Differential Revision:	https://reviews.freebsd.org/D8366


Revision 303637 - (view) (download) (annotate) - [select for diffs]
Modified Mon Aug 1 20:54:54 2016 UTC (7 years, 11 months ago) by ae
File length: 25292 byte(s)
Diff to previous 303019
Do not invoke resize event if initial disk size is zero. Some disks
report the size only after first opening.  And due to the events are
asynchronous, some consumers can receive this event too late and
this confuses them. This partially restores previous behaviour, and
at the same time this should fix the problem, when already opened
provider loses resize event.

PR:		211028
MFC after:	3 weeks


Revision 303019 - (view) (download) (annotate) - [select for diffs]
Modified Tue Jul 19 05:36:21 2016 UTC (7 years, 11 months ago) by ae
File length: 25093 byte(s)
Diff to previous 302150
Use g_resize_provider() to change the size of GEOM_DISK provider,
when it is being opened. This should fix the possible loss of a resize
event when disk capacity changed.

PR:		211028
Reported by:	Dexuan Cui <decui at microsoft dot com>
MFC after:	3 weeks


Revision 302150 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jun 23 20:05:59 2016 UTC (8 years ago) by ken
File length: 25086 byte(s)
Diff to previous 302069
Switch geom_disk over to using a pool mutex.

The GEOM disk d_mtx is only acquired on disk creation and destruction.
It is a good candidate for replacement with a pool mutex.  This eliminates
the mutex initialization and teardown and the mutex and name variables
themselves from struct disk.

sys/geom/geom_disk.h:
	Take d_mtx and d_mtx_name out of struct disk.

sys/geom/geom_disk.c:
	Use mtx_pool_lock() and mtx_pool_unlock() to guard the disk
	initialization state instead of a dedicated mutex.

	This allows removing the initialization and destruction of
	d_mtx.

sys/sys/param.h:
	Bump __FreeBSD_version to 1100119 for the change to struct disk.

Suggested by:	jhb
Sponsored by:	Spectra Logic
Approved by:	re (gjb)


Revision 302069 - (view) (download) (annotate) - [select for diffs]
Modified Tue Jun 21 20:18:19 2016 UTC (8 years ago) by ken
File length: 25148 byte(s)
Diff to previous 300207
Fix a bug that caused da(4) instances to hang around after the underlying
device is gone.

The problem was that when disk_gone() is called, if the GEOM disk
creation process has not yet happened, the withering process
couldn't start.

We didn't record any state in the GEOM disk code, and so the d_gone()
callback to the da(4) driver never happened.

The solution is to track the state of the creation process, and
initiate the withering process from g_disk_create() if the disk is
being created.

This change does add fields to struct disk, and so I have bumped
DISK_VERSION.

geom_disk.c:	Track where we are in the disk creation process,
		and check to see whether our underlying disk has
		gone away or not.

		In disk_gone(), set a new d_goneflag variable that
		g_disk_create() can check to see if it needs to
		clean up the disk instance.

geom_disk.h:    Add a mutex to struct disk (for internal use) disk
		init level, and a gone flag.

		Bump DISK_VERSION because the size of struct disk has
		changed and fields have been added at the beginning.

Sponsored by:	Spectra Logic
Approved by:	re (marius)


Revision 300207 - (view) (download) (annotate) - [select for diffs]
Modified Thu May 19 14:08:36 2016 UTC (8 years, 1 month ago) by ken
File length: 23605 byte(s)
Diff to previous 298808
Add support for managing Shingled Magnetic Recording (SMR) drives.

This change includes support for SCSI SMR drives (which conform to the
Zoned Block Commands or ZBC spec) and ATA SMR drives (which conform to
the Zoned ATA Command Set or ZAC spec) behind SAS expanders.

This includes full management support through the GEOM BIO interface, and
through a new userland utility, zonectl(8), and through camcontrol(8).

This is now ready for filesystems to use to detect and manage zoned drives.
(There is no work in progress that I know of to use this for ZFS or UFS, if
anyone is interested, let me know and I may have some suggestions.)

Also, improve ATA command passthrough and dispatch support, both via ATA
and ATA passthrough over SCSI.

Also, add support to camcontrol(8) for the ATA Extended Power Conditions
feature set.  You can now manage ATA device power states, and set various
idle time thresholds for a drive to enter lower power states.

Note that this change cannot be MFCed in full, because it depends on
changes to the struct bio API that break compatilibity.  In order to
avoid breaking the stable API, only changes that don't touch or depend on
the struct bio changes can be merged.  For example, the camcontrol(8)
changes don't depend on the new bio API, but zonectl(8) and the probe
changes to the da(4) and ada(4) drivers do depend on it.

Also note that the SMR changes have not yet been tested with an actual
SCSI ZBC device, or a SCSI to ATA translation layer (SAT) that supports
ZBC to ZAC translation.  I have not yet gotten a suitable drive or SAT
layer, so any testing help would be appreciated.  These changes have been
tested with Seagate Host Aware SATA drives attached to both SAS and SATA
controllers.  Also, I do not have any SATA Host Managed devices, and I
suspect that it may take additional (hopefully minor) changes to support
them.

Thanks to Seagate for supplying the test hardware and answering questions.

sbin/camcontrol/Makefile:
	Add epc.c and zone.c.

sbin/camcontrol/camcontrol.8:
	Document the zone and epc subcommands.

sbin/camcontrol/camcontrol.c:
	Add the zone and epc subcommands.

	Add auxiliary register support to build_ata_cmd().  Make sure to
	set the CAM_ATAIO_NEEDRESULT, CAM_ATAIO_DMA, and CAM_ATAIO_FPDMA
	flags as appropriate for ATA commands.

	Add a new get_ata_status() function to parse ATA result from SCSI
	sense descriptors (for ATA passthrough over SCSI) and ATA I/O
	requests.

sbin/camcontrol/camcontrol.h:
	Update the build_ata_cmd() prototype

	Add get_ata_status(), zone(), and epc().

sbin/camcontrol/epc.c:
	Support for ATA Extended Power Conditions features.  This includes
	support for all features documented in the ACS-4 Revision 12
	specification from t13.org (dated February 18, 2016).

	The EPC feature set allows putting a drive into a power power mode
	immediately, or setting timeouts so that the drive will
	automatically enter progressively lower power states after various
	idle times.

sbin/camcontrol/fwdownload.c:
	Update the firmware download code for the new build_ata_cmd()
	arguments.

sbin/camcontrol/zone.c:
	Implement support for Shingled Magnetic Recording (SMR) drives
	via SCSI Zoned Block Commands (ZBC) and ATA Zoned Device ATA
	Command Set (ZAC).

	These specs were developed in concert, and are functionally
	identical.  The primary differences are due to SCSI and ATA
	differences.  (SCSI is big endian, ATA is little endian, for
	example.)

	This includes support for all commands defined in the ZBC and
	ZAC specs.

sys/cam/ata/ata_all.c:
	Decode a number of additional ATA command names in ata_op_string().

	Add a new CCB building function, ata_read_log().

	Add ata_zac_mgmt_in() and ata_zac_mgmt_out() CCB building
	functions.  These support both DMA and NCQ encapsulation.

sys/cam/ata/ata_all.h:
	Add prototypes for ata_read_log(), ata_zac_mgmt_out(), and
	ata_zac_mgmt_in().

sys/cam/ata/ata_da.c:
	Revamp the ada(4) driver to support zoned devices.

	Add four new probe states to gather information needed for zone
	support.

	Add a new adasetflags() function to avoid duplication of large
	blocks of flag setting between the async handler and register
	functions.

	Add new sysctl variables that describe zone support and paramters.

	Add support for the new BIO_ZONE bio, and all of its subcommands:
	DISK_ZONE_OPEN, DISK_ZONE_CLOSE, DISK_ZONE_FINISH, DISK_ZONE_RWP,
	DISK_ZONE_REPORT_ZONES, and DISK_ZONE_GET_PARAMS.

sys/cam/scsi/scsi_all.c:
	Add command descriptions for the ZBC IN/OUT commands.

	Add descriptions for ZBC Host Managed devices.

	Add a new function, scsi_ata_pass() to do ATA passthrough over
	SCSI.  This will eventually replace scsi_ata_pass_16() -- it
	can create the 12, 16, and 32-byte variants of the ATA
	PASS-THROUGH command, and supports setting all of the
	registers defined as of SAT-4, Revision 5 (March 11, 2016).

	Change scsi_ata_identify() to use scsi_ata_pass() instead of
	scsi_ata_pass_16().

	Add a new scsi_ata_read_log() function to facilitate reading
	ATA logs via SCSI.

sys/cam/scsi/scsi_all.h:
	Add the new ATA PASS-THROUGH(32) command CDB.  Add extended and
	variable CDB opcodes.

	Add Zoned Block Device Characteristics VPD page.

	Add ATA Return SCSI sense descriptor.

	Add prototypes for scsi_ata_read_log() and scsi_ata_pass().

sys/cam/scsi/scsi_da.c:
	Revamp the da(4) driver to support zoned devices.

	Add five new probe states, four of which are needed for ATA
	devices.

	Add five new sysctl variables that describe zone support and
	parameters.

	The da(4) driver supports SCSI ZBC devices, as well as ATA ZAC
	devices when they are attached via a SCSI to ATA Translation (SAT)
	layer.  Since ZBC -> ZAC translation is a new feature in the T10
	SAT-4 spec, most SATA drives will be supported via ATA commands
	sent via the SCSI ATA PASS-THROUGH command.  The da(4) driver will
	prefer the ZBC interface, if it is available, for performance
	reasons, but will use the ATA PASS-THROUGH interface to the ZAC
	command set if the SAT layer doesn't support translation yet.
	As I mentioned above, ZBC command support is untested.

	Add support for the new BIO_ZONE bio, and all of its subcommands:
	DISK_ZONE_OPEN, DISK_ZONE_CLOSE, DISK_ZONE_FINISH, DISK_ZONE_RWP,
	DISK_ZONE_REPORT_ZONES, and DISK_ZONE_GET_PARAMS.

	Add scsi_zbc_in() and scsi_zbc_out() CCB building functions.

	Add scsi_ata_zac_mgmt_out() and scsi_ata_zac_mgmt_in() CCB/CDB
	building functions.  Note that these have return values, unlike
	almost all other CCB building functions in CAM.  The reason is
	that they can fail, depending upon the particular combination
	of input parameters.  The primary failure case is if the user
	wants NCQ, but fails to specify additional CDB storage.  NCQ
	requires using the 32-byte version of the SCSI ATA PASS-THROUGH
	command, and the current CAM CDB size is 16 bytes.

sys/cam/scsi/scsi_da.h:
	Add ZBC IN and ZBC OUT CDBs and opcodes.

	Add SCSI Report Zones data structures.

	Add scsi_zbc_in(), scsi_zbc_out(), scsi_ata_zac_mgmt_out(), and
	scsi_ata_zac_mgmt_in() prototypes.

sys/dev/ahci/ahci.c:
	Fix SEND / RECEIVE FPDMA QUEUED in the ahci(4) driver.

	ahci_setup_fis() previously set the top bits of the sector count
	register in the FIS to 0 for FPDMA commands.  This is okay for
	read and write, because the PRIO field is in the only thing in
	those bits, and we don't implement that further up the stack.

	But, for SEND and RECEIVE FPDMA QUEUED, the subcommand is in that
	byte, so it needs to be transmitted to the drive.

	In ahci_setup_fis(), always set the the top 8 bits of the
	sector count register.  We need it in both the standard
	and NCQ / FPDMA cases.

sys/geom/eli/g_eli.c:
	Pass BIO_ZONE commands through the GELI class.

sys/geom/geom.h:
	Add g_io_zonecmd() prototype.

sys/geom/geom_dev.c:
	Add new DIOCZONECMD ioctl, which allows sending zone commands to
	disks.

sys/geom/geom_disk.c:
	Add support for BIO_ZONE commands.

sys/geom/geom_disk.h:
	Add a new flag, DISKFLAG_CANZONE, that indicates that a given
	GEOM disk client can handle BIO_ZONE commands.

sys/geom/geom_io.c:
	Add a new function, g_io_zonecmd(), that handles execution of
	BIO_ZONE commands.

	Add permissions check for BIO_ZONE commands.

	Add command decoding for BIO_ZONE commands.

sys/geom/geom_subr.c:
	Add DDB command decoding for BIO_ZONE commands.

sys/kern/subr_devstat.c:
	Record statistics for REPORT ZONES commands.  Note that the
	number of bytes transferred for REPORT ZONES won't quite match
	what is received from the harware.  This is because we're
	necessarily counting bytes coming from the da(4) / ada(4) drivers,
	which are using the disk_zone.h interface to communicate up
	the stack.  The structure sizes it uses are slightly different
	than the SCSI and ATA structure sizes.

sys/sys/ata.h:
	Add many bit and structure definitions for ZAC, NCQ, and EPC
	command support.

sys/sys/bio.h:
	Convert the bio_cmd field to a straight enumeration.  This will
	yield more space for additional commands in the future.  After
	change r297955 and other related changes, this is now possible.
	Converting to an enumeration will also prevent use as a bitmask
	in the future.

sys/sys/disk.h:
	Define the DIOCZONECMD ioctl.

sys/sys/disk_zone.h:
	Add a new API for managing zoned disks.  This is very close to
	the SCSI ZBC and ATA ZAC standards, but uses integers in native
	byte order instead of big endian (SCSI) or little endian (ATA)
	byte arrays.

	This is intended to offer to the complete feature set of the ZBC
	and ZAC disk management without requiring the application developer
	to include SCSI or ATA headers.  We also use one set of headers
	for ioctl consumers and kernel bio-level consumers.

sys/sys/param.h:
	Bump __FreeBSD_version for sys/bio.h command changes, and inclusion
	of SMR support.

usr.sbin/Makefile:
	Add the zonectl utility.

usr.sbin/diskinfo/diskinfo.c
	Add disk zoning capability to the 'diskinfo -v' output.

usr.sbin/zonectl/Makefile:
	Add zonectl makefile.

usr.sbin/zonectl/zonectl.8
	zonectl(8) man page.

usr.sbin/zonectl/zonectl.c
	The zonectl(8) utility.  This allows managing SCSI or ATA zoned
	disks via the disk_zone.h API.  You can report zones, reset write
	pointers, get parameters, etc.

Sponsored by:	Spectra Logic
Differential Revision:	https://reviews.freebsd.org/D6147
Reviewed by:	wblock (documentation)


Revision 298808 - (view) (download) (annotate) - [select for diffs]
Modified Fri Apr 29 20:56:58 2016 UTC (8 years, 2 months ago) by pfg
File length: 23288 byte(s)
Diff to previous 298439
sys/geom: spelling fixes in comments.

No functional change.


Revision 298439 - (view) (download) (annotate) - [select for diffs]
Modified Thu Apr 21 21:13:41 2016 UTC (8 years, 2 months ago) by asomers
File length: 23288 byte(s)
Diff to previous 298420
DRY on buffer sizes.  Update to r298420.

sys/geom/geom_disk.c:
	In disk_attr_changed, don't repeat a buffer size.

Reported by: ngie, hselasky
MFC after:	4 weeks
X-MFC-With:	298420
Sponsored by:	Spectra Logic Corp


Revision 298420 - (view) (download) (annotate) - [select for diffs]
Modified Thu Apr 21 16:43:15 2016 UTC (8 years, 2 months ago) by asomers
File length: 23268 byte(s)
Diff to previous 296605
Notify userspace listeners when geom disk attributes have changed

sys/geom/geom_disk.c:
	disk_attr_changed(): Generate a devctl event of type GEOM:<attr> for
	every call.

MFC after:	4 weeks
Sponsored by:	Spectra Logic Corp
Differential Revision:	https://reviews.freebsd.org/D5952


Revision 296605 - (view) (download) (annotate) - [select for diffs]
Modified Thu Mar 10 06:25:31 2016 UTC (8 years, 3 months ago) by imp
File length: 23106 byte(s)
Diff to previous 294042
Don't assume that bio_cmd is bit mask.

Differential Revision: https://reviews.freebsd.org/D5593


Revision 294042 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jan 14 21:52:21 2016 UTC (8 years, 5 months ago) by rpokala
File length: 23052 byte(s)
Diff to previous 291742
Add rotationrate to geom disk dumpconf

Parse and report the nominal rotation rate reported by the drive.

Reviewed by:	sbruno, jhb
Approved by:	jhb
MFC after:	1 week
Differential Revision:	https://reviews.freebsd.org/D4483
Requested by:	Kevin Bowling < kevin.bowling @ kev009.com >


Revision 291742 - (view) (download) (annotate) - [select for diffs]
Modified Fri Dec 4 03:44:12 2015 UTC (8 years, 7 months ago) by ken
File length: 22393 byte(s)
Diff to previous 291741
Fix a style issue in g_disk_limit().

Noticed by:	bdrewery
MFC after:	1 week


Revision 291741 - (view) (download) (annotate) - [select for diffs]
Modified Fri Dec 4 03:38:35 2015 UTC (8 years, 7 months ago) by ken
File length: 22384 byte(s)
Diff to previous 291716
Fix g_disk_vlist_limit() to work properly with deletes.

Add a new bp argument to g_disk_maxsegs(), and add a new function,
g_disk_maxsize() tha will properly determine the maximum I/O size for a
delete or non-delete bio.

Submitted by:	will
MFC after:	1 week
Sponsored by:	Spectra Logic


Revision 291716 - (view) (download) (annotate) - [select for diffs]
Modified Thu Dec 3 20:54:55 2015 UTC (8 years, 7 months ago) by ken
File length: 22273 byte(s)
Diff to previous 273638
Add asynchronous command support to the pass(4) driver, and the new
camdd(8) utility.

CCBs may be queued to the driver via the new CAMIOQUEUE ioctl, and
completed CCBs may be retrieved via the CAMIOGET ioctl.  User
processes can use poll(2) or kevent(2) to get notification when
I/O has completed.

While the existing CAMIOCOMMAND blocking ioctl interface only
supports user virtual data pointers in a CCB (generally only
one per CCB), the new CAMIOQUEUE ioctl supports user virtual and
physical address pointers, as well as user virtual and physical
scatter/gather lists.  This allows user applications to have more
flexibility in their data handling operations.

Kernel memory for data transferred via the queued interface is
allocated from the zone allocator in MAXPHYS sized chunks, and user
data is copied in and out.  This is likely faster than the
vmapbuf()/vunmapbuf() method used by the CAMIOCOMMAND ioctl in
configurations with many processors (there are more TLB shootdowns
caused by the mapping/unmapping operation) but may not be as fast
as running with unmapped I/O.

The new memory handling model for user requests also allows
applications to send CCBs with request sizes that are larger than
MAXPHYS.  The pass(4) driver now limits queued requests to the I/O
size listed by the SIM driver in the maxio field in the Path
Inquiry (XPT_PATH_INQ) CCB.

There are some things things would be good to add:

1. Come up with a way to do unmapped I/O on multiple buffers.
   Currently the unmapped I/O interface operates on a struct bio,
   which includes only one address and length.  It would be nice
   to be able to send an unmapped scatter/gather list down to
   busdma.  This would allow eliminating the copy we currently do
   for data.

2. Add an ioctl to list currently outstanding CCBs in the various
   queues.

3. Add an ioctl to cancel a request, or use the XPT_ABORT CCB to do
   that.

4. Test physical address support.  Virtual pointers and scatter
   gather lists have been tested, but I have not yet tested
   physical addresses or scatter/gather lists.

5. Investigate multiple queue support.  At the moment there is one
   queue of commands per pass(4) device.  If multiple processes
   open the device, they will submit I/O into the same queue and
   get events for the same completions.  This is probably the right
   model for most applications, but it is something that could be
   changed later on.

Also, add a new utility, camdd(8) that uses the asynchronous pass(4)
driver interface.

This utility is intended to be a basic data transfer/copy utility,
a simple benchmark utility, and an example of how to use the
asynchronous pass(4) interface.

It can copy data to and from pass(4) devices using any target queue
depth, starting offset and blocksize for the input and ouptut devices.
It currently only supports SCSI devices, but could be easily extended
to support ATA devices.

It can also copy data to and from regular files, block devices, tape
devices, pipes, stdin, and stdout.  It does not support queueing
multiple commands to any of those targets, since it uses the standard
read(2)/write(2)/writev(2)/readv(2) system calls.

The I/O is done by two threads, one for the reader and one for the
writer.  The reader thread sends completed read requests to the
writer thread in strictly sequential order, even if they complete
out of order.  That could be modified later on for random I/O patterns
or slightly out of order I/O.

camdd(8) uses kqueue(2)/kevent(2) to get I/O completion events from
the pass(4) driver and also to send request notifications internally.

For pass(4) devcies, camdd(8) uses a single buffer (CAM_DATA_VADDR)
per CAM CCB on the reading side, and a scatter/gather list
(CAM_DATA_SG) on the writing side.  In addition to testing both
interfaces, this makes any potential reblocking of I/O easier.  No
data is copied between the reader and the writer, but rather the
reader's buffers are split into multiple I/O requests or combined
into a single I/O request depending on the input and output blocksize.

For the file I/O path, camdd(8) also uses a single buffer (read(2),
write(2), pread(2) or pwrite(2)) on reads, and a scatter/gather list
(readv(2), writev(2), preadv(2), pwritev(2)) on writes.

Things that would be nice to do for camdd(8) eventually:

1.  Add support for I/O pattern generation.  Patterns like all
    zeros, all ones, LBA-based patterns, random patterns, etc. Right
    Now you can always use /dev/zero, /dev/random, etc.

2.  Add support for a "sink" mode, so we do only reads with no
    writes.  Right now, you can use /dev/null.

3.  Add support for automatic queue depth probing, so that we can
    figure out the right queue depth on the input and output side
    for maximum throughput.  At the moment it defaults to 6.

4.  Add support for SATA device passthrough I/O.

5.  Add support for random LBAs and/or lengths on the input and
    output sides.

6.  Track average per-I/O latency and busy time.  The busy time
    and latency could also feed in to the automatic queue depth
    determination.

sys/cam/scsi/scsi_pass.h:
	Define two new ioctls, CAMIOQUEUE and CAMIOGET, that queue
	and fetch asynchronous CAM CCBs respectively.

	Although these ioctls do not have a declared argument, they
	both take a union ccb pointer.  If we declare a size here,
	the ioctl code in sys/kern/sys_generic.c will malloc and free
	a buffer for either the CCB or the CCB pointer (depending on
	how it is declared).  Since we have to keep a copy of the
	CCB (which is fairly large) anyway, having the ioctl malloc
	and free a CCB for each call is wasteful.

sys/cam/scsi/scsi_pass.c:
	Add asynchronous CCB support.

	Add two new ioctls, CAMIOQUEUE and CAMIOGET.

	CAMIOQUEUE adds a CCB to the incoming queue.  The CCB is
	executed immediately (and moved to the active queue) if it
	is an immediate CCB, but otherwise it will be executed
	in passstart() when a CCB is available from the transport layer.

	When CCBs are completed (because they are immediate or
	passdone() if they are queued), they are put on the done
	queue.

	If we get the final close on the device before all pending
	I/O is complete, all active I/O is moved to the abandoned
	queue and we increment the peripheral reference count so
	that the peripheral driver instance doesn't go away before
	all pending I/O is done.

	The new passcreatezone() function is called on the first
	call to the CAMIOQUEUE ioctl on a given device to allocate
	the UMA zones for I/O requests and S/G list buffers.  This
	may be good to move off to a taskqueue at some point.
	The new passmemsetup() function allocates memory and
	scatter/gather lists to hold the user's data, and copies
	in any data that needs to be written.  For virtual pointers
	(CAM_DATA_VADDR), the kernel buffer is malloced from the
	new pass(4) driver malloc bucket.  For virtual
	scatter/gather lists (CAM_DATA_SG), buffers are allocated
	from a new per-pass(9) UMA zone in MAXPHYS-sized chunks.
	Physical pointers are passed in unchanged.  We have support
	for up to 16 scatter/gather segments (for the user and
	kernel S/G lists) in the default struct pass_io_req, so
	requests with longer S/G lists require an extra kernel malloc.

	The new passcopysglist() function copies a user scatter/gather
	list to a kernel scatter/gather list.  The number of elements
	in each list may be different, but (obviously) the amount of data
	stored has to be identical.

	The new passmemdone() function copies data out for the
	CAM_DATA_VADDR and CAM_DATA_SG cases.

	The new passiocleanup() function restores data pointers in
	user CCBs and frees memory.

	Add new functions to support kqueue(2)/kevent(2):

	passreadfilt() tells kevent whether or not the done
	queue is empty.

	passkqfilter() adds a knote to our list.

	passreadfiltdetach() removes a knote from our list.

	Add a new function, passpoll(), for poll(2)/select(2)
	to use.

	Add devstat(9) support for the queued CCB path.

sys/cam/ata/ata_da.c:
	Add support for the BIO_VLIST bio type.

sys/cam/cam_ccb.h:
	Add a new enumeration for the xflags field in the CCB header.
	(This doesn't change the CCB header, just adds an enumeration to
	use.)

sys/cam/cam_xpt.c:
	Add a new function, xpt_setup_ccb_flags(), that allows specifying
	CCB flags.

sys/cam/cam_xpt.h:
	Add a prototype for xpt_setup_ccb_flags().

sys/cam/scsi/scsi_da.c:
	Add support for BIO_VLIST.

sys/dev/md/md.c:
	Add BIO_VLIST support to md(4).

sys/geom/geom_disk.c:
	Add BIO_VLIST support to the GEOM disk class.  Re-factor the I/O size
	limiting code in g_disk_start() a bit.

sys/kern/subr_bus_dma.c:
	Change _bus_dmamap_load_vlist() to take a starting offset and
	length.

	Add a new function, _bus_dmamap_load_pages(), that will load a list
	of physical pages starting at an offset.

	Update _bus_dmamap_load_bio() to allow loading BIO_VLIST bios.
	Allow unmapped I/O to start at an offset.

sys/kern/subr_uio.c:
	Add two new functions, physcopyin_vlist() and physcopyout_vlist().

sys/pc98/include/bus.h:
	Guard kernel-only parts of the pc98 machine/bus.h header with
	#ifdef _KERNEL.

	This allows userland programs to include <machine/bus.h> to get the
	definition of bus_addr_t and bus_size_t.

sys/sys/bio.h:
	Add a new bio flag, BIO_VLIST.

sys/sys/uio.h:
	Add prototypes for physcopyin_vlist() and physcopyout_vlist().

share/man/man4/pass.4:
	Document the CAMIOQUEUE and CAMIOGET ioctls.

usr.sbin/Makefile:
	Add camdd.

usr.sbin/camdd/Makefile:
	Add a makefile for camdd(8).

usr.sbin/camdd/camdd.8:
	Man page for camdd(8).

usr.sbin/camdd/camdd.c:
	The new camdd(8) utility.

Sponsored by:	Spectra Logic
MFC after:	1 week


Revision 273638 - (view) (download) (annotate) - [select for diffs]
Modified Sat Oct 25 15:16:19 2014 UTC (9 years, 8 months ago) by mav
File length: 19872 byte(s)
Diff to previous 267992
Revert somewhat hackish geom_disk optimization, committed as part of r256880,
and the following r273143 commit, supposed to workaround introduced issue by
quite innocent-looking change.

While there is no clear understanding why, but r273143 is accused in data
corruption in some environments with high I/O load.  I personally don't see
any problem in that commit, and possibly it is just a trigger to some other
bug somewhere, but better safe then sorry for now.

Requested by:	scottl@
MFC after:	3 days


Revision 267992 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jun 28 03:56:17 2014 UTC (10 years ago) by hselasky
File length: 20710 byte(s)
Diff to previous 267985
Pull in r267961 and r267973 again. Fix for issues reported will follow.


Revision 267985 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jun 27 22:05:21 2014 UTC (10 years ago) by gjb
File length: 20853 byte(s)
Diff to previous 267961
Revert r267961, r267973:

These changes prevent sysctl(8) from returning proper output,
such as:

 1) no output from sysctl(8)
 2) erroneously returning ENOMEM with tools like truss(1)
    or uname(1)
 truss: can not get etype: Cannot allocate memory


Revision 267961 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jun 27 16:33:43 2014 UTC (10 years ago) by hselasky
File length: 20710 byte(s)
Diff to previous 266319
Extend the meaning of the CTLFLAG_TUN flag to automatically check if
there is an environment variable which shall initialize the SYSCTL
during early boot. This works for all SYSCTL types both statically and
dynamically created ones, except for the SYSCTL NODE type and SYSCTLs
which belong to VNETs. A new flag, CTLFLAG_NOFETCH, has been added to
be used in the case a tunable sysctl has a custom initialisation
function allowing the sysctl to still be marked as a tunable. The
kernel SYSCTL API is mostly the same, with a few exceptions for some
special operations like iterating childrens of a static/extern SYSCTL
node. This operation should probably be made into a factored out
common macro, hence some device drivers use this. The reason for
changing the SYSCTL API was the need for a SYSCTL parent OID pointer
and not only the SYSCTL parent OID list pointer in order to quickly
generate the sysctl path. The motivation behind this patch is to avoid
parameter loading cludges inside the OFED driver subsystem. Instead of
adding special code to the OFED driver subsystem to post-load tunables
into dynamically created sysctls, we generalize this in the kernel.

Other changes:
- Corrected a possibly incorrect sysctl name from "hw.cbb.intr_mask"
to "hw.pcic.intr_mask".
- Removed redundant TUNABLE statements throughout the kernel.
- Some minor code rewrites in connection to removing not needed
TUNABLE statements.
- Added a missing SYSCTL_DECL().
- Wrapped two very long lines.
- Avoid malloc()/free() inside sysctl string handling, in case it is
called to initialize a sysctl from a tunable, hence malloc()/free() is
not ready when sysctls from the sysctl dataset are registered.
- Bumped FreeBSD version to indicate SYSCTL API change.

MFC after:	2 weeks
Sponsored by:	Mellanox Technologies


Revision 266319 - (view) (download) (annotate) - [select for diffs]
Modified Sat May 17 15:07:00 2014 UTC (10 years, 1 month ago) by mav
File length: 20853 byte(s)
Diff to previous 265072
Make GEOM DISK to account also BIO_FLUSH operations.


Revision 265072 - (view) (download) (annotate) - [select for diffs]
Modified Tue Apr 29 01:17:43 2014 UTC (10 years, 2 months ago) by bdrewery
File length: 20721 byte(s)
Diff to previous 264320
Remove redundant include

MFC after:	3 days


Revision 264320 - (view) (download) (annotate) - [select for diffs]
Modified Thu Apr 10 17:00:44 2014 UTC (10 years, 2 months ago) by bdrewery
File length: 20745 byte(s)
Diff to previous 258683
Fix spelling error in g_trace() call.

Sponsored by:	EMC / Isilon Storage Division
MFC after:	1 week


Revision 258683 - (view) (download) (annotate) - [select for diffs]
Modified Wed Nov 27 14:25:06 2013 UTC (10 years, 7 months ago) by mav
File length: 20744 byte(s)
Diff to previous 257131
Escape special XML chars, returned by some devices, confusing XML parsers.

MFC after:	1 month


Revision 257131 - (view) (download) (annotate) - [select for diffs]
Modified Fri Oct 25 19:19:12 2013 UTC (10 years, 8 months ago) by jhb
File length: 20455 byte(s)
Diff to previous 256956
Reject attempts to attack a disk device that has the old NEEDSGIANT
flag set.

Reviewed by:	mav


Revision 256956 - (view) (download) (annotate) - [select for diffs]
Modified Wed Oct 23 09:54:58 2013 UTC (10 years, 8 months ago) by smh
File length: 20239 byte(s)
Diff to previous 256884
Improve ZFS N-way mirror read performance by using load and locality
information.

The existing algorithm selects a preferred leaf vdev based on offset of the zio
request modulo the number of members in the mirror. It assumes the devices are
of equal performance and that spreading the requests randomly over both drives
will be sufficient to saturate them. In practice this results in the leaf vdevs
being under utilized.

The new algorithm takes into the following additional factors:
* Load of the vdevs (number outstanding I/O requests)
* The locality of last queued I/O vs the new I/O request.

Within the locality calculation additional knowledge about the underlying vdev
is considered such as; is the device backing the vdev a rotating media device.

This results in performance increases across the board as well as significant
increases for predominantly streaming loads and for configurations which don't
have evenly performing devices.

The following are results from a setup with 3 Way Mirror with 2 x HD's and
1 x SSD from a basic test running multiple parrallel dd's.

With pre-fetch disabled (vfs.zfs.prefetch_disable=1):

== Stripe Balanced (default) ==
Read 15360MB using bs: 1048576, readers: 3, took 161 seconds @ 95 MB/s
== Load Balanced (zfslinux) ==
Read 15360MB using bs: 1048576, readers: 3, took 297 seconds @ 51 MB/s
== Load Balanced (locality freebsd) ==
Read 15360MB using bs: 1048576, readers: 3, took 54 seconds @ 284 MB/s

With pre-fetch enabled (vfs.zfs.prefetch_disable=0):

== Stripe Balanced (default) ==
Read 15360MB using bs: 1048576, readers: 3, took 91 seconds @ 168 MB/s
== Load Balanced (zfslinux) ==
Read 15360MB using bs: 1048576, readers: 3, took 108 seconds @ 142 MB/s
== Load Balanced (locality freebsd) ==
Read 15360MB using bs: 1048576, readers: 3, took 48 seconds @ 320 MB/s

In addition to the performance changes the code was also restructured, with
the help of Justin Gibbs, to provide a more logical flow which also ensures
vdevs loads are only calculated from the set of valid candidates.

The following additional sysctls where added to allow the administrator
to tune the behaviour of the load algorithm:
* vfs.zfs.vdev.mirror.rotating_inc
* vfs.zfs.vdev.mirror.rotating_seek_inc
* vfs.zfs.vdev.mirror.rotating_seek_offset
* vfs.zfs.vdev.mirror.non_rotating_inc
* vfs.zfs.vdev.mirror.non_rotating_seek_inc

These changes where based on work started by the zfsonlinux developers:
https://github.com/zfsonlinux/zfs/pull/1487

Reviewed by:	gibbs, mav, will
MFC after:	2 weeks
Sponsored by:	Multiplay


Revision 256884 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 22 10:21:20 2013 UTC (10 years, 8 months ago) by mav
File length: 20121 byte(s)
Diff to previous 256880
Remove Giant-locked drivers support (DISKFLAG_NEEDSGIANT flag) from disk(9).

Since at least FreeBSD 7 we had only four of them in the base tree, and
in head branch, thanks to jhb@, we have no any for more then a year.


Revision 256880 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 22 08:22:19 2013 UTC (10 years, 8 months ago) by mav
File length: 20687 byte(s)
Diff to previous 256606
Merge GEOM direct dispatch changes from the projects/camlock branch.

When safety requirements are met, it allows to avoid passing I/O requests
to GEOM g_up/g_down thread, executing them directly in the caller context.
That allows to avoid CPU bottlenecks in g_up/g_down threads, plus avoid
several context switches per I/O.

The defined now safety requirements are:
 - caller should not hold any locks and should be reenterable;
 - callee should not depend on GEOM dual-threaded concurency semantics;
 - on the way down, if request is unmapped while callee doesn't support it,
   the context should be sleepable;
 - kernel thread stack usage should be below 50%.

To keep compatibility with GEOM classes not meeting above requirements
new provider and consumer flags added:
 - G_CF_DIRECT_SEND -- consumer code meets caller requirements (request);
 - G_CF_DIRECT_RECEIVE -- consumer code meets callee requirements (done);
 - G_PF_DIRECT_SEND -- provider code meets caller requirements (done);
 - G_PF_DIRECT_RECEIVE -- provider code meets callee requirements (request).
Capable GEOM class can set them, allowing direct dispatch in cases where
it is safe.  If any of requirements are not met, request is queued to
g_up or g_down thread same as before.

Such GEOM classes were reviewed and updated to support direct dispatch:
CONCAT, DEV, DISK, GATE, MD, MIRROR, MULTIPATH, NOP, PART, RAID, STRIPE,
VFS, ZERO, ZFS::VDEV, ZFS::ZVOL, all classes based on g_slice KPI (LABEL,
MAP, FLASHMAP, etc).

To declare direct completion capability disk(9) KPI got new flag equivalent
to G_PF_DIRECT_SEND -- DISKFLAG_DIRECT_COMPLETION.  da(4) and ada(4) disk
drivers got it set now thanks to earlier CAM locking work.

This change more then twice increases peak block storage performance on
systems with manu CPUs, together with earlier CAM locking changes reaching
more then 1 million IOPS (512 byte raw reads from 16 SATA SSDs on 4 HBAs to
256 user-level threads).

Sponsored by:	iXsystems, Inc.
MFC after:	2 months


Revision 256606 - (view) (download) (annotate) - [select for diffs]
Modified Wed Oct 16 09:18:01 2013 UTC (10 years, 8 months ago) by mav
File length: 19433 byte(s)
Diff to previous 256603
MFprojects/camlock r254907:
Move g_io_deliver() out of the lock, as required for direct dispatch.
Move g_destroy_bio() out too to reduce lock scope even more.


Revision 256603 - (view) (download) (annotate) - [select for diffs]
Modified Wed Oct 16 09:12:40 2013 UTC (10 years, 8 months ago) by mav
File length: 19398 byte(s)
Diff to previous 254766
MFprojects/camlock r254905:
Introduce new function devstat_end_transaction_bio_bt(), adding new argument
to specify present time.  Use this function to move binuptime() out of lock,
substantially reducing lock congestion when slow timecounter is used.


Revision 254766 - (view) (download) (annotate) - [select for diffs]
Modified Sat Aug 24 09:42:14 2013 UTC (10 years, 10 months ago) by mav
File length: 19350 byte(s)
Diff to previous 252657
Add new attribute lunname to report only textual LUN-specific device IDs.
While lunid attribute prefers to report numeric ones, having both may be
useful in some situations.


Revision 252657 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jul 3 23:46:30 2013 UTC (11 years ago) by smh
File length: 19148 byte(s)
Diff to previous 251654
Bump disk(9) ABI version to signify the addition of d_delmaxsize by r249940.

Ensure that d_delmaxsize is always set, removing init to 0 which could cause
future issues if use cases change.

Allow kern.cam.da.X.delete_max (which maps to d_delmaxsize) to be increased
up to the calculated max after being reduced.

MFC after:	1 day
X-MFC-With: r249940


Revision 251654 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jun 12 13:36:20 2013 UTC (11 years ago) by mav
File length: 19185 byte(s)
Diff to previous 251616
Make CAM return and GEOM DISK pass through new GEOM::lunid attribute.

SPC-4 specification states that serial number may be property of device,
but not a specific logical unit.  People reported about FC storages using
serial number in that way, making it unusable for purposes of LUN multipath
detection.  SPC-4 states that designators associated with logical unit from
the VPD page 83h "Device Identification" should be used for that purpose.
Report first of them in the new attribute in such preference order: NAA,
EUI-64, T10 and SCSI name string.

While there, make GEOM DISK properly report GEOM::ident in XML output also
using d_getattr() method, if available.  This fixes serial numbers reporting
for SCSI disks in `geom disk list` output and confxml.

Discussed with:	gibbs, ken
Sponsored by:	iXsystems, Inc.
MFC after:	2 weeks


Revision 251616 - (view) (download) (annotate) - [select for diffs]
Modified Tue Jun 11 10:06:07 2013 UTC (11 years ago) by mav
File length: 18554 byte(s)
Diff to previous 249940
Don't update provider properties and don't set DISKFLAG_OPEN if d_open()
disk method call returned error.  GEOM considers devices in such case as
still closed, and won't call symmetric d_close() for them.


Revision 249940 - (view) (download) (annotate) - [select for diffs]
Modified Fri Apr 26 16:22:54 2013 UTC (11 years, 2 months ago) by smh
File length: 18515 byte(s)
Diff to previous 249507
Teach GEOM and CAM about the difference between the max "size" of r/w and delete
requests.

sys/geom/geom_disk.h:
        - Added d_delmaxsize which represents the maximum size of individual
          device delete requests in bytes. This can be used by devices to
          inform geom of their size limitations regarding delete operations
          which are generally different from the read / write limits as data
          is not usually transferred from the host to physical device.

sys/geom/geom_disk.c:
        - Use new d_delmaxsize to calculate the size of chunks passed through to
          the underlying strategy during deletes instead of using read / write
          optimised values. This defaults to d_maxsize if unset (0).

        - Moved d_maxsize default up so it can be used to default d_delmaxsize

sys/cam/ata/ata_da.c:
        - Added d_delmaxsize calculations for TRIM and CFA

sys/cam/scsi/scsi_da.c:
        - Added re-calculation of d_delmaxsize whenever delete_method is set.

        - Added kern.cam.da.X.delete_max sysctl which allows the max size for
          delete requests to be limited. This is useful in preventing timeouts
          on devices who's delete methods are slow. It should be noted that
          this limit is reset then the device delete method is changed and
          that it can only be lowered not increased from the device max.

Reviewed by:	mav
Approved by:	pjd (mentor)


Revision 249507 - (view) (download) (annotate) - [select for diffs]
Modified Mon Apr 15 15:55:40 2013 UTC (11 years, 2 months ago) by ivoras
File length: 18012 byte(s)
Diff to previous 249161
Introduce a symbol for the GEOM class name instead of using the ad-hoc string
constant.


Revision 249161 - (view) (download) (annotate) - [select for diffs]
Modified Fri Apr 5 13:11:28 2013 UTC (11 years, 3 months ago) by mav
File length: 18001 byte(s)
Diff to previous 248694
Following r241022, replace iteration over the provider list on media events
by taking first one and asserting that there is no others.

MFC after:	1 week


Revision 248694 - (view) (download) (annotate) - [select for diffs]
Modified Mon Mar 25 05:45:24 2013 UTC (11 years, 3 months ago) by mav
File length: 17781 byte(s)
Diff to previous 248516
In GEOM DISK:
 - Replace single done mutex with per-disk ones.  On system with several
disks on several HBAs that removes small, but measurable lock congestion.
 - Modify disk destruction process to not destroy the mutex prematurely.
 - Remove some extra pointer derefences.


Revision 248516 - (view) (download) (annotate) - [select for diffs]
Modified Tue Mar 19 14:49:15 2013 UTC (11 years, 3 months ago) by kib
File length: 18225 byte(s)
Diff to previous 242322
A flag for the geom disk driver to indicate that it accepts the
unmapped i/o requests.

Sponsored by:	The FreeBSD Foundation
Tested by:	pho


Revision 242322 - (view) (download) (annotate) - [select for diffs]
Modified Mon Oct 29 17:52:43 2012 UTC (11 years, 8 months ago) by trasz
File length: 17646 byte(s)
Diff to previous 241022
Fix locking problem in disk_resize(); previously it would run without
topology lock, resulting in assertion when running with DIAGNOSTIC.

Reviewed by:	mav (earlier version)


Revision 241022 - (view) (download) (annotate) - [select for diffs]
Modified Fri Sep 28 08:22:51 2012 UTC (11 years, 9 months ago) by pjd
File length: 17852 byte(s)
Diff to previous 240822
Remove the topology lock from disk_gone(), it might be called with regular
mutexes held and the topology lock is an sx lock.

The topology lock was there to protect traversing through the list of providers
of disk's geom, but it seems that disk's geom has always exactly one provider.

Change the code to call g_wither_provider() for this one provider, which is
safe to do without holding the topology lock and assert that there is indeed
only one provider.

Discussed with:	ken
MFC after:	1 week


Revision 240822 - (view) (download) (annotate) - [select for diffs]
Modified Sat Sep 22 12:41:49 2012 UTC (11 years, 9 months ago) by pjd
File length: 17784 byte(s)
Diff to previous 240629
Use the topology lock to protect list of providers while withering them.
It is possible that provider is destroyed while we are iterating over the
list.

Reported by:	Brian Parkison <parkison@panzura.com>
Discussed with:	phk
MFC after:	1 week


Revision 240629 - (view) (download) (annotate) - [select for diffs]
Modified Tue Sep 18 07:57:34 2012 UTC (11 years, 9 months ago) by avg
File length: 17737 byte(s)
Diff to previous 239790
g_disk_flushcache definitely should not be traced under G_T_TOPOLOGY

... use G_T_BIO instead

MFC after:	1 week


Revision 239790 - (view) (download) (annotate) - [select for diffs]
Modified Tue Aug 28 19:28:31 2012 UTC (11 years, 10 months ago) by ed
File length: 17742 byte(s)
Diff to previous 238886
Remove unneeded G_PF_CANDELETE flag.

This flag is only used by GEOM so it can be propagated to the character
device's SI_CANDELETE. Unfortunately, SI_CANDELETE seems to do nothing.


Revision 238886 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jul 29 11:51:48 2012 UTC (11 years, 11 months ago) by mav
File length: 17924 byte(s)
Diff to previous 238216
Implement media change notification for DA and CD removable media devices.
It includes three parts:
 1) Modifications to CAM to detect media media changes and report them to
disk(9) layer. For modern SATA (and potentially UAS) devices it utilizes
Asynchronous Notification mechanism to receive events from hardware.
Active polling with TEST UNIT READY commands with 3 seconds period is used
for incapable hardware. After that both CD and DA drivers work the same way,
detecting two conditions: "NOT READY: Medium not present" after medium was
detected previously, and "UNIT ATTENTION: Not ready to ready change, medium
may have changed". First one reported to disk(9) as media removal, second
as media insert/change. To reliably receive second event new
AC_UNIT_ATTENTION async added to make UAs broadcasted to all periphs by
generic error handling code in cam_periph_error().
 2) Modifications to GEOM core to handle media remove and change events.
Media removal handled by spoiling all consumers attached to the provider.
Media change event also schedules provider retaste after spoiling to probe
new media. New flag G_CF_ORPHAN was added to consumers to reflect that
consumer is in process of destruction. It allows retaste to create new
geom instance of the same class, while previous one is still dying.
 3) Modifications to some GEOM classes: DEV -- to report media change
events to devd; VFS -- to handle spoiling same as orphan to prevent
accessing replaced media. PART class already handles spoiling alike to
orphan.

Reviewed by:	silence on geom@ and scsi@
Tested by:	avg
Sponsored by:	iXsystems, Inc. / PC-BSD
MFC after:	2 months


Revision 238216 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 7 21:28:31 2012 UTC (11 years, 11 months ago) by trasz
File length: 17500 byte(s)
Diff to previous 237648
Add disk_resize(), to make it possible for the disk drivers such as da(4)
to notify GEOM about LUN size change.

Reviewed by:	mav (earlier version)
Sponsored by:	FreeBSD Foundation


Revision 237648 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jun 27 16:05:09 2012 UTC (12 years ago) by ken
File length: 16680 byte(s)
Diff to previous 237518
In g_disk_providergone(), don't continue if the softc is NULL.  This may be
the case if we've already gone through g_disk_destroy().

Reported by:	Michael Butler <imb@protected-networks.net>
MFC after:	3 days


Revision 237518 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jun 24 04:29:03 2012 UTC (12 years ago) by ken
File length: 16509 byte(s)
Diff to previous 227309
Fix a bug which causes a panic in daopen(). The panic is caused by
a da(4) instance going away while GEOM is still probing it.

In this case, the GEOM disk class instance has been created by
disk_create(), and the taste of the disk is queued in the GEOM
event queue.

While that event is queued, the da(4) instance goes away.  When the
open call comes into the da(4) driver, it dereferences the freed
(but non-NULL) peripheral pointer provided by GEOM, which results
in a panic.

The solution is to add a callback to the GEOM disk code that is
called when all of its resources are cleaned up.  This is
implemented inside GEOM by adding an optional callback that is
called when all consumers have detached from a provider, and the
provider is about to be deleted.

scsi_cd.c,
scsi_da.c:	In the register routine for the cd(4) and da(4)
		routines, acquire a reference to the CAM peripheral
		instance just before we call disk_create().

		Use the new GEOM disk d_gone() callback to register
		a callback (dadiskgonecb()/cddiskgonecb()) that
		decrements the peripheral reference count once GEOM
		has finished cleaning up its resources.

		In the cd(4) driver, clean up open and close
		behavior slightly.  GEOM makes sure we only get one
		open() and one close call, so there is no need to
		set an open flag and decrement the reference count
		if we are not the first open.

		In the cd(4) driver, use cam_periph_release_locked()
		in a couple of error scenarios to avoid extra mutex
		calls.

geom.h:		Add a new, optional, providergone callback that
		is called when a provider is about to be deleted.

geom_disk.h:	Add a new d_gone() callback to the GEOM disk
		interface.

		Bump the DISK_VERSION to version 2.  This probably
		should have been done after a couple of previous
		changes, especially the addition of the d_getattr()
		callback.

geom_disk.c:	Add a providergone callback for the disk class,
		g_disk_providergone(), that calls the user's
		d_gone() callback if it exists.

		Bump the DISK_VERSION to 2.

geom_subr.c:	In g_destroy_provider(), call the providergone
		callback if it has been provided.

		In g_new_geomf(), propagate the class's
		providergone callback to the new geom instance.

blkfront.c:	Callers of disk_create() are supposed to pass in
		DISK_VERSION, not an explicit disk API version
		number.  Update the blkfront driver to do that.

disk.9:		Update the disk(9) man page to include information
		on the new d_gone() callback, as well as the
		previously added d_getattr() callback, d_descr
		field, and HBA PCI ID fields.

MFC after:	5 days


Revision 227309 - (view) (download) (annotate) - [select for diffs]
Modified Mon Nov 7 15:43:11 2011 UTC (12 years, 7 months ago) by ed
File length: 15979 byte(s)
Diff to previous 226737
Mark all SYSCTL_NODEs static that have no corresponding SYSCTL_DECLs.

The SYSCTL_NODE macro defines a list that stores all child-elements of
that node. If there's no SYSCTL_DECL macro anywhere else, there's no
reason why it shouldn't be static.


Revision 226737 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 25 14:07:17 2011 UTC (12 years, 8 months ago) by pjd
File length: 15968 byte(s)
Diff to previous 226736
Allow upper layers to discover than BIO_DELETE and/or BIO_FLUSH is not
supported by returning EOPNOTSUPP instead of 0 or ENODEV.

MFC after:	3 days


Revision 226736 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 25 14:05:39 2011 UTC (12 years, 8 months ago) by pjd
File length: 15966 byte(s)
Diff to previous 226735
Improve style a bit.

MFC after:	3 days


Revision 226735 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 25 14:04:59 2011 UTC (12 years, 8 months ago) by pjd
File length: 15942 byte(s)
Diff to previous 223921
Simplify disk_alloc().

MFC after:	3 days


Revision 223921 - (view) (download) (annotate) - [select for diffs]
Modified Mon Jul 11 05:22:31 2011 UTC (12 years, 11 months ago) by ae
File length: 15957 byte(s)
Diff to previous 223089
Include sys/sbuf.h directly.

Reviewed by:	pjd


Revision 223089 - (view) (download) (annotate) - [select for diffs]
Modified Tue Jun 14 17:10:32 2011 UTC (13 years ago) by gibbs
File length: 15935 byte(s)
Diff to previous 222652
Plumb device physical path reporting from CAM devices, through GEOM and
DEVFS, and make it accessible via the diskinfo utility.

Extend GEOM's generic attribute query mechanism into generic disk consumers.
sys/geom/geom_disk.c:
sys/geom/geom_disk.h:
sys/cam/scsi/scsi_da.c:
sys/cam/ata/ata_da.c:
	- Allow disk providers to implement a new method which can override
	  the default BIO_GETATTR response, d_getattr(struct bio *).  This
	  function returns -1 if not handled, otherwise it returns 0 or an
	  errno to be passed to g_io_deliver().

sys/cam/scsi/scsi_da.c:
sys/cam/ata/ata_da.c:
	- Don't copy the serial number to dp->d_ident anymore, as the CAM XPT
	  is now responsible for returning this information via
	  d_getattr()->(a)dagetattr()->xpt_getatr().

sys/geom/geom_dev.c:
	- Implement a new ioctl, DIOCGPHYSPATH, which returns the GEOM
	  attribute "GEOM::physpath", if possible.  If the attribute request
	  returns a zero-length string, ENOENT is returned.

usr.sbin/diskinfo/diskinfo.c:
	- If the DIOCGPHYSPATH ioctl is successful, report physical path
	  data when diskinfo is executed with the '-v' option.

Submitted by:	will
Reviewed by:	gibbs
Sponsored by:	Spectra Logic Corporation

Add generic attribute change notification support to GEOM.

sys/sys/geom/geom.h:
	Add a new attrchanged method field to both g_class
	and g_geom.

sys/sys/geom/geom.h:
sys/geom/geom_event.c:
	- Provide the g_attr_changed() function that providers
	  can use to advertise attribute changes.
	- Perform delivery of attribute change notifications
	  from a thread context via the standard GEOM event
	  mechanism.

sys/geom/geom_subr.c:
	Inherit the attrchanged method from class to geom (class instance).

sys/geom/geom_disk.c:
	Provide disk_attr_changed() to provide g_attr_changed() access
	to consumers of the disk API.

sys/cam/scsi/scsi_pass.c:
sys/cam/scsi/scsi_da.c:
sys/geom/geom_dev.c:
sys/geom/geom_disk.c:
	Use attribute changed events to track updates to physical path
	information.

sys/cam/scsi/scsi_da.c:
	Add AC_ADVINFO_CHANGED to the registered asynchronous CAM
	events for this driver.  When this event occurs, and
	the updated buffer type references our physical path
	attribute, emit a GEOM attribute changed event via the
	disk_attr_changed() API.

sys/cam/scsi/scsi_pass.c:
	Add AC_ADVINFO_CHANGED to the registered asynchronous CAM
	events for this driver.  When this event occurs, update
	the physical patch devfs alias for this pass instance.

Submitted by:	gibbs
Sponsored by:	Spectra Logic Corporation


Revision 222652 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jun 3 13:49:18 2011 UTC (13 years, 1 month ago) by mav
File length: 15480 byte(s)
Diff to previous 219970
Update disk's stripesize and stripeoffset parameters on provider open.
They are media-dependent and may change in run-time, same as sectorsize
and/or mediasize.

SCSI devices return physical sector size and offset via READ CAPACITY(16)
command and so can not report it until media inserted or at least until
probe sequence completed. UNMAP support is also reported there.


Revision 219970 - (view) (download) (annotate) - [select for diffs]
Modified Thu Mar 24 19:23:42 2011 UTC (13 years, 3 months ago) by mav
File length: 15290 byte(s)
Diff to previous 219950
MFgraid/head r218212, r218257:
Introduce new type of BIO_GETATTR -- GEOM::setstate, used to inform lower
GEOM about state of it's providers from the point of upper layers.
Make geom_disk use led(4) subsystem to illuminate states in such fashion:
FAILED - "1" (on), REBUILD - "f5" (slow blink), RESYNC - "f1" (fast blink),
ACTIVE - "0" (off).
LED name should be set for each disk via kern.geom.disk.%s.led sysctl.
Later disk API could be extended to allow disk driver to report this info
in custom way via it's own facilities.


Revision 219950 - (view) (download) (annotate) - [select for diffs]
Modified Thu Mar 24 08:37:48 2011 UTC (13 years, 3 months ago) by mav
File length: 13249 byte(s)
Diff to previous 219056
MFgraid/head r217827:
Change BIO_GETATTR("GEOM::kerneldump") API to make set_dumper() called by
consumer (geom_dev) instead of provider (geom_disk). This allows any geom
insert it's code into the dump call chain, implementing more sophisticated
functionality then just disk partitioning.


Revision 219056 - (view) (download) (annotate) - [select for diffs]
Modified Sat Feb 26 14:58:54 2011 UTC (13 years, 4 months ago) by nwhitehorn
File length: 13307 byte(s)
Diff to previous 217915
Add the disk ident and a human-meaningful description (here, the disk model
string) to the geom_disk config XML so that they are easily accessible from
userland.

MFC after:	1 week


Revision 217915 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jan 26 22:48:09 2011 UTC (13 years, 5 months ago) by mdf
File length: 13177 byte(s)
Diff to previous 216794
Remove the CTLFLAG_NOLOCK as it seems to be both unused and
unfunctional.  Wiring the user buffer has only been done explicitly
since r101422.

Mark the kern.disks sysctl as MPSAFE since it is and it seems to have
been mis-using the NOLOCK flag.

Partially break the KPI (but not the KBI) for the sysctl_req 'lock'
field since this member should be private and the "REQ_LOCKED" state
seems meaningless now.


Revision 216794 - (view) (download) (annotate) - [select for diffs]
Modified Wed Dec 29 12:11:07 2010 UTC (13 years, 6 months ago) by kib
File length: 13171 byte(s)
Diff to previous 210471
Add reporting of GEOM::candelete BIO_GETATTR for md(4) and geom_disk(4).
Non-zero value of attribute means that device supports BIO_DELETE.

Suggested and reviewed by:	pjd
Tested by:	pho
MFC after:	1 week


Revision 210471 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jul 25 15:43:52 2010 UTC (13 years, 11 months ago) by mav
File length: 13056 byte(s)
Diff to previous 196823
Export PCI IDs of ATA/SATA controllers through CAM and ata(4) layers to
GEOM. This information needed for proper soft-RAID's on-disk metadata
reading and writing.


Revision 196823 - (view) (download) (annotate) - [select for diffs]
Modified Fri Sep 4 09:39:06 2009 UTC (14 years, 10 months ago) by pjd
File length: 12696 byte(s)
Diff to previous 190878
Simplify g_disk_ident_adjust() function and allow any printable character
in serial number.

Discussed with:	trasz
Obtained from:	Wheel Sp. z o.o. (http://www.wheel.pl)


Revision 190878 - (view) (download) (annotate) - [select for diffs]
Modified Fri Apr 10 04:08:34 2009 UTC (15 years, 2 months ago) by thompsa
File length: 12908 byte(s)
Diff to previous 190677
Revert r190676,190677

The geom and CAM changes for root_hold are the wrong solution for USB design
quirks.

Requested by:	scottl


Revision 190677 - (view) (download) (annotate) - [select for diffs]
Modified Fri Apr 3 19:49:33 2009 UTC (15 years, 3 months ago) by thompsa
File length: 12998 byte(s)
Diff to previous 184499
Add interleaving root hold tokens from the CAM probe to disk_create and geom
provider tasting. This is needed for disk attachments that happen after threads
are running in the boot process.

Tested by:	rnoland


Revision 184499 - (view) (download) (annotate) - [select for diffs]
Modified Fri Oct 31 10:11:35 2008 UTC (15 years, 8 months ago) by kib
File length: 12908 byte(s)
Diff to previous 184136
Revert r184136. Instead, push the check for crashdumpmap overflow into the
MD i386 and amd64 dump code.

Requested by:	jhb
Retested by:	pho
MFC after:	3 days (+ 176304 + 184136)


Revision 184136 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 21 18:52:38 2008 UTC (15 years, 8 months ago) by kib
File length: 12937 byte(s)
Diff to previous 181463
Do not overflow crashdumpmap.

Reported and tested by:	pho
Reviewed by:	jhb
MFC after:	1 week


Revision 181463 - (view) (download) (annotate) - [select for diffs]
Modified Sat Aug 9 11:14:05 2008 UTC (15 years, 10 months ago) by des
File length: 12908 byte(s)
Diff to previous 176304
Add sbuf_new_auto as a shortcut for the very common case of creating a
completely dynamic sbuf.

Obtained from:	Varnish
MFC after:	2 weeks


Revision 176304 - (view) (download) (annotate) - [select for diffs]
Modified Fri Feb 15 06:26:25 2008 UTC (16 years, 4 months ago) by scottl
File length: 12933 byte(s)
Diff to previous 169289
Teach the dump and minidump code to respect the maxioszie attribute of
the disk; the hard-coded assumption of 64K doesn't work in all cases.


Revision 169289 - (view) (download) (annotate) - [select for diffs]
Modified Sat May 5 18:09:17 2007 UTC (17 years, 2 months ago) by pjd
File length: 12902 byte(s)
Diff to previous 169287
Allow to use ':' in d_ident, which is quite handy character.


Revision 169287 - (view) (download) (annotate) - [select for diffs]
Modified Sat May 5 17:47:20 2007 UTC (17 years, 2 months ago) by pjd
File length: 12889 byte(s)
Diff to previous 169285
Because there are many strange hardware out there, allow to use only
[a-zA-Z0-9-_@#%.] characters in d_ident field.


Revision 169285 - (view) (download) (annotate) - [select for diffs]
Modified Sat May 5 17:12:15 2007 UTC (17 years, 2 months ago) by pjd
File length: 12091 byte(s)
Diff to previous 166861
- Extend disk structure to allow to store disk's serial number, which can be
  retrieved via GEOM::ident attribute.
- Bump disk(9) ABI version.

OK'ed by:	phk


Revision 166861 - (view) (download) (annotate) - [select for diffs]
Modified Wed Feb 21 07:45:02 2007 UTC (17 years, 4 months ago) by n_hibma
File length: 11990 byte(s)
Diff to previous 163833
Reduce the noise when plugging in (USB) mass storage devices, like a 4 port
flash card reader.
Also remove an 'Opened da0 -> <random number>' which is not needed on a daily
basis (available through bootverbose).

Reviewed by:	phk, ken
MFC after:	1 week


Revision 163833 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 31 21:12:43 2006 UTC (17 years, 8 months ago) by pjd
File length: 11975 byte(s)
Diff to previous 157619
Add a new disk flag - DISKFLAG_CANFLUSHCACHE, which indicates that the disk
can handle BIO_FLUSH requests.

Sponsored by:	home.pl


Revision 157619 - (view) (download) (annotate) - [select for diffs]
Modified Mon Apr 10 03:55:13 2006 UTC (18 years, 2 months ago) by marcel
File length: 11503 byte(s)
Diff to previous 152565
Add g_wither_provider() to abstract the details of destroying a
particular provider. Use this function where g_orphan_provider()
is being called so that the flags are updated correctly and
g_orphan_provider() is called only when allowed.


Revision 152565 - (view) (download) (annotate) - [select for diffs]
Modified Fri Nov 18 02:43:49 2005 UTC (18 years, 7 months ago) by jdp
File length: 11503 byte(s)
Diff to previous 150759
Fix a bug that caused some /dev entries to continue to exist after
the underlying drive had been hot-unplugged from the system.  Here
is a specific example.  Filesystem code had opened /dev/da1s1e.
Subsequently, the drive was hot-unplugged.  This (correctly) caused
all of the associated /dev/da1* entries to be deleted.  When the
filesystem later realized that the drive was gone it closed the
device, reducing the write-access counts to 0 on the geom providers
for da1s1e, da1s1, and da1.  This caused geom to re-taste the
providers, resulting in the devices being created again.  When the
drive was hot-plugged back in, it resulted in duplicate /dev entries
for da1s1e, da1s1, and da1.

This fix adds a new disk_gone() function which is called by CAM when a
drive goes away.  It orphans all of the providers associated with the
drive, setting an error condition of ENXIO in each one.  In addition,
we prevent a re-taste on last close for writing if an error condition
has been set in the provider.

Sponsored by:   Isilon Systems
Reviewed by:    phk
MFC after:      1 week


Revision 150759 - (view) (download) (annotate) - [select for diffs]
Modified Fri Sep 30 17:32:08 2005 UTC (18 years, 9 months ago) by tegge
File length: 11309 byte(s)
Diff to previous 143791
Move some devstat collection to below where large IO operations are chopped
up.  This make iostat report operations passed down to the device driver
instead of operations passed down to GEOM disk.  The transfer size limit
imposed by the device driver is no longer hidden, improving the correlation
between iostat output and device driver workload.


Revision 143791 - (view) (download) (annotate) - [select for diffs]
Modified Fri Mar 18 07:01:31 2005 UTC (19 years, 3 months ago) by phk
File length: 11310 byte(s)
Diff to previous 141624
After rejecting the bio request early, return instead of panicing.

Found by:	Coverity (ID#450)


Revision 141624 - (view) (download) (annotate) - [select for diffs]
Modified Thu Feb 10 12:10:35 2005 UTC (19 years, 4 months ago) by phk
File length: 11295 byte(s)
Diff to previous 140968
Make various random things static


Revision 140968 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jan 29 16:49:43 2005 UTC (19 years, 5 months ago) by phk
File length: 11288 byte(s)
Diff to previous 140261
When dumping to a unpartitioned disk, make sure to chop the
length of the dump area accordingly.

Run into by:	scottl


Revision 140261 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jan 14 21:05:35 2005 UTC (19 years, 5 months ago) by phk
File length: 11189 byte(s)
Diff to previous 138732
CAM will sometimes remove a disk again even before it finished being
initialized.  We already cancel the pending events but we need to not
dereference the geom pointer which never got set different from NULL.


Revision 138732 - (view) (download) (annotate) - [select for diffs]
Modified Sun Dec 12 10:09:05 2004 UTC (19 years, 6 months ago) by phk
File length: 11165 byte(s)
Diff to previous 133318
Pass the file->flags down to geom ioctl handlers.

Reject certain ioctls if write permission is not indicated.

Bump geom API version.

Reported by:	Ruben de Groot <mail25@bzerk.org>


Revision 133318 - (view) (download) (annotate) - [select for diffs]
Modified Sun Aug 8 07:57:53 2004 UTC (19 years, 10 months ago) by phk
File length: 11150 byte(s)
Diff to previous 133314
Tag all geom classes in the tree with a version number.


Revision 133314 - (view) (download) (annotate) - [select for diffs]
Added Sun Aug 8 06:49:07 2004 UTC (19 years, 10 months ago) by phk
File length: 11127 byte(s)
Diff to previous 131267
Use default method initialization on geoms.



This form allows you to request diffs between any two revisions of this file. For each of the two "sides" of the diff, enter a numeric revision.

  Diffs between and
  Type of Diff should be a

  ViewVC Help
Powered by ViewVC 1.1.27