Model fits displayed in this section are obtained from SASBDB.
Experimental (blue) and model (red) scattering profiles are presented as log I(q) vs. q together with error weighted residual difference plot between the experimental and model I(q) vs q. The I(q) is the intensity (preferably in absolute units of cm-1 or arbitrary units) and q is the modulus of the scattering vector. For a good fit, residual values should be equally and randomly spaced around the horizontal axis largely within +/- three standard deviations.
χ² values are a measure of the overall fit of the model to the 1D scattering profile. A model that fits the data within its error estimates will have a χ² value close to one, provided that the dominant errors are the random statistical errors (i.e. no systematic errors) from the SAS measurement that are correctly propagated. Correlation Map (CorMap) test is a variance-covariance analysis on the scattering intensities comparing two (or more) scattering profiles (e.g. model versus experiment or multiple measures from the same sample). The CorMap test complements χ² and importantly is independent of the reported errors. The method assigns a probability (P-value based on a 1-tailed Schilling test) for finding the longest string of experimental data points that lie systematically above (+1) or below (-1) the model profile. The P-value lies between 0 – 1 and a significance threshold is chosen below which the model fit is judged to show systematic deviation from experiment. A typical range statisticians use to indicate significant deviation is 0.01 - 0.05. As implemented in the ATSAS suite, the reported CorMap P-value is green (model fit is good) for P > 0.05, yellow for 0.01 < P < 0.05, and red (model deviates significantly) for P < 0.01.
| SASDB ID | Model | χ² | P-value |
|---|---|---|---|
|
SASDBV9
|
1
|
1.28
|
0.02
|
|
SASDBV9
|
2
|
1.10
|
0.01
|
|
SASDBW9
|
1
|
1.97
|
0.00
|
|
SASDBZ9
|
1
|
1.94
|
0.00
|
|
SASDBX9
|
1
|
2.86
|
0.00
|
|
SASDBY9
|
1
|
2.02
|
0.00
|
This table summarizes information about crosslinker(s) used for data generation, and how crosslinking information was translated into actual modeling restraints. Restraints assigned "by-residue" are interpreted as between CA atoms. Restraints between coarse-grained beads are indicated as "coarse-grained". Restraint group represents a set of crosslinking restraints applied collectively in the modeling.
There are 1071 crosslinking restraints combined in 615 restraint groups.
| Linker | Residue 1 | Atom 1 | Residue 2 | Atom 2 | Restraint type | Distance, Ã… | Count |
|---|---|---|---|---|---|---|---|
| DSS | LYS | coarse-grained | LYS | coarse-grained | upper bound | 26.00 | 916 |
| DSS | LYS | CA | LYS | CA | upper bound | 26.00 | 102 |
| DSS | LYS | coarse-grained | THR | coarse-grained | upper bound | 26.00 | 13 |
| DSS | LYS | coarse-grained | MET | coarse-grained | upper bound | 26.00 | 27 |
| DSS | ASN | coarse-grained | LYS | coarse-grained | upper bound | 26.00 | 2 |
| DSS | ARG | coarse-grained | LYS | coarse-grained | upper bound | 26.00 | 2 |
| DSS | GLN | coarse-grained | LYS | coarse-grained | upper bound | 26.00 | 2 |
| DSS | ALA | coarse-grained | LYS | coarse-grained | upper bound | 26.00 | 2 |
| DSS | LYS | coarse-grained | VAL | coarse-grained | upper bound | 26.00 | 3 |
| DSS | GLN | coarse-grained | MET | coarse-grained | upper bound | 26.00 | 1 |
| DSS | LYS | CA | MET | CA | upper bound | 26.00 | 1 |
Distograms of individual restraints
Distograms (i.e., histogram plots of distances) provide an overview of distributions of distances between residues for which chemical crosslinks were identified. The shift of the distogram relative to the threshold value may indicate a poor model. Restraints with identical thresholds are grouped into one plot. Only the best distance per restraint per model group/ensemble is plotted. Inter- and intramolecular (including self-links) restraints are also grouped into one plot. Distance for a restraint between coarse-grained beads is calculated as a minimal distance between shells; if beads intersect, the distance will be reported as 0.0. A bead with the highest available resolution for a given residue is used for the assessment.
Satisfaction of restraints is calculated on a restraint group (a set of crosslinking restraints applied collectively in the modeling) level. Satisfaction of a restraint group depends on satisfaction of individual restraints in the group and the conditionality (all/any). A restraint group is considered satisfied, if the condition was met in at least one model of the model group/ensemble. The number of measured restraints can be smaller than the total number of restraint groups if crosslinks involve non-modeled residues. Only deposited models are used for validation right now.
| State group | State | Model group | # of Deposited models/Total | Restraint group type | Satisfied (%) | Violated (%) | Count (Total=615) |
|---|---|---|---|---|---|---|---|
| 1 | 1 | 1 | 1/5 | All | 87.52 | 12.48 | 609 |
| Self-links/ Intramolecular |
96.67 | 3.33 | 300 | ||||
| Heteromeric links/ Intermolecular |
75.37 | 24.63 | 268 | ||||
| Self-links/ Ambiguous |
100.00 | 0.00 | 37 | ||||
| Self-links/ Intermolecular |
100.00 | 0.00 | 4 | ||||
| 1 | 1 | 2 | 1/1000 | All | Not available | Not available | 0 |
Per-model satisfaction rates in model groups/ensembles
Every point represents one model in a model group/ensemble. Where possible, boxplots with quartile marks are also plotted.
This section describes fit of models to the 3DEM data. Only results for the representative model, selected as a first model with the largest number of asymmetric units.
3DEM validation for coarse-grained structures is under development.