Amino acid dipepetide frequency for Desulfosarcina ovata subsp. ovata

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.865AlaAla: 9.865 ± 0.115
1.342AlaCys: 1.342 ± 0.03
5.821AlaAsp: 5.821 ± 0.058
5.517AlaGlu: 5.517 ± 0.066
3.76AlaPhe: 3.76 ± 0.046
7.669AlaGly: 7.669 ± 0.072
1.825AlaHis: 1.825 ± 0.031
5.941AlaIle: 5.941 ± 0.067
3.808AlaLys: 3.808 ± 0.057
9.17AlaLeu: 9.17 ± 0.086
2.984AlaMet: 2.984 ± 0.044
2.658AlaAsn: 2.658 ± 0.039
3.368AlaPro: 3.368 ± 0.046
2.773AlaGln: 2.773 ± 0.036
5.573AlaArg: 5.573 ± 0.065
4.696AlaSer: 4.696 ± 0.05
4.616AlaThr: 4.616 ± 0.059
7.015AlaVal: 7.015 ± 0.057
1.05AlaTrp: 1.05 ± 0.027
2.529AlaTyr: 2.529 ± 0.035
0.0AlaXaa: 0.0 ± 0.0
Cys
0.97CysAla: 0.97 ± 0.026
0.263CysCys: 0.263 ± 0.013
0.676CysAsp: 0.676 ± 0.019
0.628CysGlu: 0.628 ± 0.02
0.529CysPhe: 0.529 ± 0.016
1.329CysGly: 1.329 ± 0.03
0.42CysHis: 0.42 ± 0.016
0.729CysIle: 0.729 ± 0.02
0.483CysLys: 0.483 ± 0.018
1.261CysLeu: 1.261 ± 0.023
0.33CysMet: 0.33 ± 0.013
0.414CysAsn: 0.414 ± 0.015
0.792CysPro: 0.792 ± 0.021
0.451CysGln: 0.451 ± 0.013
1.01CysArg: 1.01 ± 0.023
0.73CysSer: 0.73 ± 0.021
0.643CysThr: 0.643 ± 0.018
0.75CysVal: 0.75 ± 0.022
0.138CysTrp: 0.138 ± 0.009
0.4CysTyr: 0.4 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
5.56AspAla: 5.56 ± 0.06
0.689AspCys: 0.689 ± 0.019
3.492AspAsp: 3.492 ± 0.06
3.592AspGlu: 3.592 ± 0.048
2.606AspPhe: 2.606 ± 0.038
4.527AspGly: 4.527 ± 0.061
1.4AspHis: 1.4 ± 0.029
3.927AspIle: 3.927 ± 0.043
2.351AspLys: 2.351 ± 0.042
6.16AspLeu: 6.16 ± 0.064
1.388AspMet: 1.388 ± 0.028
1.841AspAsn: 1.841 ± 0.031
3.303AspPro: 3.303 ± 0.045
2.128AspGln: 2.128 ± 0.034
4.133AspArg: 4.133 ± 0.056
2.688AspSer: 2.688 ± 0.042
2.972AspThr: 2.972 ± 0.04
3.719AspVal: 3.719 ± 0.048
0.718AspTrp: 0.718 ± 0.018
1.949AspTyr: 1.949 ± 0.035
0.0AspXaa: 0.0 ± 0.0
Glu
5.666GluAla: 5.666 ± 0.068
0.514GluCys: 0.514 ± 0.018
2.963GluAsp: 2.963 ± 0.041
3.558GluGlu: 3.558 ± 0.058
1.872GluPhe: 1.872 ± 0.031
3.484GluGly: 3.484 ± 0.048
1.062GluHis: 1.062 ± 0.025
4.46GluIle: 4.46 ± 0.05
4.658GluLys: 4.658 ± 0.067
5.289GluLeu: 5.289 ± 0.053
1.894GluMet: 1.894 ± 0.033
2.773GluAsn: 2.773 ± 0.041
2.139GluPro: 2.139 ± 0.031
2.225GluGln: 2.225 ± 0.036
3.756GluArg: 3.756 ± 0.042
3.297GluSer: 3.297 ± 0.042
4.039GluThr: 4.039 ± 0.047
3.453GluVal: 3.453 ± 0.04
0.567GluTrp: 0.567 ± 0.017
1.491GluTyr: 1.491 ± 0.027
0.0GluXaa: 0.0 ± 0.0
Phe
3.225PheAla: 3.225 ± 0.043
0.597PheCys: 0.597 ± 0.018
2.736PheAsp: 2.736 ± 0.038
2.292PheGlu: 2.292 ± 0.036
2.086PhePhe: 2.086 ± 0.04
3.19PheGly: 3.19 ± 0.046
0.877PheHis: 0.877 ± 0.022
2.48PheIle: 2.48 ± 0.038
1.918PheLys: 1.918 ± 0.035
3.888PheLeu: 3.888 ± 0.054
1.023PheMet: 1.023 ± 0.023
1.618PheAsn: 1.618 ± 0.031
1.712PhePro: 1.712 ± 0.03
1.314PheGln: 1.314 ± 0.022
2.202PheArg: 2.202 ± 0.036
3.133PheSer: 3.133 ± 0.046
2.163PheThr: 2.163 ± 0.032
2.646PheVal: 2.646 ± 0.04
0.509PheTrp: 0.509 ± 0.015
1.316PheTyr: 1.316 ± 0.025
0.0PheXaa: 0.0 ± 0.0
Gly
5.969GlyAla: 5.969 ± 0.06
1.227GlyCys: 1.227 ± 0.024
3.873GlyAsp: 3.873 ± 0.053
4.195GlyGlu: 4.195 ± 0.046
3.288GlyPhe: 3.288 ± 0.042
5.679GlyGly: 5.679 ± 0.079
1.671GlyHis: 1.671 ± 0.032
5.463GlyIle: 5.463 ± 0.054
4.286GlyLys: 4.286 ± 0.049
7.463GlyLeu: 7.463 ± 0.069
2.414GlyMet: 2.414 ± 0.036
2.605GlyAsn: 2.605 ± 0.041
2.528GlyPro: 2.528 ± 0.039
2.682GlyGln: 2.682 ± 0.037
4.882GlyArg: 4.882 ± 0.052
4.164GlySer: 4.164 ± 0.05
4.219GlyThr: 4.219 ± 0.05
5.188GlyVal: 5.188 ± 0.054
1.012GlyTrp: 1.012 ± 0.025
2.613GlyTyr: 2.613 ± 0.041
0.0GlyXaa: 0.0 ± 0.0
His
1.811HisAla: 1.811 ± 0.035
0.357HisCys: 0.357 ± 0.013
1.136HisAsp: 1.136 ± 0.025
1.027HisGlu: 1.027 ± 0.023
1.024HisPhe: 1.024 ± 0.024
1.628HisGly: 1.628 ± 0.032
0.649HisHis: 0.649 ± 0.022
1.304HisIle: 1.304 ± 0.026
0.773HisLys: 0.773 ± 0.02
2.459HisLeu: 2.459 ± 0.037
0.506HisMet: 0.506 ± 0.016
0.65HisAsn: 0.65 ± 0.017
1.407HisPro: 1.407 ± 0.027
0.892HisGln: 0.892 ± 0.022
1.579HisArg: 1.579 ± 0.028
1.05HisSer: 1.05 ± 0.023
1.097HisThr: 1.097 ± 0.024
1.344HisVal: 1.344 ± 0.024
0.301HisTrp: 0.301 ± 0.014
0.721HisTyr: 0.721 ± 0.019
0.0HisXaa: 0.0 ± 0.0
Ile
5.865IleAla: 5.865 ± 0.059
0.895IleCys: 0.895 ± 0.022
4.672IleAsp: 4.672 ± 0.054
4.265IleGlu: 4.265 ± 0.051
2.593IlePhe: 2.593 ± 0.04
5.004IleGly: 5.004 ± 0.062
1.545IleHis: 1.545 ± 0.03
3.812IleIle: 3.812 ± 0.051
3.094IleLys: 3.094 ± 0.044
5.797IleLeu: 5.797 ± 0.056
1.46IleMet: 1.46 ± 0.027
2.486IleAsn: 2.486 ± 0.038
3.187IlePro: 3.187 ± 0.043
2.25IleGln: 2.25 ± 0.035
4.2IleArg: 4.2 ± 0.045
3.875IleSer: 3.875 ± 0.048
3.526IleThr: 3.526 ± 0.047
4.536IleVal: 4.536 ± 0.046
0.579IleTrp: 0.579 ± 0.017
1.746IleTyr: 1.746 ± 0.03
0.0IleXaa: 0.0 ± 0.0
Lys
4.719LysAla: 4.719 ± 0.065
0.45LysCys: 0.45 ± 0.016
2.668LysAsp: 2.668 ± 0.041
3.13LysGlu: 3.13 ± 0.052
1.327LysPhe: 1.327 ± 0.027
3.675LysGly: 3.675 ± 0.043
0.888LysHis: 0.888 ± 0.023
3.537LysIle: 3.537 ± 0.053
3.709LysLys: 3.709 ± 0.056
3.874LysLeu: 3.874 ± 0.05
1.479LysMet: 1.479 ± 0.029
2.221LysAsn: 2.221 ± 0.038
2.238LysPro: 2.238 ± 0.032
1.84LysGln: 1.84 ± 0.03
3.249LysArg: 3.249 ± 0.044
2.775LysSer: 2.775 ± 0.039
3.334LysThr: 3.334 ± 0.039
2.972LysVal: 2.972 ± 0.04
0.55LysTrp: 0.55 ± 0.017
1.298LysTyr: 1.298 ± 0.026
0.0LysXaa: 0.0 ± 0.0
Leu
9.611LeuAla: 9.611 ± 0.103
1.288LeuCys: 1.288 ± 0.025
5.869LeuAsp: 5.869 ± 0.06
5.984LeuGlu: 5.984 ± 0.051
4.232LeuPhe: 4.232 ± 0.046
6.892LeuGly: 6.892 ± 0.066
1.783LeuHis: 1.783 ± 0.029
5.953LeuIle: 5.953 ± 0.054
5.645LeuLys: 5.645 ± 0.058
9.267LeuLeu: 9.267 ± 0.096
2.509LeuMet: 2.509 ± 0.034
3.626LeuAsn: 3.626 ± 0.042
4.857LeuPro: 4.857 ± 0.057
2.887LeuGln: 2.887 ± 0.039
5.164LeuArg: 5.164 ± 0.054
6.641LeuSer: 6.641 ± 0.07
5.463LeuThr: 5.463 ± 0.05
6.567LeuVal: 6.567 ± 0.064
0.981LeuTrp: 0.981 ± 0.024
2.409LeuTyr: 2.409 ± 0.037
0.0LeuXaa: 0.0 ± 0.0
Met
3.279MetAla: 3.279 ± 0.045
0.244MetCys: 0.244 ± 0.012
1.773MetAsp: 1.773 ± 0.031
1.673MetGlu: 1.673 ± 0.031
0.74MetPhe: 0.74 ± 0.02
2.214MetGly: 2.214 ± 0.035
0.513MetHis: 0.513 ± 0.015
1.786MetIle: 1.786 ± 0.027
1.461MetLys: 1.461 ± 0.026
2.487MetLeu: 2.487 ± 0.037
0.714MetMet: 0.714 ± 0.018
1.089MetAsn: 1.089 ± 0.021
1.351MetPro: 1.351 ± 0.027
0.988MetGln: 0.988 ± 0.024
1.43MetArg: 1.43 ± 0.026
1.345MetSer: 1.345 ± 0.026
1.613MetThr: 1.613 ± 0.029
2.163MetVal: 2.163 ± 0.034
0.187MetTrp: 0.187 ± 0.009
0.416MetTyr: 0.416 ± 0.015
0.0MetXaa: 0.0 ± 0.0
Asn
3.033AsnAla: 3.033 ± 0.042
0.445AsnCys: 0.445 ± 0.014
1.881AsnAsp: 1.881 ± 0.033
1.783AsnGlu: 1.783 ± 0.03
1.295AsnPhe: 1.295 ± 0.031
2.741AsnGly: 2.741 ± 0.036
0.817AsnHis: 0.817 ± 0.022
2.365AsnIle: 2.365 ± 0.038
1.481AsnLys: 1.481 ± 0.029
3.655AsnLeu: 3.655 ± 0.042
0.852AsnMet: 0.852 ± 0.023
1.213AsnAsn: 1.213 ± 0.024
2.197AsnPro: 2.197 ± 0.039
1.375AsnGln: 1.375 ± 0.024
2.582AsnArg: 2.582 ± 0.038
1.61AsnSer: 1.61 ± 0.032
1.748AsnThr: 1.748 ± 0.032
2.28AsnVal: 2.28 ± 0.035
0.441AsnTrp: 0.441 ± 0.014
1.07AsnTyr: 1.07 ± 0.023
0.0AsnXaa: 0.0 ± 0.0
Pro
4.278ProAla: 4.278 ± 0.046
0.496ProCys: 0.496 ± 0.017
3.443ProAsp: 3.443 ± 0.039
3.641ProGlu: 3.641 ± 0.044
2.059ProPhe: 2.059 ± 0.038
3.638ProGly: 3.638 ± 0.044
0.979ProHis: 0.979 ± 0.028
2.489ProIle: 2.489 ± 0.039
1.933ProLys: 1.933 ± 0.034
4.286ProLeu: 4.286 ± 0.046
1.263ProMet: 1.263 ± 0.021
1.251ProAsn: 1.251 ± 0.025
2.198ProPro: 2.198 ± 0.043
1.558ProGln: 1.558 ± 0.031
2.088ProArg: 2.088 ± 0.035
2.415ProSer: 2.415 ± 0.033
2.233ProThr: 2.233 ± 0.037
3.985ProVal: 3.985 ± 0.046
0.546ProTrp: 0.546 ± 0.019
1.299ProTyr: 1.299 ± 0.028
0.0ProXaa: 0.0 ± 0.0
Gln
3.659GlnAla: 3.659 ± 0.049
0.392GlnCys: 0.392 ± 0.015
1.591GlnAsp: 1.591 ± 0.024
1.834GlnGlu: 1.834 ± 0.027
1.226GlnPhe: 1.226 ± 0.026
2.324GlnGly: 2.324 ± 0.034
0.699GlnHis: 0.699 ± 0.017
2.345GlnIle: 2.345 ± 0.03
1.941GlnLys: 1.941 ± 0.032
3.41GlnLeu: 3.41 ± 0.04
1.104GlnMet: 1.104 ± 0.026
1.218GlnAsn: 1.218 ± 0.025
1.583GlnPro: 1.583 ± 0.028
1.428GlnGln: 1.428 ± 0.035
2.399GlnArg: 2.399 ± 0.044
1.916GlnSer: 1.916 ± 0.029
2.143GlnThr: 2.143 ± 0.035
2.49GlnVal: 2.49 ± 0.034
0.515GlnTrp: 0.515 ± 0.018
0.916GlnTyr: 0.916 ± 0.022
0.0GlnXaa: 0.0 ± 0.0
Arg
4.397ArgAla: 4.397 ± 0.048
0.858ArgCys: 0.858 ± 0.021
3.048ArgAsp: 3.048 ± 0.037
3.761ArgGlu: 3.761 ± 0.049
3.138ArgPhe: 3.138 ± 0.036
3.245ArgGly: 3.245 ± 0.039
1.665ArgHis: 1.665 ± 0.028
4.644ArgIle: 4.644 ± 0.043
3.329ArgLys: 3.329 ± 0.04
6.976ArgLeu: 6.976 ± 0.069
1.922ArgMet: 1.922 ± 0.034
1.98ArgAsn: 1.98 ± 0.027
2.658ArgPro: 2.658 ± 0.034
3.017ArgGln: 3.017 ± 0.043
4.345ArgArg: 4.345 ± 0.055
3.332ArgSer: 3.332 ± 0.042
2.948ArgThr: 2.948 ± 0.042
4.083ArgVal: 4.083 ± 0.053
0.831ArgTrp: 0.831 ± 0.02
2.16ArgTyr: 2.16 ± 0.034
0.0ArgXaa: 0.0 ± 0.0
Ser
4.926SerAla: 4.926 ± 0.053
0.633SerCys: 0.633 ± 0.019
3.315SerAsp: 3.315 ± 0.047
3.136SerGlu: 3.136 ± 0.046
2.18SerPhe: 2.18 ± 0.033
5.372SerGly: 5.372 ± 0.062
1.254SerHis: 1.254 ± 0.025
3.579SerIle: 3.579 ± 0.047
2.359SerLys: 2.359 ± 0.04
5.578SerLeu: 5.578 ± 0.06
1.568SerMet: 1.568 ± 0.03
1.647SerAsn: 1.647 ± 0.03
2.646SerPro: 2.646 ± 0.038
1.9SerGln: 1.9 ± 0.029
3.564SerArg: 3.564 ± 0.043
3.113SerSer: 3.113 ± 0.051
2.735SerThr: 2.735 ± 0.038
3.986SerVal: 3.986 ± 0.046
0.641SerTrp: 0.641 ± 0.018
1.517SerTyr: 1.517 ± 0.032
0.0SerXaa: 0.0 ± 0.0
Thr
5.335ThrAla: 5.335 ± 0.058
0.673ThrCys: 0.673 ± 0.019
3.324ThrAsp: 3.324 ± 0.05
2.794ThrGlu: 2.794 ± 0.038
2.108ThrPhe: 2.108 ± 0.036
4.995ThrGly: 4.995 ± 0.057
1.194ThrHis: 1.194 ± 0.025
3.92ThrIle: 3.92 ± 0.049
1.803ThrLys: 1.803 ± 0.03
5.667ThrLeu: 5.667 ± 0.058
1.31ThrMet: 1.31 ± 0.025
1.606ThrAsn: 1.606 ± 0.03
2.934ThrPro: 2.934 ± 0.035
1.586ThrGln: 1.586 ± 0.026
3.092ThrArg: 3.092 ± 0.038
2.568ThrSer: 2.568 ± 0.04
2.92ThrThr: 2.92 ± 0.043
4.516ThrVal: 4.516 ± 0.063
0.565ThrTrp: 0.565 ± 0.016
1.571ThrTyr: 1.571 ± 0.036
0.0ThrXaa: 0.0 ± 0.0
Val
6.624ValAla: 6.624 ± 0.071
0.984ValCys: 0.984 ± 0.024
4.516ValAsp: 4.516 ± 0.051
4.192ValGlu: 4.192 ± 0.049
3.043ValPhe: 3.043 ± 0.041
4.814ValGly: 4.814 ± 0.055
1.37ValHis: 1.37 ± 0.031
4.388ValIle: 4.388 ± 0.049
3.194ValLys: 3.194 ± 0.039
6.659ValLeu: 6.659 ± 0.066
1.803ValMet: 1.803 ± 0.03
2.528ValAsn: 2.528 ± 0.04
3.072ValPro: 3.072 ± 0.038
2.087ValGln: 2.087 ± 0.037
4.069ValArg: 4.069 ± 0.048
4.208ValSer: 4.208 ± 0.047
3.919ValThr: 3.919 ± 0.055
5.581ValVal: 5.581 ± 0.063
0.711ValTrp: 0.711 ± 0.021
1.9ValTyr: 1.9 ± 0.031
0.0ValXaa: 0.0 ± 0.0
Trp
0.842TrpAla: 0.842 ± 0.021
0.165TrpCys: 0.165 ± 0.009
0.637TrpAsp: 0.637 ± 0.022
0.665TrpGlu: 0.665 ± 0.018
0.493TrpPhe: 0.493 ± 0.016
0.774TrpGly: 0.774 ± 0.02
0.281TrpHis: 0.281 ± 0.012
0.75TrpIle: 0.75 ± 0.019
0.569TrpLys: 0.569 ± 0.015
1.236TrpLeu: 1.236 ± 0.027
0.34TrpMet: 0.34 ± 0.013
0.434TrpAsn: 0.434 ± 0.017
0.485TrpPro: 0.485 ± 0.016
0.579TrpGln: 0.579 ± 0.017
0.728TrpArg: 0.728 ± 0.021
0.602TrpSer: 0.602 ± 0.02
0.586TrpThr: 0.586 ± 0.018
0.763TrpVal: 0.763 ± 0.021
0.166TrpTrp: 0.166 ± 0.01
0.308TrpTyr: 0.308 ± 0.013
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.347TyrAla: 2.347 ± 0.033
0.434TyrCys: 0.434 ± 0.016
1.709TyrAsp: 1.709 ± 0.036
1.494TyrGlu: 1.494 ± 0.029
1.327TyrPhe: 1.327 ± 0.03
2.229TyrGly: 2.229 ± 0.04
0.822TyrHis: 0.822 ± 0.018
1.512TyrIle: 1.512 ± 0.028
1.134TyrLys: 1.134 ± 0.03
3.046TyrLeu: 3.046 ± 0.042
0.603TyrMet: 0.603 ± 0.015
1.005TyrAsn: 1.005 ± 0.026
1.426TyrPro: 1.426 ± 0.027
1.175TyrGln: 1.175 ± 0.023
2.291TyrArg: 2.291 ± 0.032
1.492TyrSer: 1.492 ± 0.028
1.564TyrThr: 1.564 ± 0.03
1.625TyrVal: 1.625 ± 0.029
0.395TyrTrp: 0.395 ± 0.014
1.054TyrTyr: 1.054 ± 0.025
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6391 proteins (2080000 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski