Amino acid dipeptide frequency for Suicoccus acidiformans

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
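As a rough illustration of what such a calculation involves, the sketch below counts ordered pairs of adjacent residues in a list of protein sequences and reports them in per mille. It is only a minimal sketch under stated assumptions: the function names are hypothetical, pairs are counted with overlapping windows and never across protein boundaries, and any letter outside the 20 standard amino acids is mapped to Xaa. The page does not say how the database itself performs the counting.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the same order as the table header.
AMINO_ACIDS = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def to_three_letter(residue):
    """Map a one-letter code to its three-letter code (assumption: unknown -> Xaa)."""
    return AMINO_ACIDS.get(residue, "Xaa")


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        codes = [to_three_letter(r) for r in seq.upper()]
        # Assumption: overlapping windows, counted within each protein only.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    names = list(AMINO_ACIDS.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


# Toy input, not real proteome data.
toy = ["MALLA", "AAX"]
freqs = dipeptide_frequencies(toy)
print(len(freqs), round(freqs["AlaAla"], 3))
```

With a real proteome, the list of sequences would come from a FASTA file or similar source, and the resulting dictionary would have the same 21 × 21 layout as the table on this page.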

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
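The arithmetic behind the expected value can be sketched in a few lines. The classification rule below is an assumption made for illustration (a plain comparison against the uniform expectation); the page does not state the exact threshold used for the red and blue marking. The example values are taken from the table.

```python
# Expected frequency if every dipeptide were equally likely, in per mille.
expected_standard = 1000.0 / 20**2   # 400 dipeptides -> 2.5 per mille (0.25%)
expected_with_xaa = 1000.0 / 21**2   # 441 dipeptides, Xaa counted as a 21st residue


def classify(freq_permille, expected=expected_standard):
    """Hypothetical rule: compare a table value with the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# A few values copied from the table (per mille).
examples = {"LeuLeu": 9.288, "AlaAla": 5.814, "CysCys": 0.056, "TrpTrp": 0.108}
for name, value in examples.items():
    print(name, value, classify(value))
```

Under this rule, LeuLeu and AlaAla fall above the 2.5‰ expectation, while CysCys and TrpTrp fall well below it.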

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
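A short sketch of the unit conversion, using the AlaAla value from the table. The last step, turning the fraction into an approximate number of occurrences, rests on an assumption that is not confirmed by the page: that dipeptides are counted with overlapping windows inside each protein, so that the total number of dipeptides equals the number of amino acids minus the number of proteins (both totals are the ones reported below the table).

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent."""
    return value * 0.1


ala_ala = 5.814                       # AlaAla frequency from the table, in per mille
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)

# Assumption: one dipeptide per adjacent pair, none spanning two proteins.
n_amino_acids = 678505
n_proteins = 2209
n_dipeptides = n_amino_acids - n_proteins
estimated_count = fraction * n_dipeptides

print(fraction, percent, round(estimated_count))
```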

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.814AlaAla: 5.814 ± 0.148
0.517AlaCys: 0.517 ± 0.029
4.831AlaAsp: 4.831 ± 0.095
6.162AlaGlu: 6.162 ± 0.123
3.385AlaPhe: 3.385 ± 0.083
5.693AlaGly: 5.693 ± 0.105
1.536AlaHis: 1.536 ± 0.054
6.242AlaIle: 6.242 ± 0.114
4.424AlaLys: 4.424 ± 0.087
8.483AlaLeu: 8.483 ± 0.124
2.472AlaMet: 2.472 ± 0.07
3.632AlaAsn: 3.632 ± 0.078
2.336AlaPro: 2.336 ± 0.071
3.718AlaGln: 3.718 ± 0.108
3.186AlaArg: 3.186 ± 0.079
5.014AlaSer: 5.014 ± 0.1
4.27AlaThr: 4.27 ± 0.08
5.161AlaVal: 5.161 ± 0.108
0.704AlaTrp: 0.704 ± 0.03
3.48AlaTyr: 3.48 ± 0.076
0.0AlaXaa: 0.0 ± 0.0
Cys
0.31CysAla: 0.31 ± 0.024
0.056CysCys: 0.056 ± 0.009
0.289CysAsp: 0.289 ± 0.019
0.307CysGlu: 0.307 ± 0.021
0.276CysPhe: 0.276 ± 0.023
0.438CysGly: 0.438 ± 0.025
0.155CysHis: 0.155 ± 0.019
0.349CysIle: 0.349 ± 0.027
0.172CysLys: 0.172 ± 0.017
0.572CysLeu: 0.572 ± 0.031
0.109CysMet: 0.109 ± 0.013
0.165CysAsn: 0.165 ± 0.014
0.239CysPro: 0.239 ± 0.018
0.305CysGln: 0.305 ± 0.021
0.267CysArg: 0.267 ± 0.02
0.251CysSer: 0.251 ± 0.02
0.217CysThr: 0.217 ± 0.017
0.329CysVal: 0.329 ± 0.025
0.041CysTrp: 0.041 ± 0.009
0.236CysTyr: 0.236 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
4.656AspAla: 4.656 ± 0.095
0.29AspCys: 0.29 ± 0.024
3.073AspAsp: 3.073 ± 0.099
4.949AspGlu: 4.949 ± 0.111
2.788AspPhe: 2.788 ± 0.064
3.876AspGly: 3.876 ± 0.145
1.088AspHis: 1.088 ± 0.04
4.691AspIle: 4.691 ± 0.085
2.814AspLys: 2.814 ± 0.077
5.833AspLeu: 5.833 ± 0.101
1.593AspMet: 1.593 ± 0.048
2.299AspAsn: 2.299 ± 0.071
2.237AspPro: 2.237 ± 0.098
2.644AspGln: 2.644 ± 0.067
2.508AspArg: 2.508 ± 0.065
3.066AspSer: 3.066 ± 0.067
3.231AspThr: 3.231 ± 0.083
4.302AspVal: 4.302 ± 0.08
0.609AspTrp: 0.609 ± 0.03
3.008AspTyr: 3.008 ± 0.078
0.0AspXaa: 0.0 ± 0.0
Glu
8.37GluAla: 8.37 ± 0.156
0.286GluCys: 0.286 ± 0.022
4.514GluAsp: 4.514 ± 0.084
6.987GluGlu: 6.987 ± 0.15
2.413GluPhe: 2.413 ± 0.062
4.724GluGly: 4.724 ± 0.11
1.615GluHis: 1.615 ± 0.047
5.269GluIle: 5.269 ± 0.106
3.805GluLys: 3.805 ± 0.093
7.052GluLeu: 7.052 ± 0.117
2.127GluMet: 2.127 ± 0.059
3.167GluAsn: 3.167 ± 0.081
2.455GluPro: 2.455 ± 0.097
3.593GluGln: 3.593 ± 0.092
4.049GluArg: 4.049 ± 0.104
4.066GluSer: 4.066 ± 0.088
4.37GluThr: 4.37 ± 0.099
5.91GluVal: 5.91 ± 0.094
0.709GluTrp: 0.709 ± 0.038
2.365GluTyr: 2.365 ± 0.065
0.0GluXaa: 0.0 ± 0.0
Phe
2.983PheAla: 2.983 ± 0.087
0.245PheCys: 0.245 ± 0.019
2.675PheAsp: 2.675 ± 0.056
2.84PheGlu: 2.84 ± 0.078
1.76PhePhe: 1.76 ± 0.058
2.88PheGly: 2.88 ± 0.076
0.724PheHis: 0.724 ± 0.033
3.033PheIle: 3.033 ± 0.073
1.978PheLys: 1.978 ± 0.052
3.72PheLeu: 3.72 ± 0.095
1.023PheMet: 1.023 ± 0.045
1.957PheAsn: 1.957 ± 0.051
1.34PhePro: 1.34 ± 0.047
1.754PheGln: 1.754 ± 0.059
1.452PheArg: 1.452 ± 0.046
2.377PheSer: 2.377 ± 0.067
2.255PheThr: 2.255 ± 0.051
2.834PheVal: 2.834 ± 0.073
0.377PheTrp: 0.377 ± 0.023
1.57PheTyr: 1.57 ± 0.048
0.0PheXaa: 0.0 ± 0.0
Gly
4.884GlyAla: 4.884 ± 0.107
0.349GlyCys: 0.349 ± 0.023
3.645GlyAsp: 3.645 ± 0.087
4.883GlyGlu: 4.883 ± 0.095
2.774GlyPhe: 2.774 ± 0.073
4.413GlyGly: 4.413 ± 0.094
1.459GlyHis: 1.459 ± 0.048
5.123GlyIle: 5.123 ± 0.096
3.512GlyLys: 3.512 ± 0.089
6.592GlyLeu: 6.592 ± 0.097
1.892GlyMet: 1.892 ± 0.054
2.682GlyAsn: 2.682 ± 0.068
1.717GlyPro: 1.717 ± 0.052
3.393GlyGln: 3.393 ± 0.086
3.06GlyArg: 3.06 ± 0.074
3.956GlySer: 3.956 ± 0.082
3.679GlyThr: 3.679 ± 0.096
4.892GlyVal: 4.892 ± 0.097
0.598GlyTrp: 0.598 ± 0.035
2.865GlyTyr: 2.865 ± 0.072
0.0GlyXaa: 0.0 ± 0.0
His
1.396HisAla: 1.396 ± 0.044
0.186HisCys: 0.186 ± 0.017
1.119HisAsp: 1.119 ± 0.043
1.396HisGlu: 1.396 ± 0.046
1.073HisPhe: 1.073 ± 0.043
1.385HisGly: 1.385 ± 0.054
0.648HisHis: 0.648 ± 0.04
1.469HisIle: 1.469 ± 0.056
0.899HisLys: 0.899 ± 0.039
2.128HisLeu: 2.128 ± 0.062
0.488HisMet: 0.488 ± 0.028
0.794HisAsn: 0.794 ± 0.034
1.051HisPro: 1.051 ± 0.045
1.092HisGln: 1.092 ± 0.042
1.005HisArg: 1.005 ± 0.041
1.125HisSer: 1.125 ± 0.041
1.155HisThr: 1.155 ± 0.046
1.282HisVal: 1.282 ± 0.046
0.209HisTrp: 0.209 ± 0.017
1.073HisTyr: 1.073 ± 0.046
0.0HisXaa: 0.0 ± 0.0
Ile
5.963IleAla: 5.963 ± 0.108
0.444IleCys: 0.444 ± 0.025
4.833IleAsp: 4.833 ± 0.086
5.909IleGlu: 5.909 ± 0.117
2.587IlePhe: 2.587 ± 0.078
4.9IleGly: 4.9 ± 0.105
1.665IleHis: 1.665 ± 0.055
5.368IleIle: 5.368 ± 0.115
3.06IleLys: 3.06 ± 0.083
6.617IleLeu: 6.617 ± 0.128
1.587IleMet: 1.587 ± 0.059
3.129IleAsn: 3.129 ± 0.067
3.092IlePro: 3.092 ± 0.069
3.882IleGln: 3.882 ± 0.097
3.132IleArg: 3.132 ± 0.08
4.214IleSer: 4.214 ± 0.079
3.813IleThr: 3.813 ± 0.087
4.872IleVal: 4.872 ± 0.084
0.504IleTrp: 0.504 ± 0.027
2.676IleTyr: 2.676 ± 0.072
0.0IleXaa: 0.0 ± 0.0
Lys
4.398LysAla: 4.398 ± 0.089
0.189LysCys: 0.189 ± 0.018
3.306LysAsp: 3.306 ± 0.089
4.501LysGlu: 4.501 ± 0.097
1.472LysPhe: 1.472 ± 0.047
3.285LysGly: 3.285 ± 0.076
1.12LysHis: 1.12 ± 0.047
3.014LysIle: 3.014 ± 0.077
2.62LysLys: 2.62 ± 0.07
4.19LysLeu: 4.19 ± 0.075
1.239LysMet: 1.239 ± 0.047
2.021LysAsn: 2.021 ± 0.062
1.875LysPro: 1.875 ± 0.074
2.69LysGln: 2.69 ± 0.064
2.644LysArg: 2.644 ± 0.073
2.417LysSer: 2.417 ± 0.069
2.634LysThr: 2.634 ± 0.077
3.66LysVal: 3.66 ± 0.094
0.388LysTrp: 0.388 ± 0.027
2.037LysTyr: 2.037 ± 0.061
0.0LysXaa: 0.0 ± 0.0
Leu
8.892LeuAla: 8.892 ± 0.134
0.531LeuCys: 0.531 ± 0.03
5.755LeuAsp: 5.755 ± 0.107
7.183LeuGlu: 7.183 ± 0.129
3.667LeuPhe: 3.667 ± 0.09
6.268LeuGly: 6.268 ± 0.1
1.637LeuHis: 1.637 ± 0.056
6.6LeuIle: 6.6 ± 0.134
5.116LeuLys: 5.116 ± 0.088
9.288LeuLeu: 9.288 ± 0.191
2.597LeuMet: 2.597 ± 0.066
4.615LeuAsn: 4.615 ± 0.083
3.929LeuPro: 3.929 ± 0.075
3.562LeuGln: 3.562 ± 0.086
3.884LeuArg: 3.884 ± 0.083
6.609LeuSer: 6.609 ± 0.12
5.878LeuThr: 5.878 ± 0.099
6.56LeuVal: 6.56 ± 0.102
0.724LeuTrp: 0.724 ± 0.036
3.269LeuTyr: 3.269 ± 0.074
0.0LeuXaa: 0.0 ± 0.0
Met
2.178MetAla: 2.178 ± 0.063
0.093MetCys: 0.093 ± 0.012
1.62MetAsp: 1.62 ± 0.054
1.773MetGlu: 1.773 ± 0.055
0.724MetPhe: 0.724 ± 0.033
1.744MetGly: 1.744 ± 0.046
0.529MetHis: 0.529 ± 0.028
1.864MetIle: 1.864 ± 0.058
1.714MetLys: 1.714 ± 0.052
2.302MetLeu: 2.302 ± 0.063
0.724MetMet: 0.724 ± 0.034
1.384MetAsn: 1.384 ± 0.049
0.983MetPro: 0.983 ± 0.042
1.206MetGln: 1.206 ± 0.042
1.223MetArg: 1.223 ± 0.045
1.677MetSer: 1.677 ± 0.053
1.724MetThr: 1.724 ± 0.048
1.658MetVal: 1.658 ± 0.06
0.137MetTrp: 0.137 ± 0.016
0.756MetTyr: 0.756 ± 0.032
0.0MetXaa: 0.0 ± 0.0
Asn
2.898AsnAla: 2.898 ± 0.065
0.248AsnCys: 0.248 ± 0.02
2.284AsnAsp: 2.284 ± 0.07
3.108AsnGlu: 3.108 ± 0.077
1.826AsnPhe: 1.826 ± 0.059
2.84AsnGly: 2.84 ± 0.081
1.13AsnHis: 1.13 ± 0.042
3.225AsnIle: 3.225 ± 0.079
2.077AsnLys: 2.077 ± 0.071
4.281AsnLeu: 4.281 ± 0.095
1.07AsnMet: 1.07 ± 0.048
1.889AsnAsn: 1.889 ± 0.056
2.248AsnPro: 2.248 ± 0.075
2.485AsnGln: 2.485 ± 0.068
2.237AsnArg: 2.237 ± 0.053
2.055AsnSer: 2.055 ± 0.065
2.252AsnThr: 2.252 ± 0.055
2.962AsnVal: 2.962 ± 0.075
0.494AsnTrp: 0.494 ± 0.033
1.978AsnTyr: 1.978 ± 0.057
0.0AsnXaa: 0.0 ± 0.0
Pro
2.842ProAla: 2.842 ± 0.083
0.116ProCys: 0.116 ± 0.013
2.405ProAsp: 2.405 ± 0.064
3.667ProGlu: 3.667 ± 0.102
1.533ProPhe: 1.533 ± 0.049
2.352ProGly: 2.352 ± 0.058
0.787ProHis: 0.787 ± 0.035
2.64ProIle: 2.64 ± 0.065
1.783ProLys: 1.783 ± 0.056
3.185ProLeu: 3.185 ± 0.083
0.911ProMet: 0.911 ± 0.036
1.763ProAsn: 1.763 ± 0.072
0.735ProPro: 0.735 ± 0.034
1.44ProGln: 1.44 ± 0.055
1.147ProArg: 1.147 ± 0.043
2.211ProSer: 2.211 ± 0.06
2.273ProThr: 2.273 ± 0.083
2.923ProVal: 2.923 ± 0.104
0.311ProTrp: 0.311 ± 0.022
1.44ProTyr: 1.44 ± 0.052
0.0ProXaa: 0.0 ± 0.0
Gln
5.898GlnAla: 5.898 ± 0.141
0.153GlnCys: 0.153 ± 0.014
2.594GlnAsp: 2.594 ± 0.066
4.108GlnGlu: 4.108 ± 0.094
1.642GlnPhe: 1.642 ± 0.051
2.909GlnGly: 2.909 ± 0.077
0.865GlnHis: 0.865 ± 0.037
3.157GlnIle: 3.157 ± 0.085
2.335GlnLys: 2.335 ± 0.065
4.654GlnLeu: 4.654 ± 0.115
1.309GlnMet: 1.309 ± 0.04
1.718GlnAsn: 1.718 ± 0.051
1.565GlnPro: 1.565 ± 0.057
2.215GlnGln: 2.215 ± 0.074
2.27GlnArg: 2.27 ± 0.063
2.514GlnSer: 2.514 ± 0.064
2.769GlnThr: 2.769 ± 0.067
3.472GlnVal: 3.472 ± 0.08
0.488GlnTrp: 0.488 ± 0.03
1.577GlnTyr: 1.577 ± 0.046
0.0GlnXaa: 0.0 ± 0.0
Arg
2.713ArgAla: 2.713 ± 0.068
0.184ArgCys: 0.184 ± 0.016
2.379ArgAsp: 2.379 ± 0.068
3.385ArgGlu: 3.385 ± 0.072
2.025ArgPhe: 2.025 ± 0.054
2.438ArgGly: 2.438 ± 0.057
0.912ArgHis: 0.912 ± 0.039
3.366ArgIle: 3.366 ± 0.083
2.668ArgLys: 2.668 ± 0.073
4.628ArgLeu: 4.628 ± 0.093
1.238ArgMet: 1.238 ± 0.044
1.997ArgAsn: 1.997 ± 0.053
1.645ArgPro: 1.645 ± 0.049
2.557ArgGln: 2.557 ± 0.076
2.422ArgArg: 2.422 ± 0.059
2.315ArgSer: 2.315 ± 0.059
2.253ArgThr: 2.253 ± 0.057
2.927ArgVal: 2.927 ± 0.073
0.432ArgTrp: 0.432 ± 0.027
2.003ArgTyr: 2.003 ± 0.063
0.0ArgXaa: 0.0 ± 0.0
Ser
3.832SerAla: 3.832 ± 0.081
0.218SerCys: 0.218 ± 0.021
3.293SerAsp: 3.293 ± 0.071
4.062SerGlu: 4.062 ± 0.11
2.654SerPhe: 2.654 ± 0.07
4.392SerGly: 4.392 ± 0.089
1.366SerHis: 1.366 ± 0.051
4.212SerIle: 4.212 ± 0.092
2.812SerLys: 2.812 ± 0.065
5.963SerLeu: 5.963 ± 0.099
1.5SerMet: 1.5 ± 0.049
2.579SerAsn: 2.579 ± 0.076
2.004SerPro: 2.004 ± 0.058
3.182SerGln: 3.182 ± 0.078
2.591SerArg: 2.591 ± 0.071
3.362SerSer: 3.362 ± 0.078
3.054SerThr: 3.054 ± 0.076
3.894SerVal: 3.894 ± 0.086
0.531SerTrp: 0.531 ± 0.03
2.346SerTyr: 2.346 ± 0.06
0.0SerXaa: 0.0 ± 0.0
Thr
4.057ThrAla: 4.057 ± 0.102
0.251ThrCys: 0.251 ± 0.022
3.281ThrAsp: 3.281 ± 0.077
3.848ThrGlu: 3.848 ± 0.081
2.373ThrPhe: 2.373 ± 0.061
4.128ThrGly: 4.128 ± 0.091
1.225ThrHis: 1.225 ± 0.042
4.041ThrIle: 4.041 ± 0.088
2.548ThrLys: 2.548 ± 0.067
5.63ThrLeu: 5.63 ± 0.09
1.297ThrMet: 1.297 ± 0.047
2.507ThrAsn: 2.507 ± 0.068
2.494ThrPro: 2.494 ± 0.085
2.339ThrGln: 2.339 ± 0.067
2.034ThrArg: 2.034 ± 0.059
3.313ThrSer: 3.313 ± 0.076
2.796ThrThr: 2.796 ± 0.077
4.323ThrVal: 4.323 ± 0.099
0.483ThrTrp: 0.483 ± 0.027
2.497ThrTyr: 2.497 ± 0.065
0.0ThrXaa: 0.0 ± 0.0
Val
5.627ValAla: 5.627 ± 0.104
0.42ValCys: 0.42 ± 0.025
4.584ValAsp: 4.584 ± 0.114
5.394ValGlu: 5.394 ± 0.098
2.629ValPhe: 2.629 ± 0.075
4.542ValGly: 4.542 ± 0.092
1.347ValHis: 1.347 ± 0.048
5.332ValIle: 5.332 ± 0.093
3.466ValLys: 3.466 ± 0.083
6.171ValLeu: 6.171 ± 0.105
1.783ValMet: 1.783 ± 0.053
3.194ValAsn: 3.194 ± 0.066
2.691ValPro: 2.691 ± 0.075
2.883ValGln: 2.883 ± 0.072
2.762ValArg: 2.762 ± 0.064
4.657ValSer: 4.657 ± 0.089
4.279ValThr: 4.279 ± 0.114
4.889ValVal: 4.889 ± 0.1
0.542ValTrp: 0.542 ± 0.03
2.724ValTyr: 2.724 ± 0.067
0.0ValXaa: 0.0 ± 0.0
Trp
0.542TrpAla: 0.542 ± 0.031
0.056TrpCys: 0.056 ± 0.008
0.457TrpAsp: 0.457 ± 0.029
0.536TrpGlu: 0.536 ± 0.027
0.355TrpPhe: 0.355 ± 0.023
0.548TrpGly: 0.548 ± 0.027
0.209TrpHis: 0.209 ± 0.017
0.563TrpIle: 0.563 ± 0.029
0.363TrpLys: 0.363 ± 0.023
1.237TrpLeu: 1.237 ± 0.045
0.256TrpMet: 0.256 ± 0.018
0.43TrpAsn: 0.43 ± 0.029
0.261TrpPro: 0.261 ± 0.02
0.625TrpGln: 0.625 ± 0.031
0.432TrpArg: 0.432 ± 0.023
0.45TrpSer: 0.45 ± 0.028
0.455TrpThr: 0.455 ± 0.03
0.566TrpVal: 0.566 ± 0.031
0.108TrpTrp: 0.108 ± 0.013
0.315TrpTyr: 0.315 ± 0.024
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.809TyrAla: 2.809 ± 0.078
0.249TyrCys: 0.249 ± 0.018
2.62TyrAsp: 2.62 ± 0.073
2.849TyrGlu: 2.849 ± 0.072
1.82TyrPhe: 1.82 ± 0.061
2.612TyrGly: 2.612 ± 0.067
0.955TyrHis: 0.955 ± 0.04
2.688TyrIle: 2.688 ± 0.057
1.534TyrLys: 1.534 ± 0.052
4.034TyrLeu: 4.034 ± 0.09
0.796TyrMet: 0.796 ± 0.04
1.664TyrAsn: 1.664 ± 0.056
1.565TyrPro: 1.565 ± 0.06
2.648TyrGln: 2.648 ± 0.081
2.184TyrArg: 2.184 ± 0.059
2.2TyrSer: 2.2 ± 0.057
2.109TyrThr: 2.109 ± 0.061
2.501TyrVal: 2.501 ± 0.065
0.376TyrTrp: 0.376 ± 0.023
1.854TyrTyr: 1.854 ± 0.069
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2209 proteins (678505 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
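Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the 100 estimates is taken as the error. This is a minimal sketch, not the database's actual code. The function names are hypothetical, and using the standard deviation as the spread measure is an assumption, since the page does not say which statistic the ± values represent.

```python
import random
import statistics


def dipeptide_permille(proteins, first, second):
    """Frequency (per mille) of the ordered pair first+second (one-letter codes)."""
    hits = 0
    total = 0
    for seq in proteins:
        # Overlapping adjacent pairs within one protein.
        for a, b in zip(seq, seq[1:]):
            total += 1
            if a == first and b == second:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, first, second, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread of estimates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, first, second))
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(estimates)


# Toy sequences, not real proteome data.
toy = ["MALLAAK", "MKAALLE", "MAAAG", "MGLLK"]
point = dipeptide_permille(toy, "A", "A")
err = bootstrap_error(toy, "A", "A")
print(f"AlaAla: {point:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual dipeptides keeps residues from the same protein together, so the estimate reflects variation between proteins.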

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski