Amino acid dipeptide frequency for Agrococcus casei LMG 22410

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
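As a minimal sketch of how such a frequency could be computed: the function below counts overlapping pairs of adjacent residues within each protein and normalizes by the total number of pairs. The function name, the toy sequences, and the use of one-letter codes are assumptions made for illustration. Whether the database counts and normalizes in exactly this way is not stated on this page.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # A protein of length n contributes n - 1 overlapping dipeptides;
        # pairs never span two different proteins.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy proteome (one-letter codes), not taken from the database.
toy_proteome = ["MAAL", "MAV"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

In this toy input there are five dipeptides in total, so the pair MA, which occurs twice, gets a frequency of 400‰.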

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
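The arithmetic behind the expected value can be sketched as follows. The helper names are hypothetical, and the classification rule (a plain comparison against the uniform expectation) is an assumption about how the red and blue marking is decided. The three observed values are taken from the table on this page.

```python
def uniform_expectation_permille(alphabet_size):
    """Expected per mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"

expected_20 = uniform_expectation_permille(20)  # 400 dipeptides
expected_21 = uniform_expectation_permille(21)  # 441 dipeptides, with Xaa

# Values in per mille, copied from the table.
observed = {"AlaAla": 17.903, "CysCys": 0.031, "AspIle": 2.605}
labels = {pair: classify(value, expected_20) for pair, value in observed.items()}
print(expected_20, round(expected_21, 3), labels)
```

With 20 amino acids the expectation is 2.5‰, so AlaAla falls well above it and CysCys well below it.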

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
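A short worked conversion, using the AlaAla value from the table. The helper names are hypothetical and only illustrate the unit change.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

ala_ala = 17.903  # per mille, from the table
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)
print(f"{ala_ala} per mille = {fraction:.6f} as a fraction = {percent:.4f} %")
```

So 17.903‰ corresponds to a fraction of about 0.017903, or roughly 1.79% of all dipeptides.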

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.903 ± 0.231
AlaCys: 0.613 ± 0.029
AlaAsp: 8.401 ± 0.121
AlaGlu: 8.579 ± 0.128
AlaPhe: 3.82 ± 0.069
AlaGly: 10.837 ± 0.128
AlaHis: 2.433 ± 0.058
AlaIle: 5.827 ± 0.097
AlaLys: 3.492 ± 0.083
AlaLeu: 12.538 ± 0.176
AlaMet: 3.031 ± 0.055
AlaAsn: 2.437 ± 0.059
AlaPro: 5.399 ± 0.111
AlaGln: 3.966 ± 0.077
AlaArg: 7.625 ± 0.122
AlaSer: 7.519 ± 0.106
AlaThr: 6.777 ± 0.112
AlaVal: 10.675 ± 0.153
AlaTrp: 1.653 ± 0.055
AlaTyr: 2.186 ± 0.061
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.591 ± 0.028
CysCys: 0.031 ± 0.006
CysAsp: 0.355 ± 0.023
CysGlu: 0.274 ± 0.022
CysPhe: 0.16 ± 0.013
CysGly: 0.534 ± 0.027
CysHis: 0.11 ± 0.011
CysIle: 0.23 ± 0.017
CysLys: 0.085 ± 0.01
CysLeu: 0.377 ± 0.021
CysMet: 0.086 ± 0.01
CysAsn: 0.117 ± 0.012
CysPro: 0.236 ± 0.017
CysGln: 0.127 ± 0.014
CysArg: 0.292 ± 0.02
CysSer: 0.341 ± 0.021
CysThr: 0.314 ± 0.02
CysVal: 0.41 ± 0.023
CysTrp: 0.07 ± 0.01
CysTyr: 0.087 ± 0.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.928 ± 0.123
AspCys: 0.253 ± 0.018
AspAsp: 4.584 ± 0.092
AspGlu: 5.093 ± 0.092
AspPhe: 1.943 ± 0.054
AspGly: 5.788 ± 0.097
AspHis: 1.152 ± 0.041
AspIle: 2.605 ± 0.06
AspLys: 1.177 ± 0.047
AspLeu: 5.617 ± 0.088
AspMet: 1.078 ± 0.031
AspAsn: 1.137 ± 0.045
AspPro: 3.607 ± 0.083
AspGln: 1.868 ± 0.049
AspArg: 4.249 ± 0.08
AspSer: 3.303 ± 0.069
AspThr: 3.1 ± 0.065
AspVal: 5.161 ± 0.096
AspTrp: 0.981 ± 0.033
AspTyr: 1.349 ± 0.047
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.065 ± 0.122
GluCys: 0.273 ± 0.022
GluAsp: 2.97 ± 0.069
GluGlu: 3.472 ± 0.086
GluPhe: 2.106 ± 0.052
GluGly: 4.629 ± 0.092
GluHis: 1.959 ± 0.052
GluIle: 3.174 ± 0.066
GluLys: 1.479 ± 0.052
GluLeu: 7.023 ± 0.099
GluMet: 1.207 ± 0.038
GluAsn: 1.42 ± 0.043
GluPro: 3.185 ± 0.077
GluGln: 3.274 ± 0.072
GluArg: 5.321 ± 0.106
GluSer: 3.785 ± 0.068
GluThr: 3.681 ± 0.075
GluVal: 4.828 ± 0.082
GluTrp: 0.956 ± 0.035
GluTyr: 1.295 ± 0.042
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.235 ± 0.076
PheCys: 0.166 ± 0.014
PheAsp: 2.623 ± 0.055
PheGlu: 2.232 ± 0.053
PhePhe: 1.084 ± 0.044
PheGly: 3.535 ± 0.081
PheHis: 0.538 ± 0.026
PheIle: 1.359 ± 0.041
PheLys: 0.691 ± 0.03
PheLeu: 2.465 ± 0.071
PheMet: 0.661 ± 0.029
PheAsn: 0.741 ± 0.029
PhePro: 1.177 ± 0.041
PheGln: 0.783 ± 0.029
PheArg: 1.475 ± 0.045
PheSer: 1.979 ± 0.045
PheThr: 2.179 ± 0.05
PheVal: 2.834 ± 0.066
PheTrp: 0.459 ± 0.027
PheTyr: 0.59 ± 0.03
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.555 ± 0.13
GlyCys: 0.553 ± 0.025
GlyAsp: 5.142 ± 0.097
GlyGlu: 5.369 ± 0.105
GlyPhe: 3.266 ± 0.065
GlyGly: 7.223 ± 0.13
GlyHis: 1.772 ± 0.045
GlyIle: 4.948 ± 0.095
GlyLys: 2.482 ± 0.063
GlyLeu: 8.222 ± 0.132
GlyMet: 2.155 ± 0.058
GlyAsn: 1.83 ± 0.053
GlyPro: 3.13 ± 0.061
GlyGln: 2.829 ± 0.07
GlyArg: 5.652 ± 0.094
GlySer: 5.41 ± 0.09
GlyThr: 5.278 ± 0.102
GlyVal: 7.432 ± 0.117
GlyTrp: 1.508 ± 0.051
GlyTyr: 2.126 ± 0.054
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.309 ± 0.066
HisCys: 0.126 ± 0.012
HisAsp: 1.48 ± 0.042
HisGlu: 1.479 ± 0.044
HisPhe: 0.621 ± 0.03
HisGly: 2.046 ± 0.052
HisHis: 0.541 ± 0.023
HisIle: 0.91 ± 0.035
HisLys: 0.354 ± 0.022
HisLeu: 1.892 ± 0.057
HisMet: 0.405 ± 0.025
HisAsn: 0.427 ± 0.025
HisPro: 1.428 ± 0.046
HisGln: 0.593 ± 0.026
HisArg: 1.532 ± 0.05
HisSer: 1.196 ± 0.036
HisThr: 1.131 ± 0.039
HisVal: 1.685 ± 0.041
HisTrp: 0.347 ± 0.02
HisTyr: 0.45 ± 0.023
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.209 ± 0.1
IleCys: 0.272 ± 0.019
IleAsp: 3.979 ± 0.064
IleGlu: 3.692 ± 0.069
IlePhe: 1.194 ± 0.039
IleGly: 4.95 ± 0.092
IleHis: 0.74 ± 0.03
IleIle: 2.236 ± 0.063
IleLys: 0.949 ± 0.038
IleLeu: 3.617 ± 0.075
IleMet: 0.904 ± 0.037
IleAsn: 1.022 ± 0.038
IlePro: 2.302 ± 0.059
IleGln: 1.079 ± 0.038
IleArg: 3.005 ± 0.058
IleSer: 2.694 ± 0.065
IleThr: 3.075 ± 0.064
IleVal: 4.899 ± 0.084
IleTrp: 0.574 ± 0.029
IleTyr: 0.775 ± 0.035
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.892 ± 0.069
LysCys: 0.082 ± 0.011
LysAsp: 1.124 ± 0.041
LysGlu: 1.152 ± 0.037
LysPhe: 0.643 ± 0.031
LysGly: 1.867 ± 0.049
LysHis: 0.65 ± 0.032
LysIle: 1.003 ± 0.035
LysLys: 0.923 ± 0.039
LysLeu: 2.365 ± 0.06
LysMet: 0.505 ± 0.026
LysAsn: 0.709 ± 0.033
LysPro: 1.52 ± 0.05
LysGln: 1.088 ± 0.043
LysArg: 2.173 ± 0.053
LysSer: 1.678 ± 0.052
LysThr: 1.685 ± 0.056
LysVal: 1.857 ± 0.057
LysTrp: 0.32 ± 0.023
LysTyr: 0.458 ± 0.023
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.873 ± 0.169
LeuCys: 0.468 ± 0.023
LeuAsp: 6.245 ± 0.091
LeuGlu: 5.943 ± 0.09
LeuPhe: 2.837 ± 0.073
LeuGly: 8.569 ± 0.13
LeuHis: 1.9 ± 0.051
LeuIle: 4.605 ± 0.097
LeuLys: 2.173 ± 0.059
LeuLeu: 9.712 ± 0.181
LeuMet: 2.031 ± 0.056
LeuAsn: 1.964 ± 0.053
LeuPro: 4.933 ± 0.082
LeuGln: 3.327 ± 0.071
LeuArg: 6.557 ± 0.105
LeuSer: 5.838 ± 0.078
LeuThr: 6.066 ± 0.084
LeuVal: 8.489 ± 0.111
LeuTrp: 1.217 ± 0.045
LeuTyr: 1.495 ± 0.04
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.388 ± 0.064
MetCys: 0.102 ± 0.011
MetAsp: 0.802 ± 0.03
MetGlu: 0.824 ± 0.031
MetPhe: 0.668 ± 0.03
MetGly: 1.463 ± 0.043
MetHis: 0.564 ± 0.025
MetIle: 1.066 ± 0.036
MetLys: 0.568 ± 0.029
MetLeu: 2.564 ± 0.063
MetMet: 0.445 ± 0.03
MetAsn: 0.695 ± 0.025
MetPro: 1.358 ± 0.044
MetGln: 0.974 ± 0.031
MetArg: 1.848 ± 0.046
MetSer: 1.614 ± 0.042
MetThr: 1.891 ± 0.042
MetVal: 1.566 ± 0.037
MetTrp: 0.248 ± 0.016
MetTyr: 0.341 ± 0.019
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.785 ± 0.058
AsnCys: 0.11 ± 0.011
AsnAsp: 1.422 ± 0.055
AsnGlu: 1.434 ± 0.04
AsnPhe: 0.593 ± 0.025
AsnGly: 1.984 ± 0.054
AsnHis: 0.413 ± 0.022
AsnIle: 1.008 ± 0.035
AsnLys: 0.528 ± 0.027
AsnLeu: 1.871 ± 0.048
AsnMet: 0.46 ± 0.026
AsnAsn: 0.559 ± 0.028
AsnPro: 1.634 ± 0.053
AsnGln: 0.637 ± 0.028
AsnArg: 1.456 ± 0.043
AsnSer: 1.204 ± 0.042
AsnThr: 1.263 ± 0.049
AsnVal: 1.764 ± 0.05
AsnTrp: 0.353 ± 0.018
AsnTyr: 0.498 ± 0.027
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.861 ± 0.105
ProCys: 0.16 ± 0.015
ProAsp: 3.446 ± 0.068
ProGlu: 3.994 ± 0.075
ProPhe: 1.564 ± 0.047
ProGly: 4.263 ± 0.086
ProHis: 1.088 ± 0.035
ProIle: 2.213 ± 0.052
ProLys: 1.428 ± 0.045
ProLeu: 4.248 ± 0.079
ProMet: 1.054 ± 0.035
ProAsn: 1.213 ± 0.037
ProPro: 1.809 ± 0.064
ProGln: 1.639 ± 0.055
ProArg: 2.521 ± 0.058
ProSer: 2.973 ± 0.063
ProThr: 2.982 ± 0.072
ProVal: 4.312 ± 0.086
ProTrp: 0.743 ± 0.033
ProTyr: 0.923 ± 0.033
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.967 ± 0.074
GlnCys: 0.139 ± 0.013
GlnAsp: 1.327 ± 0.042
GlnGlu: 1.623 ± 0.049
GlnPhe: 1.101 ± 0.037
GlnGly: 2.33 ± 0.057
GlnHis: 0.941 ± 0.034
GlnIle: 1.827 ± 0.053
GlnLys: 0.864 ± 0.033
GlnLeu: 4.105 ± 0.07
GlnMet: 0.767 ± 0.032
GlnAsn: 0.806 ± 0.038
GlnPro: 1.946 ± 0.072
GlnGln: 2.051 ± 0.071
GlnArg: 2.846 ± 0.067
GlnSer: 1.949 ± 0.058
GlnThr: 1.857 ± 0.047
GlnVal: 2.677 ± 0.062
GlnTrp: 0.479 ± 0.024
GlnTyr: 0.764 ± 0.029
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.598 ± 0.114
ArgCys: 0.247 ± 0.018
ArgAsp: 4.169 ± 0.08
ArgGlu: 4.693 ± 0.093
ArgPhe: 2.303 ± 0.059
ArgGly: 4.904 ± 0.09
ArgHis: 1.408 ± 0.045
ArgIle: 3.939 ± 0.07
ArgLys: 1.888 ± 0.054
ArgLeu: 6.615 ± 0.115
ArgMet: 1.903 ± 0.048
ArgAsn: 1.344 ± 0.048
ArgPro: 2.884 ± 0.06
ArgGln: 2.303 ± 0.058
ArgArg: 5.277 ± 0.105
ArgSer: 3.897 ± 0.07
ArgThr: 3.763 ± 0.071
ArgVal: 5.478 ± 0.089
ArgTrp: 1.003 ± 0.039
ArgTyr: 1.431 ± 0.038
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.105 ± 0.096
SerCys: 0.296 ± 0.021
SerAsp: 3.751 ± 0.07
SerGlu: 3.408 ± 0.071
SerPhe: 2.035 ± 0.045
SerGly: 5.788 ± 0.102
SerHis: 1.169 ± 0.038
SerIle: 3.129 ± 0.069
SerLys: 1.598 ± 0.048
SerLeu: 5.644 ± 0.093
SerMet: 1.564 ± 0.046
SerAsn: 1.381 ± 0.04
SerPro: 2.841 ± 0.063
SerGln: 1.908 ± 0.054
SerArg: 3.771 ± 0.074
SerSer: 3.78 ± 0.073
SerThr: 3.584 ± 0.064
SerVal: 5.089 ± 0.089
SerTrp: 0.875 ± 0.033
SerTyr: 1.196 ± 0.038
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.162 ± 0.128
ThrCys: 0.276 ± 0.021
ThrAsp: 3.89 ± 0.079
ThrGlu: 3.682 ± 0.08
ThrPhe: 1.805 ± 0.05
ThrGly: 5.753 ± 0.097
ThrHis: 1.142 ± 0.035
ThrIle: 2.985 ± 0.07
ThrLys: 1.416 ± 0.045
ThrLeu: 5.939 ± 0.103
ThrMet: 1.213 ± 0.042
ThrAsn: 1.303 ± 0.046
ThrPro: 3.451 ± 0.067
ThrGln: 1.813 ± 0.056
ThrArg: 3.381 ± 0.068
ThrSer: 3.329 ± 0.063
ThrThr: 3.668 ± 0.085
ThrVal: 5.854 ± 0.113
ThrTrp: 0.806 ± 0.032
ThrTyr: 1.137 ± 0.042
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.581 ± 0.146
ValCys: 0.447 ± 0.022
ValAsp: 5.324 ± 0.099
ValGlu: 5.286 ± 0.094
ValPhe: 2.865 ± 0.071
ValGly: 6.645 ± 0.108
ValHis: 1.77 ± 0.051
ValIle: 4.562 ± 0.071
ValLys: 1.832 ± 0.051
ValLeu: 8.885 ± 0.122
ValMet: 1.782 ± 0.046
ValAsn: 1.946 ± 0.061
ValPro: 4.017 ± 0.069
ValGln: 2.79 ± 0.057
ValArg: 5.428 ± 0.093
ValSer: 5.206 ± 0.089
ValThr: 5.786 ± 0.106
ValVal: 7.768 ± 0.129
ValTrp: 1.113 ± 0.043
ValTyr: 1.42 ± 0.041
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.449 ± 0.04
TrpCys: 0.089 ± 0.01
TrpAsp: 0.627 ± 0.029
TrpGlu: 0.63 ± 0.028
TrpPhe: 0.535 ± 0.028
TrpGly: 0.943 ± 0.041
TrpHis: 0.382 ± 0.022
TrpIle: 0.801 ± 0.033
TrpLys: 0.352 ± 0.024
TrpLeu: 1.697 ± 0.052
TrpMet: 0.407 ± 0.02
TrpAsn: 0.453 ± 0.025
TrpPro: 0.683 ± 0.031
TrpGln: 0.731 ± 0.03
TrpArg: 1.118 ± 0.037
TrpSer: 0.967 ± 0.037
TrpThr: 0.772 ± 0.029
TrpVal: 1.14 ± 0.04
TrpTrp: 0.312 ± 0.019
TrpTyr: 0.211 ± 0.018
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.252 ± 0.049
TyrCys: 0.126 ± 0.013
TyrAsp: 1.309 ± 0.042
TyrGlu: 1.246 ± 0.043
TyrPhe: 0.662 ± 0.029
TyrGly: 1.83 ± 0.058
TyrHis: 0.283 ± 0.02
TyrIle: 0.738 ± 0.031
TyrLys: 0.416 ± 0.024
TyrLeu: 1.844 ± 0.052
TyrMet: 0.33 ± 0.018
TyrAsn: 0.474 ± 0.025
TyrPro: 0.892 ± 0.033
TyrGln: 0.586 ± 0.028
TyrArg: 1.52 ± 0.046
TyrSer: 1.251 ± 0.042
TyrThr: 1.153 ± 0.049
TyrVal: 1.538 ± 0.043
TyrTrp: 0.282 ± 0.017
TyrTyr: 0.441 ± 0.026
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2698 proteins (827534 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
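A minimal sketch of what a protein-level bootstrap could look like: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the reported error are all assumptions. The page states only that bootstrapping (x100) was done at the protein level.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of `pair`."""
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Hypothetical toy proteome (one-letter codes), not taken from the database.
toy_proteome = ["MAAL", "MAVAA", "MKLAA", "MAAAA"]
mean_aa, err_aa = bootstrap_error(toy_proteome, "AA")
print(f"AA: {mean_aa:.3f} ± {err_aa:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.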

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski