Amino acid dipepetide frequency for Aeromicrobium sp. 592

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.2AlaAla: 18.2 ± 0.183
0.877AlaCys: 0.877 ± 0.029
8.247AlaAsp: 8.247 ± 0.077
7.608AlaGlu: 7.608 ± 0.092
3.693AlaPhe: 3.693 ± 0.057
11.743AlaGly: 11.743 ± 0.111
2.415AlaHis: 2.415 ± 0.047
5.241AlaIle: 5.241 ± 0.075
3.035AlaLys: 3.035 ± 0.077
12.706AlaLeu: 12.706 ± 0.129
2.852AlaMet: 2.852 ± 0.054
2.085AlaAsn: 2.085 ± 0.048
5.556AlaPro: 5.556 ± 0.093
3.633AlaGln: 3.633 ± 0.068
8.708AlaArg: 8.708 ± 0.103
6.66AlaSer: 6.66 ± 0.085
7.484AlaThr: 7.484 ± 0.086
11.359AlaVal: 11.359 ± 0.13
1.744AlaTrp: 1.744 ± 0.042
2.639AlaTyr: 2.639 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.781CysAla: 0.781 ± 0.025
0.047CysCys: 0.047 ± 0.008
0.412CysAsp: 0.412 ± 0.019
0.342CysGlu: 0.342 ± 0.018
0.202CysPhe: 0.202 ± 0.015
0.75CysGly: 0.75 ± 0.027
0.194CysHis: 0.194 ± 0.013
0.201CysIle: 0.201 ± 0.015
0.105CysLys: 0.105 ± 0.01
0.594CysLeu: 0.594 ± 0.027
0.112CysMet: 0.112 ± 0.011
0.129CysAsn: 0.129 ± 0.013
0.386CysPro: 0.386 ± 0.018
0.158CysGln: 0.158 ± 0.012
0.482CysArg: 0.482 ± 0.021
0.391CysSer: 0.391 ± 0.017
0.35CysThr: 0.35 ± 0.018
0.526CysVal: 0.526 ± 0.019
0.078CysTrp: 0.078 ± 0.008
0.12CysTyr: 0.12 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
8.125AspAla: 8.125 ± 0.101
0.312AspCys: 0.312 ± 0.016
4.828AspAsp: 4.828 ± 0.073
4.644AspGlu: 4.644 ± 0.081
1.756AspPhe: 1.756 ± 0.039
6.319AspGly: 6.319 ± 0.092
1.482AspHis: 1.482 ± 0.034
2.397AspIle: 2.397 ± 0.043
1.422AspLys: 1.422 ± 0.046
7.122AspLeu: 7.122 ± 0.08
0.927AspMet: 0.927 ± 0.027
1.066AspAsn: 1.066 ± 0.034
4.29AspPro: 4.29 ± 0.074
1.837AspGln: 1.837 ± 0.046
4.897AspArg: 4.897 ± 0.067
2.797AspSer: 2.797 ± 0.051
3.018AspThr: 3.018 ± 0.049
6.584AspVal: 6.584 ± 0.087
0.945AspTrp: 0.945 ± 0.029
1.228AspTyr: 1.228 ± 0.034
0.0AspXaa: 0.0 ± 0.0
Glu
6.731GluAla: 6.731 ± 0.098
0.293GluCys: 0.293 ± 0.018
3.102GluAsp: 3.102 ± 0.056
2.931GluGlu: 2.931 ± 0.065
1.512GluPhe: 1.512 ± 0.042
4.208GluGly: 4.208 ± 0.066
1.546GluHis: 1.546 ± 0.039
2.699GluIle: 2.699 ± 0.06
1.498GluLys: 1.498 ± 0.045
6.403GluLeu: 6.403 ± 0.086
1.079GluMet: 1.079 ± 0.034
0.998GluAsn: 0.998 ± 0.028
3.007GluPro: 3.007 ± 0.059
2.263GluGln: 2.263 ± 0.046
4.921GluArg: 4.921 ± 0.08
2.902GluSer: 2.902 ± 0.053
3.031GluThr: 3.031 ± 0.053
4.865GluVal: 4.865 ± 0.071
0.719GluTrp: 0.719 ± 0.028
0.979GluTyr: 0.979 ± 0.028
0.0GluXaa: 0.0 ± 0.0
Phe
3.885PheAla: 3.885 ± 0.059
0.245PheCys: 0.245 ± 0.015
2.227PheAsp: 2.227 ± 0.046
1.584PheGlu: 1.584 ± 0.035
0.965PhePhe: 0.965 ± 0.034
3.145PheGly: 3.145 ± 0.065
0.586PheHis: 0.586 ± 0.021
1.004PheIle: 1.004 ± 0.032
0.595PheLys: 0.595 ± 0.023
2.605PheLeu: 2.605 ± 0.047
0.485PheMet: 0.485 ± 0.02
0.612PheAsn: 0.612 ± 0.021
1.292PhePro: 1.292 ± 0.034
0.658PheGln: 0.658 ± 0.029
1.741PheArg: 1.741 ± 0.041
1.706PheSer: 1.706 ± 0.039
1.971PheThr: 1.971 ± 0.045
2.847PheVal: 2.847 ± 0.062
0.435PheTrp: 0.435 ± 0.021
0.681PheTyr: 0.681 ± 0.025
0.0PheXaa: 0.0 ± 0.0
Gly
10.036GlyAla: 10.036 ± 0.112
0.653GlyCys: 0.653 ± 0.028
5.455GlyAsp: 5.455 ± 0.073
4.592GlyGlu: 4.592 ± 0.065
3.118GlyPhe: 3.118 ± 0.052
7.695GlyGly: 7.695 ± 0.102
2.05GlyHis: 2.05 ± 0.046
4.219GlyIle: 4.219 ± 0.064
2.43GlyLys: 2.43 ± 0.055
9.211GlyLeu: 9.211 ± 0.105
1.937GlyMet: 1.937 ± 0.043
1.698GlyAsn: 1.698 ± 0.048
4.261GlyPro: 4.261 ± 0.065
2.613GlyGln: 2.613 ± 0.05
6.947GlyArg: 6.947 ± 0.081
5.627GlySer: 5.627 ± 0.071
5.917GlyThr: 5.917 ± 0.087
8.0GlyVal: 8.0 ± 0.093
1.578GlyTrp: 1.578 ± 0.039
2.086GlyTyr: 2.086 ± 0.041
0.0GlyXaa: 0.0 ± 0.0
His
2.344HisAla: 2.344 ± 0.049
0.146HisCys: 0.146 ± 0.012
1.609HisAsp: 1.609 ± 0.045
1.316HisGlu: 1.316 ± 0.041
0.588HisPhe: 0.588 ± 0.026
2.194HisGly: 2.194 ± 0.045
0.676HisHis: 0.676 ± 0.026
0.611HisIle: 0.611 ± 0.025
0.317HisLys: 0.317 ± 0.016
2.355HisLeu: 2.355 ± 0.044
0.342HisMet: 0.342 ± 0.018
0.352HisAsn: 0.352 ± 0.02
1.478HisPro: 1.478 ± 0.042
0.65HisGln: 0.65 ± 0.023
1.804HisArg: 1.804 ± 0.037
0.924HisSer: 0.924 ± 0.028
1.133HisThr: 1.133 ± 0.032
2.089HisVal: 2.089 ± 0.043
0.294HisTrp: 0.294 ± 0.016
0.398HisTyr: 0.398 ± 0.022
0.0HisXaa: 0.0 ± 0.0
Ile
5.982IleAla: 5.982 ± 0.08
0.322IleCys: 0.322 ± 0.016
3.271IleAsp: 3.271 ± 0.054
2.71IleGlu: 2.71 ± 0.046
0.999IlePhe: 0.999 ± 0.035
4.579IleGly: 4.579 ± 0.077
0.707IleHis: 0.707 ± 0.025
1.527IleIle: 1.527 ± 0.042
0.952IleLys: 0.952 ± 0.034
3.17IleLeu: 3.17 ± 0.053
0.594IleMet: 0.594 ± 0.028
0.948IleAsn: 0.948 ± 0.028
1.957IlePro: 1.957 ± 0.043
0.931IleGln: 0.931 ± 0.028
2.451IleArg: 2.451 ± 0.047
2.332IleSer: 2.332 ± 0.047
2.577IleThr: 2.577 ± 0.05
4.083IleVal: 4.083 ± 0.063
0.459IleTrp: 0.459 ± 0.02
0.67IleTyr: 0.67 ± 0.026
0.0IleXaa: 0.0 ± 0.0
Lys
3.003LysAla: 3.003 ± 0.063
0.099LysCys: 0.099 ± 0.009
1.572LysAsp: 1.572 ± 0.043
1.117LysGlu: 1.117 ± 0.036
0.542LysPhe: 0.542 ± 0.024
2.042LysGly: 2.042 ± 0.053
0.489LysHis: 0.489 ± 0.025
1.146LysIle: 1.146 ± 0.036
1.081LysLys: 1.081 ± 0.045
1.989LysLeu: 1.989 ± 0.047
0.452LysMet: 0.452 ± 0.021
0.606LysAsn: 0.606 ± 0.03
1.373LysPro: 1.373 ± 0.036
0.743LysGln: 0.743 ± 0.028
1.55LysArg: 1.55 ± 0.044
1.34LysSer: 1.34 ± 0.038
1.469LysThr: 1.469 ± 0.046
2.201LysVal: 2.201 ± 0.062
0.25LysTrp: 0.25 ± 0.016
0.456LysTyr: 0.456 ± 0.02
0.0LysXaa: 0.0 ± 0.0
Leu
14.021LeuAla: 14.021 ± 0.142
0.616LeuCys: 0.616 ± 0.026
7.245LeuAsp: 7.245 ± 0.094
5.249LeuGlu: 5.249 ± 0.073
2.643LeuPhe: 2.643 ± 0.052
9.22LeuGly: 9.22 ± 0.107
2.015LeuHis: 2.015 ± 0.045
3.751LeuIle: 3.751 ± 0.069
2.118LeuLys: 2.118 ± 0.053
10.149LeuLeu: 10.149 ± 0.135
1.752LeuMet: 1.752 ± 0.04
1.633LeuAsn: 1.633 ± 0.042
5.303LeuPro: 5.303 ± 0.065
2.564LeuGln: 2.564 ± 0.047
7.18LeuArg: 7.18 ± 0.091
5.634LeuSer: 5.634 ± 0.07
6.47LeuThr: 6.47 ± 0.085
10.151LeuVal: 10.151 ± 0.114
1.071LeuTrp: 1.071 ± 0.033
1.535LeuTyr: 1.535 ± 0.039
0.0LeuXaa: 0.0 ± 0.0
Met
2.439MetAla: 2.439 ± 0.047
0.121MetCys: 0.121 ± 0.01
0.948MetAsp: 0.948 ± 0.026
0.727MetGlu: 0.727 ± 0.026
0.534MetPhe: 0.534 ± 0.023
1.462MetGly: 1.462 ± 0.038
0.42MetHis: 0.42 ± 0.02
0.852MetIle: 0.852 ± 0.026
0.498MetLys: 0.498 ± 0.023
1.863MetLeu: 1.863 ± 0.04
0.362MetMet: 0.362 ± 0.02
0.459MetAsn: 0.459 ± 0.021
1.088MetPro: 1.088 ± 0.035
0.563MetGln: 0.563 ± 0.025
1.503MetArg: 1.503 ± 0.041
1.564MetSer: 1.564 ± 0.033
1.884MetThr: 1.884 ± 0.048
1.506MetVal: 1.506 ± 0.038
0.22MetTrp: 0.22 ± 0.014
0.312MetTyr: 0.312 ± 0.016
0.0MetXaa: 0.0 ± 0.0
Asn
2.308AsnAla: 2.308 ± 0.046
0.113AsnCys: 0.113 ± 0.01
1.178AsnAsp: 1.178 ± 0.029
0.885AsnGlu: 0.885 ± 0.03
0.51AsnPhe: 0.51 ± 0.02
1.841AsnGly: 1.841 ± 0.043
0.416AsnHis: 0.416 ± 0.018
0.794AsnIle: 0.794 ± 0.031
0.446AsnLys: 0.446 ± 0.024
1.875AsnLeu: 1.875 ± 0.048
0.328AsnMet: 0.328 ± 0.016
0.44AsnAsn: 0.44 ± 0.021
1.394AsnPro: 1.394 ± 0.036
0.538AsnGln: 0.538 ± 0.021
1.275AsnArg: 1.275 ± 0.036
0.85AsnSer: 0.85 ± 0.029
1.025AsnThr: 1.025 ± 0.028
1.643AsnVal: 1.643 ± 0.039
0.252AsnTrp: 0.252 ± 0.017
0.395AsnTyr: 0.395 ± 0.019
0.0AsnXaa: 0.0 ± 0.0
Pro
6.461ProAla: 6.461 ± 0.08
0.242ProCys: 0.242 ± 0.017
4.209ProAsp: 4.209 ± 0.061
3.581ProGlu: 3.581 ± 0.058
1.592ProPhe: 1.592 ± 0.038
5.11ProGly: 5.11 ± 0.069
1.196ProHis: 1.196 ± 0.034
1.782ProIle: 1.782 ± 0.036
1.146ProLys: 1.146 ± 0.039
4.648ProLeu: 4.648 ± 0.06
1.011ProMet: 1.011 ± 0.029
0.852ProAsn: 0.852 ± 0.03
2.478ProPro: 2.478 ± 0.065
1.526ProGln: 1.526 ± 0.035
3.53ProArg: 3.53 ± 0.07
3.192ProSer: 3.192 ± 0.055
3.372ProThr: 3.372 ± 0.058
4.962ProVal: 4.962 ± 0.057
0.879ProTrp: 0.879 ± 0.028
1.101ProTyr: 1.101 ± 0.031
0.0ProXaa: 0.0 ± 0.0
Gln
3.573GlnAla: 3.573 ± 0.056
0.135GlnCys: 0.135 ± 0.013
1.511GlnAsp: 1.511 ± 0.035
1.336GlnGlu: 1.336 ± 0.039
0.754GlnPhe: 0.754 ± 0.024
2.105GlnGly: 2.105 ± 0.039
0.713GlnHis: 0.713 ± 0.025
1.351GlnIle: 1.351 ± 0.031
0.691GlnLys: 0.691 ± 0.03
3.162GlnLeu: 3.162 ± 0.059
0.631GlnMet: 0.631 ± 0.026
0.518GlnAsn: 0.518 ± 0.021
1.491GlnPro: 1.491 ± 0.039
1.162GlnGln: 1.162 ± 0.036
2.42GlnArg: 2.42 ± 0.05
1.413GlnSer: 1.413 ± 0.035
1.506GlnThr: 1.506 ± 0.035
2.829GlnVal: 2.829 ± 0.053
0.482GlnTrp: 0.482 ± 0.021
0.559GlnTyr: 0.559 ± 0.022
0.0GlnXaa: 0.0 ± 0.0
Arg
8.283ArgAla: 8.283 ± 0.104
0.432ArgCys: 0.432 ± 0.021
4.439ArgAsp: 4.439 ± 0.064
4.283ArgGlu: 4.283 ± 0.069
2.227ArgPhe: 2.227 ± 0.044
5.44ArgGly: 5.44 ± 0.07
1.761ArgHis: 1.761 ± 0.044
3.318ArgIle: 3.318 ± 0.045
1.558ArgLys: 1.558 ± 0.04
7.651ArgLeu: 7.651 ± 0.093
1.641ArgMet: 1.641 ± 0.033
1.328ArgAsn: 1.328 ± 0.035
4.035ArgPro: 4.035 ± 0.067
2.191ArgGln: 2.191 ± 0.048
6.81ArgArg: 6.81 ± 0.118
4.487ArgSer: 4.487 ± 0.069
4.925ArgThr: 4.925 ± 0.065
5.934ArgVal: 5.934 ± 0.074
1.224ArgTrp: 1.224 ± 0.036
1.482ArgTyr: 1.482 ± 0.039
0.0ArgXaa: 0.0 ± 0.0
Ser
6.801SerAla: 6.801 ± 0.09
0.355SerCys: 0.355 ± 0.021
3.261SerAsp: 3.261 ± 0.057
2.573SerGlu: 2.573 ± 0.052
1.84SerPhe: 1.84 ± 0.043
5.76SerGly: 5.76 ± 0.082
1.117SerHis: 1.117 ± 0.033
2.378SerIle: 2.378 ± 0.05
1.295SerLys: 1.295 ± 0.045
5.414SerLeu: 5.414 ± 0.072
1.41SerMet: 1.41 ± 0.035
1.064SerAsn: 1.064 ± 0.034
3.106SerPro: 3.106 ± 0.053
1.462SerGln: 1.462 ± 0.041
3.988SerArg: 3.988 ± 0.065
3.839SerSer: 3.839 ± 0.08
3.906SerThr: 3.906 ± 0.071
4.791SerVal: 4.791 ± 0.067
0.96SerTrp: 0.96 ± 0.027
1.315SerTyr: 1.315 ± 0.035
0.0SerXaa: 0.0 ± 0.0
Thr
7.493ThrAla: 7.493 ± 0.086
0.405ThrCys: 0.405 ± 0.021
3.789ThrAsp: 3.789 ± 0.061
3.014ThrGlu: 3.014 ± 0.051
2.086ThrPhe: 2.086 ± 0.05
5.844ThrGly: 5.844 ± 0.079
1.221ThrHis: 1.221 ± 0.031
2.913ThrIle: 2.913 ± 0.056
1.505ThrLys: 1.505 ± 0.037
5.767ThrLeu: 5.767 ± 0.075
1.203ThrMet: 1.203 ± 0.031
1.21ThrAsn: 1.21 ± 0.036
3.77ThrPro: 3.77 ± 0.058
1.495ThrGln: 1.495 ± 0.04
3.87ThrArg: 3.87 ± 0.064
3.904ThrSer: 3.904 ± 0.066
4.338ThrThr: 4.338 ± 0.075
6.067ThrVal: 6.067 ± 0.086
1.022ThrTrp: 1.022 ± 0.034
1.572ThrTyr: 1.572 ± 0.044
0.0ThrXaa: 0.0 ± 0.0
Val
11.984ValAla: 11.984 ± 0.122
0.671ValCys: 0.671 ± 0.026
6.32ValAsp: 6.32 ± 0.085
5.447ValGlu: 5.447 ± 0.085
2.535ValPhe: 2.535 ± 0.049
7.642ValGly: 7.642 ± 0.085
1.939ValHis: 1.939 ± 0.041
3.848ValIle: 3.848 ± 0.059
2.05ValLys: 2.05 ± 0.055
9.977ValLeu: 9.977 ± 0.115
1.701ValMet: 1.701 ± 0.047
1.818ValAsn: 1.818 ± 0.043
4.983ValPro: 4.983 ± 0.058
2.261ValGln: 2.261 ± 0.039
6.64ValArg: 6.64 ± 0.086
5.054ValSer: 5.054 ± 0.062
6.109ValThr: 6.109 ± 0.091
10.055ValVal: 10.055 ± 0.117
1.113ValTrp: 1.113 ± 0.03
1.533ValTyr: 1.533 ± 0.039
0.0ValXaa: 0.0 ± 0.0
Trp
1.448TrpAla: 1.448 ± 0.04
0.135TrpCys: 0.135 ± 0.01
0.836TrpAsp: 0.836 ± 0.025
0.614TrpGlu: 0.614 ± 0.024
0.514TrpPhe: 0.514 ± 0.018
1.028TrpGly: 1.028 ± 0.03
0.372TrpHis: 0.372 ± 0.019
0.61TrpIle: 0.61 ± 0.022
0.357TrpLys: 0.357 ± 0.018
1.642TrpLeu: 1.642 ± 0.039
0.277TrpMet: 0.277 ± 0.015
0.34TrpAsn: 0.34 ± 0.017
0.684TrpPro: 0.684 ± 0.027
0.534TrpGln: 0.534 ± 0.024
1.185TrpArg: 1.185 ± 0.033
1.018TrpSer: 1.018 ± 0.031
0.986TrpThr: 0.986 ± 0.032
1.14TrpVal: 1.14 ± 0.034
0.343TrpTrp: 0.343 ± 0.019
0.281TrpTyr: 0.281 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.585TyrAla: 2.585 ± 0.043
0.143TyrCys: 0.143 ± 0.01
1.537TyrAsp: 1.537 ± 0.037
1.135TyrGlu: 1.135 ± 0.034
0.661TyrPhe: 0.661 ± 0.027
1.994TyrGly: 1.994 ± 0.047
0.305TyrHis: 0.305 ± 0.016
0.579TyrIle: 0.579 ± 0.026
0.422TyrLys: 0.422 ± 0.023
2.016TyrLeu: 2.016 ± 0.04
0.228TyrMet: 0.228 ± 0.014
0.352TyrAsn: 0.352 ± 0.018
0.937TyrPro: 0.937 ± 0.034
0.559TyrGln: 0.559 ± 0.025
1.546TyrArg: 1.546 ± 0.039
1.001TyrSer: 1.001 ± 0.033
1.017TyrThr: 1.017 ± 0.033
2.045TyrVal: 2.045 ± 0.041
0.28TyrTrp: 0.28 ± 0.015
0.416TyrTyr: 0.416 ± 0.02
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3511 proteins (1133916 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski