Amino acid dipeptide frequency for Actinorugispora endophytica

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
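As an illustration of the idea, here is a minimal Python sketch of how such frequencies could be counted. It is not the code used by Proteome-pI: the overlapping two-residue window, the normalisation by the total number of dipeptides, the use of one-letter codes, and the function names are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein, never across proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: non-standard or unknown residues are mapped to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille of all counted dipeptides."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAAL", "AAG"]
freqs = dipeptide_per_mille(toy)
```

In the toy input there are five dipeptides in total (MA, AA, AL, AA, AG), so `freqs["AA"]` is 2 out of 5, or 400‰.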

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
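The comparison against the uniform expectation can be sketched in a few lines of Python. The function names are hypothetical, and the exact rule the page uses for colouring cells is not stated beyond "more than expected" and "underrepresented", so the simple threshold below is an assumption. The three observed values are copied from the table.

```python
def uniform_expectation_per_mille(n_residue_types=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / n_residue_types ** 2


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"   # would be marked red
    if observed_per_mille < expected_per_mille:
        return "under"  # would be marked blue
    return "as expected"


expected = uniform_expectation_per_mille()  # 1000 / 400 = 2.5 per mille

# Observed values taken from the table (per mille).
observed = {"AlaAla": 21.124, "CysCys": 0.09, "GluAsp": 3.222}
labels = {pair: classify(value, expected) for pair, value in observed.items()}
```

With 21 residue types (including Xaa) the same function gives 1000 / 441, roughly 2.27‰.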

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
21.124AlaAla: 21.124 ± 0.219
0.978AlaCys: 0.978 ± 0.03
8.78AlaAsp: 8.78 ± 0.101
9.668AlaGlu: 9.668 ± 0.096
3.55AlaPhe: 3.55 ± 0.062
12.743AlaGly: 12.743 ± 0.135
2.695AlaHis: 2.695 ± 0.048
3.458AlaIle: 3.458 ± 0.056
2.062AlaLys: 2.062 ± 0.05
14.687AlaLeu: 14.687 ± 0.144
2.464AlaMet: 2.464 ± 0.043
1.945AlaAsn: 1.945 ± 0.036
7.54AlaPro: 7.54 ± 0.1
2.98AlaGln: 2.98 ± 0.049
11.26AlaArg: 11.26 ± 0.128
5.804AlaSer: 5.804 ± 0.068
6.042AlaThr: 6.042 ± 0.075
12.737AlaVal: 12.737 ± 0.141
1.843AlaTrp: 1.843 ± 0.036
2.402AlaTyr: 2.402 ± 0.038
0.001AlaXaa: 0.001 ± 0.001
Cys
1.037CysAla: 1.037 ± 0.031
0.09CysCys: 0.09 ± 0.009
0.435CysAsp: 0.435 ± 0.016
0.397CysGlu: 0.397 ± 0.016
0.201CysPhe: 0.201 ± 0.013
0.884CysGly: 0.884 ± 0.026
0.202CysHis: 0.202 ± 0.013
0.087CysIle: 0.087 ± 0.008
0.069CysLys: 0.069 ± 0.007
0.687CysLeu: 0.687 ± 0.02
0.106CysMet: 0.106 ± 0.008
0.102CysAsn: 0.102 ± 0.008
0.456CysPro: 0.456 ± 0.022
0.145CysGln: 0.145 ± 0.009
0.594CysArg: 0.594 ± 0.022
0.387CysSer: 0.387 ± 0.018
0.328CysThr: 0.328 ± 0.017
0.692CysVal: 0.692 ± 0.026
0.087CysTrp: 0.087 ± 0.007
0.161CysTyr: 0.161 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
7.978AspAla: 7.978 ± 0.09
0.379AspCys: 0.379 ± 0.017
3.946AspAsp: 3.946 ± 0.066
4.194AspGlu: 4.194 ± 0.056
1.651AspPhe: 1.651 ± 0.037
6.397AspGly: 6.397 ± 0.085
1.442AspHis: 1.442 ± 0.034
1.861AspIle: 1.861 ± 0.037
0.852AspLys: 0.852 ± 0.027
6.589AspLeu: 6.589 ± 0.08
0.873AspMet: 0.873 ± 0.025
0.94AspAsn: 0.94 ± 0.028
5.092AspPro: 5.092 ± 0.078
1.567AspGln: 1.567 ± 0.042
5.289AspArg: 5.289 ± 0.07
2.785AspSer: 2.785 ± 0.063
3.247AspThr: 3.247 ± 0.058
4.974AspVal: 4.974 ± 0.062
0.966AspTrp: 0.966 ± 0.03
1.348AspTyr: 1.348 ± 0.039
0.0AspXaa: 0.0 ± 0.0
Glu
8.176GluAla: 8.176 ± 0.081
0.403GluCys: 0.403 ± 0.018
3.222GluAsp: 3.222 ± 0.047
4.394GluGlu: 4.394 ± 0.064
1.682GluPhe: 1.682 ± 0.034
4.834GluGly: 4.834 ± 0.058
1.605GluHis: 1.605 ± 0.034
2.472GluIle: 2.472 ± 0.04
1.189GluLys: 1.189 ± 0.035
6.772GluLeu: 6.772 ± 0.075
0.998GluMet: 0.998 ± 0.03
1.19GluAsn: 1.19 ± 0.03
3.872GluPro: 3.872 ± 0.072
2.097GluGln: 2.097 ± 0.043
6.519GluArg: 6.519 ± 0.085
2.913GluSer: 2.913 ± 0.045
3.259GluThr: 3.259 ± 0.057
5.155GluVal: 5.155 ± 0.067
0.883GluTrp: 0.883 ± 0.026
1.262GluTyr: 1.262 ± 0.033
0.0GluXaa: 0.0 ± 0.0
Phe
3.774PheAla: 3.774 ± 0.058
0.254PheCys: 0.254 ± 0.011
2.095PheAsp: 2.095 ± 0.039
1.469PheGlu: 1.469 ± 0.038
0.883PhePhe: 0.883 ± 0.032
3.131PheGly: 3.131 ± 0.048
0.601PheHis: 0.601 ± 0.021
0.661PheIle: 0.661 ± 0.025
0.377PheLys: 0.377 ± 0.017
2.684PheLeu: 2.684 ± 0.055
0.438PheMet: 0.438 ± 0.017
0.551PheAsn: 0.551 ± 0.022
1.423PhePro: 1.423 ± 0.036
0.599PheGln: 0.599 ± 0.021
1.697PheArg: 1.697 ± 0.035
1.549PheSer: 1.549 ± 0.033
2.052PheThr: 2.052 ± 0.038
2.417PheVal: 2.417 ± 0.059
0.409PheTrp: 0.409 ± 0.018
0.601PheTyr: 0.601 ± 0.02
0.0PheXaa: 0.0 ± 0.0
Gly
11.768GlyAla: 11.768 ± 0.116
0.778GlyCys: 0.778 ± 0.024
5.617GlyAsp: 5.617 ± 0.071
5.916GlyGlu: 5.916 ± 0.075
3.0GlyPhe: 3.0 ± 0.05
9.438GlyGly: 9.438 ± 0.103
2.2GlyHis: 2.2 ± 0.037
3.185GlyIle: 3.185 ± 0.058
1.751GlyLys: 1.751 ± 0.037
9.797GlyLeu: 9.797 ± 0.105
2.06GlyMet: 2.06 ± 0.04
1.705GlyAsn: 1.705 ± 0.04
5.38GlyPro: 5.38 ± 0.068
2.408GlyGln: 2.408 ± 0.051
8.298GlyArg: 8.298 ± 0.092
5.229GlySer: 5.229 ± 0.075
5.594GlyThr: 5.594 ± 0.06
8.469GlyVal: 8.469 ± 0.088
1.551GlyTrp: 1.551 ± 0.04
2.401GlyTyr: 2.401 ± 0.045
0.0GlyXaa: 0.0 ± 0.0
His
2.557HisAla: 2.557 ± 0.044
0.164HisCys: 0.164 ± 0.011
1.296HisAsp: 1.296 ± 0.032
1.184HisGlu: 1.184 ± 0.029
0.557HisPhe: 0.557 ± 0.018
2.21HisGly: 2.21 ± 0.04
0.604HisHis: 0.604 ± 0.027
0.593HisIle: 0.593 ± 0.02
0.287HisLys: 0.287 ± 0.015
2.184HisLeu: 2.184 ± 0.048
0.339HisMet: 0.339 ± 0.016
0.374HisAsn: 0.374 ± 0.015
1.679HisPro: 1.679 ± 0.039
0.576HisGln: 0.576 ± 0.022
2.02HisArg: 2.02 ± 0.042
0.996HisSer: 0.996 ± 0.029
1.182HisThr: 1.182 ± 0.031
1.695HisVal: 1.695 ± 0.035
0.32HisTrp: 0.32 ± 0.016
0.453HisTyr: 0.453 ± 0.016
0.0HisXaa: 0.0 ± 0.0
Ile
4.655IleAla: 4.655 ± 0.067
0.223IleCys: 0.223 ± 0.011
2.24IleAsp: 2.24 ± 0.041
1.988IleGlu: 1.988 ± 0.039
0.557IlePhe: 0.557 ± 0.022
3.523IleGly: 3.523 ± 0.068
0.532IleHis: 0.532 ± 0.019
0.981IleIle: 0.981 ± 0.032
0.519IleLys: 0.519 ± 0.022
2.345IleLeu: 2.345 ± 0.047
0.468IleMet: 0.468 ± 0.02
0.614IleAsn: 0.614 ± 0.024
1.709IlePro: 1.709 ± 0.037
0.679IleGln: 0.679 ± 0.026
2.278IleArg: 2.278 ± 0.041
1.603IleSer: 1.603 ± 0.034
2.102IleThr: 2.102 ± 0.037
2.628IleVal: 2.628 ± 0.052
0.308IleTrp: 0.308 ± 0.016
0.471IleTyr: 0.471 ± 0.019
0.0IleXaa: 0.0 ± 0.0
Lys
2.053LysAla: 2.053 ± 0.043
0.084LysCys: 0.084 ± 0.008
0.826LysAsp: 0.826 ± 0.025
1.026LysGlu: 1.026 ± 0.032
0.342LysPhe: 0.342 ± 0.019
1.427LysGly: 1.427 ± 0.038
0.345LysHis: 0.345 ± 0.017
0.629LysIle: 0.629 ± 0.024
0.501LysLys: 0.501 ± 0.022
1.425LysLeu: 1.425 ± 0.036
0.281LysMet: 0.281 ± 0.016
0.381LysAsn: 0.381 ± 0.016
0.98LysPro: 0.98 ± 0.031
0.449LysGln: 0.449 ± 0.021
1.383LysArg: 1.383 ± 0.04
0.86LysSer: 0.86 ± 0.028
0.926LysThr: 0.926 ± 0.028
1.335LysVal: 1.335 ± 0.031
0.206LysTrp: 0.206 ± 0.012
0.329LysTyr: 0.329 ± 0.015
0.0LysXaa: 0.0 ± 0.0
Leu
15.457LeuAla: 15.457 ± 0.143
0.753LeuCys: 0.753 ± 0.024
7.017LeuAsp: 7.017 ± 0.09
5.364LeuGlu: 5.364 ± 0.063
2.776LeuPhe: 2.776 ± 0.052
9.332LeuGly: 9.332 ± 0.097
1.975LeuHis: 1.975 ± 0.038
2.95LeuIle: 2.95 ± 0.063
1.461LeuLys: 1.461 ± 0.03
11.567LeuLeu: 11.567 ± 0.131
1.61LeuMet: 1.61 ± 0.039
1.674LeuAsn: 1.674 ± 0.035
6.282LeuPro: 6.282 ± 0.064
1.742LeuGln: 1.742 ± 0.035
9.355LeuArg: 9.355 ± 0.104
5.645LeuSer: 5.645 ± 0.061
5.981LeuThr: 5.981 ± 0.063
9.651LeuVal: 9.651 ± 0.111
1.282LeuTrp: 1.282 ± 0.027
1.735LeuTyr: 1.735 ± 0.038
0.0LeuXaa: 0.0 ± 0.0
Met
2.361MetAla: 2.361 ± 0.043
0.111MetCys: 0.111 ± 0.009
0.925MetAsp: 0.925 ± 0.023
0.842MetGlu: 0.842 ± 0.026
0.485MetPhe: 0.485 ± 0.021
1.422MetGly: 1.422 ± 0.033
0.292MetHis: 0.292 ± 0.015
0.624MetIle: 0.624 ± 0.022
0.307MetLys: 0.307 ± 0.014
1.756MetLeu: 1.756 ± 0.039
0.285MetMet: 0.285 ± 0.014
0.434MetAsn: 0.434 ± 0.018
1.035MetPro: 1.035 ± 0.028
0.337MetGln: 0.337 ± 0.016
1.63MetArg: 1.63 ± 0.034
1.392MetSer: 1.392 ± 0.031
1.452MetThr: 1.452 ± 0.035
1.484MetVal: 1.484 ± 0.038
0.209MetTrp: 0.209 ± 0.013
0.265MetTyr: 0.265 ± 0.013
0.0MetXaa: 0.0 ± 0.0
Asn
2.11AsnAla: 2.11 ± 0.043
0.153AsnCys: 0.153 ± 0.011
0.917AsnAsp: 0.917 ± 0.026
0.922AsnGlu: 0.922 ± 0.026
0.442AsnPhe: 0.442 ± 0.019
1.882AsnGly: 1.882 ± 0.043
0.402AsnHis: 0.402 ± 0.017
0.686AsnIle: 0.686 ± 0.025
0.3AsnLys: 0.3 ± 0.016
1.663AsnLeu: 1.663 ± 0.036
0.287AsnMet: 0.287 ± 0.013
0.376AsnAsn: 0.376 ± 0.017
1.391AsnPro: 1.391 ± 0.034
0.547AsnGln: 0.547 ± 0.02
1.437AsnArg: 1.437 ± 0.036
0.843AsnSer: 0.843 ± 0.027
1.05AsnThr: 1.05 ± 0.027
1.305AsnVal: 1.305 ± 0.03
0.28AsnTrp: 0.28 ± 0.014
0.397AsnTyr: 0.397 ± 0.018
0.0AsnXaa: 0.0 ± 0.0
Pro
7.961ProAla: 7.961 ± 0.103
0.308ProCys: 0.308 ± 0.015
4.907ProAsp: 4.907 ± 0.08
5.013ProGlu: 5.013 ± 0.071
1.624ProPhe: 1.624 ± 0.036
7.311ProGly: 7.311 ± 0.082
1.252ProHis: 1.252 ± 0.033
1.294ProIle: 1.294 ± 0.03
0.899ProLys: 0.899 ± 0.027
5.391ProLeu: 5.391 ± 0.065
1.052ProMet: 1.052 ± 0.029
0.936ProAsn: 0.936 ± 0.028
4.15ProPro: 4.15 ± 0.109
1.427ProGln: 1.427 ± 0.042
4.505ProArg: 4.505 ± 0.067
3.207ProSer: 3.207 ± 0.065
2.779ProThr: 2.779 ± 0.049
5.806ProVal: 5.806 ± 0.069
0.908ProTrp: 0.908 ± 0.026
1.218ProTyr: 1.218 ± 0.031
0.0ProXaa: 0.0 ± 0.0
Gln
3.191GlnAla: 3.191 ± 0.058
0.136GlnCys: 0.136 ± 0.01
1.109GlnAsp: 1.109 ± 0.035
1.489GlnGlu: 1.489 ± 0.037
0.527GlnPhe: 0.527 ± 0.019
2.093GlnGly: 2.093 ± 0.042
0.48GlnHis: 0.48 ± 0.019
0.985GlnIle: 0.985 ± 0.027
0.402GlnLys: 0.402 ± 0.017
2.392GlnLeu: 2.392 ± 0.045
0.462GlnMet: 0.462 ± 0.019
0.451GlnAsn: 0.451 ± 0.02
1.317GlnPro: 1.317 ± 0.044
0.885GlnGln: 0.885 ± 0.036
2.282GlnArg: 2.282 ± 0.049
1.088GlnSer: 1.088 ± 0.03
1.17GlnThr: 1.17 ± 0.031
2.164GlnVal: 2.164 ± 0.036
0.404GlnTrp: 0.404 ± 0.017
0.464GlnTyr: 0.464 ± 0.021
0.0GlnXaa: 0.0 ± 0.0
Arg
10.928ArgAla: 10.928 ± 0.125
0.572ArgCys: 0.572 ± 0.022
4.915ArgAsp: 4.915 ± 0.068
5.25ArgGlu: 5.25 ± 0.066
2.615ArgPhe: 2.615 ± 0.044
6.812ArgGly: 6.812 ± 0.067
2.074ArgHis: 2.074 ± 0.038
3.126ArgIle: 3.126 ± 0.044
1.406ArgLys: 1.406 ± 0.036
9.477ArgLeu: 9.477 ± 0.113
1.956ArgMet: 1.956 ± 0.042
1.466ArgAsn: 1.466 ± 0.034
5.316ArgPro: 5.316 ± 0.076
1.946ArgGln: 1.946 ± 0.037
8.756ArgArg: 8.756 ± 0.093
4.525ArgSer: 4.525 ± 0.064
4.752ArgThr: 4.752 ± 0.061
7.373ArgVal: 7.373 ± 0.081
1.396ArgTrp: 1.396 ± 0.032
1.921ArgTyr: 1.921 ± 0.039
0.0ArgXaa: 0.0 ± 0.0
Ser
6.632SerAla: 6.632 ± 0.061
0.366SerCys: 0.366 ± 0.018
2.898SerAsp: 2.898 ± 0.051
2.874SerGlu: 2.874 ± 0.046
1.535SerPhe: 1.535 ± 0.031
6.327SerGly: 6.327 ± 0.073
0.96SerHis: 0.96 ± 0.026
1.419SerIle: 1.419 ± 0.034
0.758SerLys: 0.758 ± 0.028
4.849SerLeu: 4.849 ± 0.061
1.034SerMet: 1.034 ± 0.025
0.817SerAsn: 0.817 ± 0.024
3.335SerPro: 3.335 ± 0.054
1.133SerGln: 1.133 ± 0.028
3.895SerArg: 3.895 ± 0.056
2.77SerSer: 2.77 ± 0.051
2.859SerThr: 2.859 ± 0.051
4.43SerVal: 4.43 ± 0.056
0.857SerTrp: 0.857 ± 0.026
1.161SerTyr: 1.161 ± 0.029
0.0SerXaa: 0.0 ± 0.0
Thr
7.723ThrAla: 7.723 ± 0.087
0.33ThrCys: 0.33 ± 0.015
3.402ThrAsp: 3.402 ± 0.057
3.22ThrGlu: 3.22 ± 0.048
1.394ThrPhe: 1.394 ± 0.033
6.485ThrGly: 6.485 ± 0.078
1.057ThrHis: 1.057 ± 0.029
1.605ThrIle: 1.605 ± 0.038
0.794ThrLys: 0.794 ± 0.025
5.3ThrLeu: 5.3 ± 0.066
0.902ThrMet: 0.902 ± 0.027
0.895ThrAsn: 0.895 ± 0.025
3.749ThrPro: 3.749 ± 0.062
1.101ThrGln: 1.101 ± 0.029
4.249ThrArg: 4.249 ± 0.062
2.656ThrSer: 2.656 ± 0.038
3.234ThrThr: 3.234 ± 0.061
5.474ThrVal: 5.474 ± 0.066
0.794ThrTrp: 0.794 ± 0.025
0.96ThrTyr: 0.96 ± 0.026
0.0ThrXaa: 0.0 ± 0.0
Val
11.251ValAla: 11.251 ± 0.133
0.775ValCys: 0.775 ± 0.023
5.671ValAsp: 5.671 ± 0.059
5.611ValGlu: 5.611 ± 0.072
2.75ValPhe: 2.75 ± 0.047
7.192ValGly: 7.192 ± 0.094
1.818ValHis: 1.818 ± 0.039
2.829ValIle: 2.829 ± 0.05
1.304ValLys: 1.304 ± 0.032
10.146ValLeu: 10.146 ± 0.109
1.499ValMet: 1.499 ± 0.037
1.807ValAsn: 1.807 ± 0.039
5.368ValPro: 5.368 ± 0.061
1.778ValGln: 1.778 ± 0.034
7.826ValArg: 7.826 ± 0.079
4.726ValSer: 4.726 ± 0.05
5.179ValThr: 5.179 ± 0.061
9.124ValVal: 9.124 ± 0.112
1.248ValTrp: 1.248 ± 0.032
1.704ValTyr: 1.704 ± 0.031
0.0ValXaa: 0.0 ± 0.0
Trp
1.572TrpAla: 1.572 ± 0.035
0.136TrpCys: 0.136 ± 0.01
0.794TrpAsp: 0.794 ± 0.026
0.822TrpGlu: 0.822 ± 0.022
0.472TrpPhe: 0.472 ± 0.019
1.049TrpGly: 1.049 ± 0.026
0.327TrpHis: 0.327 ± 0.015
0.513TrpIle: 0.513 ± 0.018
0.281TrpLys: 0.281 ± 0.015
1.734TrpLeu: 1.734 ± 0.04
0.281TrpMet: 0.281 ± 0.012
0.389TrpAsn: 0.389 ± 0.017
0.781TrpPro: 0.781 ± 0.025
0.466TrpGln: 0.466 ± 0.018
1.454TrpArg: 1.454 ± 0.036
0.906TrpSer: 0.906 ± 0.025
0.954TrpThr: 0.954 ± 0.03
1.06TrpVal: 1.06 ± 0.031
0.377TrpTrp: 0.377 ± 0.018
0.282TrpTyr: 0.282 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.456TyrAla: 2.456 ± 0.038
0.154TyrCys: 0.154 ± 0.01
1.358TyrAsp: 1.358 ± 0.04
1.255TyrGlu: 1.255 ± 0.028
0.624TyrPhe: 0.624 ± 0.024
2.053TyrGly: 2.053 ± 0.041
0.408TyrHis: 0.408 ± 0.017
0.466TyrIle: 0.466 ± 0.018
0.268TyrLys: 0.268 ± 0.014
2.165TyrLeu: 2.165 ± 0.038
0.252TyrMet: 0.252 ± 0.014
0.345TyrAsn: 0.345 ± 0.014
1.092TyrPro: 1.092 ± 0.03
0.628TyrGln: 0.628 ± 0.024
1.854TyrArg: 1.854 ± 0.042
1.023TyrSer: 1.023 ± 0.027
1.182TyrThr: 1.182 ± 0.034
1.63TyrVal: 1.63 ± 0.031
0.323TyrTrp: 0.323 ± 0.013
0.438TyrTyr: 0.438 ± 0.022
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.001XaaPro: 0.001 ± 0.001
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.002XaaXaa: 0.002 ± 0.002
Statistics based on 4593 proteins (1481852 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
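A protein-level bootstrap of this kind could look like the sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates. The source does not say which spread statistic it uses, and the function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Overlapping dipeptide counts for a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Mean and standard deviation (per mille) of one dipeptide's frequency.

    Assumption: each replicate draws len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome (hypothetical sequences in one-letter code).
toy = ["MAAL", "AAG", "GGGA", "ALAL"]
mean_aa, err_aa = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein intact, which is presumably why the note specifies the protein level.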

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski