Amino acid dipepetide frequency for Acidianus two-tailed virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.075AlaAla: 1.075 ± 0.299
0.17AlaCys: 0.17 ± 0.108
2.546AlaAsp: 2.546 ± 0.433
4.809AlaGlu: 4.809 ± 0.942
3.282AlaPhe: 3.282 ± 0.48
2.659AlaGly: 2.659 ± 0.488
0.509AlaHis: 0.509 ± 0.167
5.092AlaIle: 5.092 ± 0.572
5.149AlaLys: 5.149 ± 0.778
6.28AlaLeu: 6.28 ± 0.877
1.245AlaMet: 1.245 ± 0.229
3.565AlaAsn: 3.565 ± 0.529
2.32AlaPro: 2.32 ± 0.303
1.867AlaGln: 1.867 ± 0.457
2.546AlaArg: 2.546 ± 0.432
4.13AlaSer: 4.13 ± 0.458
3.734AlaThr: 3.734 ± 0.525
5.036AlaVal: 5.036 ± 0.541
0.509AlaTrp: 0.509 ± 0.266
3.168AlaTyr: 3.168 ± 0.721
0.0AlaXaa: 0.0 ± 0.0
Cys
0.226CysAla: 0.226 ± 0.104
0.0CysCys: 0.0 ± 0.0
0.057CysAsp: 0.057 ± 0.052
0.453CysGlu: 0.453 ± 0.159
0.283CysPhe: 0.283 ± 0.153
0.453CysGly: 0.453 ± 0.203
0.113CysHis: 0.113 ± 0.081
0.226CysIle: 0.226 ± 0.126
0.283CysLys: 0.283 ± 0.147
0.226CysLeu: 0.226 ± 0.116
0.17CysMet: 0.17 ± 0.111
0.226CysAsn: 0.226 ± 0.111
0.339CysPro: 0.339 ± 0.139
0.113CysGln: 0.113 ± 0.111
0.283CysArg: 0.283 ± 0.119
0.339CysSer: 0.339 ± 0.133
0.396CysThr: 0.396 ± 0.2
0.113CysVal: 0.113 ± 0.072
0.17CysTrp: 0.17 ± 0.115
0.283CysTyr: 0.283 ± 0.118
0.0CysXaa: 0.0 ± 0.0
Asp
3.338AspAla: 3.338 ± 0.651
0.17AspCys: 0.17 ± 0.103
2.546AspAsp: 2.546 ± 0.429
3.508AspGlu: 3.508 ± 0.559
2.716AspPhe: 2.716 ± 0.373
2.263AspGly: 2.263 ± 0.337
0.509AspHis: 0.509 ± 0.164
3.904AspIle: 3.904 ± 0.677
3.621AspLys: 3.621 ± 0.72
5.205AspLeu: 5.205 ± 1.176
0.849AspMet: 0.849 ± 0.212
2.093AspAsn: 2.093 ± 0.289
1.924AspPro: 1.924 ± 0.319
1.471AspGln: 1.471 ± 0.328
1.697AspArg: 1.697 ± 0.38
2.037AspSer: 2.037 ± 0.312
2.32AspThr: 2.32 ± 0.407
3.395AspVal: 3.395 ± 0.606
0.339AspTrp: 0.339 ± 0.148
2.207AspTyr: 2.207 ± 0.363
0.0AspXaa: 0.0 ± 0.0
Glu
4.583GluAla: 4.583 ± 0.725
0.339GluCys: 0.339 ± 0.12
3.112GluAsp: 3.112 ± 0.446
10.071GluGlu: 10.071 ± 2.618
2.659GluPhe: 2.659 ± 0.461
3.904GluGly: 3.904 ± 1.508
0.566GluHis: 0.566 ± 0.168
5.149GluIle: 5.149 ± 0.606
6.79GluLys: 6.79 ± 0.931
5.771GluLeu: 5.771 ± 0.928
1.301GluMet: 1.301 ± 0.344
4.583GluAsn: 4.583 ± 0.737
1.811GluPro: 1.811 ± 0.295
3.338GluGln: 3.338 ± 0.795
1.98GluArg: 1.98 ± 0.359
4.64GluSer: 4.64 ± 1.31
2.942GluThr: 2.942 ± 0.557
3.565GluVal: 3.565 ± 0.638
0.339GluTrp: 0.339 ± 0.133
2.603GluTyr: 2.603 ± 0.508
0.0GluXaa: 0.0 ± 0.0
Phe
2.999PheAla: 2.999 ± 0.43
0.339PheCys: 0.339 ± 0.167
2.433PheAsp: 2.433 ± 0.41
2.376PheGlu: 2.376 ± 0.429
1.924PhePhe: 1.924 ± 0.575
2.093PheGly: 2.093 ± 0.344
0.509PheHis: 0.509 ± 0.244
3.451PheIle: 3.451 ± 0.612
3.112PheLys: 3.112 ± 0.459
5.149PheLeu: 5.149 ± 0.69
1.415PheMet: 1.415 ± 0.275
2.659PheAsn: 2.659 ± 0.428
2.093PhePro: 2.093 ± 0.512
1.132PheGln: 1.132 ± 0.208
1.415PheArg: 1.415 ± 0.275
3.508PheSer: 3.508 ± 0.593
3.055PheThr: 3.055 ± 0.806
3.847PheVal: 3.847 ± 0.5
0.339PheTrp: 0.339 ± 0.169
2.037PheTyr: 2.037 ± 0.377
0.0PheXaa: 0.0 ± 0.0
Gly
2.376GlyAla: 2.376 ± 0.509
0.17GlyCys: 0.17 ± 0.104
2.546GlyAsp: 2.546 ± 0.487
4.357GlyGlu: 4.357 ± 1.638
2.886GlyPhe: 2.886 ± 0.544
4.074GlyGly: 4.074 ± 1.184
0.566GlyHis: 0.566 ± 0.184
4.64GlyIle: 4.64 ± 0.816
4.3GlyLys: 4.3 ± 0.814
5.545GlyLeu: 5.545 ± 0.626
1.415GlyMet: 1.415 ± 0.228
2.829GlyAsn: 2.829 ± 0.764
0.962GlyPro: 0.962 ± 0.292
2.546GlyGln: 2.546 ± 0.763
1.528GlyArg: 1.528 ± 0.301
3.621GlySer: 3.621 ± 0.798
2.716GlyThr: 2.716 ± 0.559
4.526GlyVal: 4.526 ± 0.599
0.849GlyTrp: 0.849 ± 0.243
2.716GlyTyr: 2.716 ± 0.439
0.0GlyXaa: 0.0 ± 0.0
His
0.622HisAla: 0.622 ± 0.16
0.057HisCys: 0.057 ± 0.066
0.679HisAsp: 0.679 ± 0.198
0.679HisGlu: 0.679 ± 0.171
0.736HisPhe: 0.736 ± 0.224
0.622HisGly: 0.622 ± 0.218
0.17HisHis: 0.17 ± 0.098
0.962HisIle: 0.962 ± 0.247
0.962HisLys: 0.962 ± 0.291
1.132HisLeu: 1.132 ± 0.318
0.339HisMet: 0.339 ± 0.132
0.339HisAsn: 0.339 ± 0.135
0.339HisPro: 0.339 ± 0.145
0.396HisGln: 0.396 ± 0.119
0.453HisArg: 0.453 ± 0.176
0.509HisSer: 0.509 ± 0.143
0.396HisThr: 0.396 ± 0.162
0.905HisVal: 0.905 ± 0.179
0.0HisTrp: 0.0 ± 0.0
0.736HisTyr: 0.736 ± 0.226
0.0HisXaa: 0.0 ± 0.0
Ile
6.167IleAla: 6.167 ± 0.627
0.566IleCys: 0.566 ± 0.19
4.244IleAsp: 4.244 ± 0.789
4.413IleGlu: 4.413 ± 0.53
3.451IlePhe: 3.451 ± 0.464
3.451IleGly: 3.451 ± 0.623
0.962IleHis: 0.962 ± 0.276
5.205IleIle: 5.205 ± 0.877
5.771IleLys: 5.771 ± 0.96
6.903IleLeu: 6.903 ± 0.659
1.528IleMet: 1.528 ± 0.299
4.13IleAsn: 4.13 ± 0.665
3.791IlePro: 3.791 ± 0.501
2.546IleGln: 2.546 ± 0.46
2.886IleArg: 2.886 ± 0.389
5.601IleSer: 5.601 ± 0.625
5.319IleThr: 5.319 ± 0.714
5.092IleVal: 5.092 ± 0.547
0.736IleTrp: 0.736 ± 0.206
3.395IleTyr: 3.395 ± 0.525
0.0IleXaa: 0.0 ± 0.0
Lys
4.357LysAla: 4.357 ± 0.773
0.226LysCys: 0.226 ± 0.138
4.074LysAsp: 4.074 ± 0.99
6.45LysGlu: 6.45 ± 1.111
2.376LysPhe: 2.376 ± 0.338
4.3LysGly: 4.3 ± 0.756
0.962LysHis: 0.962 ± 0.259
5.828LysIle: 5.828 ± 0.758
7.186LysLys: 7.186 ± 0.914
7.242LysLeu: 7.242 ± 0.935
1.98LysMet: 1.98 ± 0.449
4.413LysAsn: 4.413 ± 0.775
3.055LysPro: 3.055 ± 0.465
4.017LysGln: 4.017 ± 0.753
3.621LysArg: 3.621 ± 0.662
3.225LysSer: 3.225 ± 0.457
5.036LysThr: 5.036 ± 0.758
5.092LysVal: 5.092 ± 0.728
0.566LysTrp: 0.566 ± 0.174
4.3LysTyr: 4.3 ± 0.596
0.0LysXaa: 0.0 ± 0.0
Leu
7.129LeuAla: 7.129 ± 0.663
0.453LeuCys: 0.453 ± 0.161
5.092LeuAsp: 5.092 ± 0.744
4.753LeuGlu: 4.753 ± 0.595
4.64LeuPhe: 4.64 ± 0.882
4.753LeuGly: 4.753 ± 0.612
1.301LeuHis: 1.301 ± 0.27
6.337LeuIle: 6.337 ± 0.891
7.751LeuLys: 7.751 ± 1.057
9.958LeuLeu: 9.958 ± 1.044
2.037LeuMet: 2.037 ± 0.325
5.262LeuAsn: 5.262 ± 0.461
5.432LeuPro: 5.432 ± 0.62
3.565LeuGln: 3.565 ± 0.637
4.187LeuArg: 4.187 ± 0.798
7.412LeuSer: 7.412 ± 0.829
4.696LeuThr: 4.696 ± 0.531
7.129LeuVal: 7.129 ± 0.78
0.792LeuTrp: 0.792 ± 0.242
4.47LeuTyr: 4.47 ± 0.623
0.0LeuXaa: 0.0 ± 0.0
Met
1.754MetAla: 1.754 ± 0.249
0.113MetCys: 0.113 ± 0.09
0.849MetAsp: 0.849 ± 0.221
1.301MetGlu: 1.301 ± 0.32
0.905MetPhe: 0.905 ± 0.217
1.245MetGly: 1.245 ± 0.294
0.283MetHis: 0.283 ± 0.136
1.415MetIle: 1.415 ± 0.373
1.754MetLys: 1.754 ± 0.344
2.49MetLeu: 2.49 ± 0.452
0.339MetMet: 0.339 ± 0.154
1.018MetAsn: 1.018 ± 0.215
1.132MetPro: 1.132 ± 0.25
1.132MetGln: 1.132 ± 0.237
0.622MetArg: 0.622 ± 0.237
1.697MetSer: 1.697 ± 0.29
1.075MetThr: 1.075 ± 0.278
1.697MetVal: 1.697 ± 0.379
0.113MetTrp: 0.113 ± 0.085
0.905MetTyr: 0.905 ± 0.22
0.0MetXaa: 0.0 ± 0.0
Asn
3.734AsnAla: 3.734 ± 0.767
0.226AsnCys: 0.226 ± 0.122
2.433AsnAsp: 2.433 ± 0.67
3.734AsnGlu: 3.734 ± 0.502
1.867AsnPhe: 1.867 ± 0.349
3.734AsnGly: 3.734 ± 0.853
0.622AsnHis: 0.622 ± 0.203
4.583AsnIle: 4.583 ± 0.566
3.451AsnLys: 3.451 ± 0.448
4.583AsnLeu: 4.583 ± 0.628
1.132AsnMet: 1.132 ± 0.261
3.112AsnAsn: 3.112 ± 0.526
3.225AsnPro: 3.225 ± 0.576
1.697AsnGln: 1.697 ± 0.486
0.622AsnArg: 0.622 ± 0.228
3.565AsnSer: 3.565 ± 0.882
4.922AsnThr: 4.922 ± 0.996
5.092AsnVal: 5.092 ± 0.897
0.339AsnTrp: 0.339 ± 0.144
1.697AsnTyr: 1.697 ± 0.363
0.0AsnXaa: 0.0 ± 0.0
Pro
2.659ProAla: 2.659 ± 0.407
0.17ProCys: 0.17 ± 0.09
1.924ProAsp: 1.924 ± 0.453
2.603ProGlu: 2.603 ± 0.428
2.942ProPhe: 2.942 ± 0.79
2.603ProGly: 2.603 ± 0.38
0.339ProHis: 0.339 ± 0.14
3.565ProIle: 3.565 ± 0.392
3.678ProLys: 3.678 ± 0.635
3.791ProLeu: 3.791 ± 0.55
1.018ProMet: 1.018 ± 0.248
1.867ProAsn: 1.867 ± 0.338
4.413ProPro: 4.413 ± 1.075
1.528ProGln: 1.528 ± 0.233
1.301ProArg: 1.301 ± 0.343
4.47ProSer: 4.47 ± 0.775
2.659ProThr: 2.659 ± 0.425
2.999ProVal: 2.999 ± 0.443
0.792ProTrp: 0.792 ± 0.299
2.433ProTyr: 2.433 ± 0.706
0.0ProXaa: 0.0 ± 0.0
Gln
1.867GlnAla: 1.867 ± 0.411
0.226GlnCys: 0.226 ± 0.096
1.132GlnAsp: 1.132 ± 0.259
3.338GlnGlu: 3.338 ± 0.921
1.188GlnPhe: 1.188 ± 0.331
2.037GlnGly: 2.037 ± 0.613
0.226GlnHis: 0.226 ± 0.11
3.508GlnIle: 3.508 ± 0.485
2.942GlnLys: 2.942 ± 0.534
5.205GlnLeu: 5.205 ± 0.666
0.736GlnMet: 0.736 ± 0.256
3.451GlnAsn: 3.451 ± 0.567
2.037GlnPro: 2.037 ± 0.374
3.621GlnGln: 3.621 ± 0.901
1.301GlnArg: 1.301 ± 0.275
2.32GlnSer: 2.32 ± 0.442
2.15GlnThr: 2.15 ± 0.392
1.924GlnVal: 1.924 ± 0.372
0.057GlnTrp: 0.057 ± 0.051
1.754GlnTyr: 1.754 ± 0.456
0.0GlnXaa: 0.0 ± 0.0
Arg
1.754ArgAla: 1.754 ± 0.292
0.17ArgCys: 0.17 ± 0.111
2.037ArgAsp: 2.037 ± 0.396
1.924ArgGlu: 1.924 ± 0.37
1.641ArgPhe: 1.641 ± 0.348
2.32ArgGly: 2.32 ± 0.573
0.622ArgHis: 0.622 ± 0.205
3.112ArgIle: 3.112 ± 0.558
2.886ArgLys: 2.886 ± 0.527
3.734ArgLeu: 3.734 ± 0.561
0.792ArgMet: 0.792 ± 0.219
1.641ArgAsn: 1.641 ± 0.354
0.905ArgPro: 0.905 ± 0.259
1.471ArgGln: 1.471 ± 0.341
2.093ArgArg: 2.093 ± 0.529
2.15ArgSer: 2.15 ± 0.407
1.924ArgThr: 1.924 ± 0.357
2.037ArgVal: 2.037 ± 0.468
0.17ArgTrp: 0.17 ± 0.105
1.415ArgTyr: 1.415 ± 0.266
0.0ArgXaa: 0.0 ± 0.0
Ser
4.244SerAla: 4.244 ± 0.732
0.453SerCys: 0.453 ± 0.168
2.15SerAsp: 2.15 ± 0.362
4.753SerGlu: 4.753 ± 1.295
2.886SerPhe: 2.886 ± 0.454
4.583SerGly: 4.583 ± 1.115
0.396SerHis: 0.396 ± 0.142
5.319SerIle: 5.319 ± 0.597
4.809SerLys: 4.809 ± 0.681
5.884SerLeu: 5.884 ± 0.521
1.584SerMet: 1.584 ± 0.348
3.565SerAsn: 3.565 ± 0.52
3.621SerPro: 3.621 ± 0.596
3.678SerGln: 3.678 ± 0.766
2.207SerArg: 2.207 ± 0.455
6.111SerSer: 6.111 ± 1.346
5.488SerThr: 5.488 ± 1.35
4.3SerVal: 4.3 ± 0.563
0.283SerTrp: 0.283 ± 0.113
2.999SerTyr: 2.999 ± 0.633
0.0SerXaa: 0.0 ± 0.0
Thr
4.074ThrAla: 4.074 ± 0.533
0.17ThrCys: 0.17 ± 0.092
2.49ThrAsp: 2.49 ± 0.521
3.055ThrGlu: 3.055 ± 0.481
2.829ThrPhe: 2.829 ± 0.532
3.338ThrGly: 3.338 ± 0.528
0.849ThrHis: 0.849 ± 0.249
4.47ThrIle: 4.47 ± 0.69
4.526ThrLys: 4.526 ± 0.865
6.111ThrLeu: 6.111 ± 0.928
1.188ThrMet: 1.188 ± 0.253
3.395ThrAsn: 3.395 ± 0.589
3.961ThrPro: 3.961 ± 0.964
2.942ThrGln: 2.942 ± 0.567
1.754ThrArg: 1.754 ± 0.365
4.357ThrSer: 4.357 ± 0.945
5.036ThrThr: 5.036 ± 1.284
4.47ThrVal: 4.47 ± 1.01
0.339ThrTrp: 0.339 ± 0.124
2.49ThrTyr: 2.49 ± 0.473
0.0ThrXaa: 0.0 ± 0.0
Val
3.565ValAla: 3.565 ± 0.452
0.283ValCys: 0.283 ± 0.11
2.829ValAsp: 2.829 ± 0.503
3.904ValGlu: 3.904 ± 0.428
3.451ValPhe: 3.451 ± 0.693
3.678ValGly: 3.678 ± 0.549
0.679ValHis: 0.679 ± 0.205
5.771ValIle: 5.771 ± 0.592
5.998ValLys: 5.998 ± 0.916
7.186ValLeu: 7.186 ± 0.753
1.358ValMet: 1.358 ± 0.363
3.791ValAsn: 3.791 ± 0.623
3.282ValPro: 3.282 ± 0.518
2.093ValGln: 2.093 ± 0.421
2.207ValArg: 2.207 ± 0.467
5.941ValSer: 5.941 ± 1.025
4.583ValThr: 4.583 ± 0.833
5.262ValVal: 5.262 ± 0.676
0.509ValTrp: 0.509 ± 0.159
3.847ValTyr: 3.847 ± 0.701
0.0ValXaa: 0.0 ± 0.0
Trp
0.509TrpAla: 0.509 ± 0.152
0.113TrpCys: 0.113 ± 0.082
0.396TrpAsp: 0.396 ± 0.108
0.566TrpGlu: 0.566 ± 0.232
0.226TrpPhe: 0.226 ± 0.118
0.226TrpGly: 0.226 ± 0.12
0.17TrpHis: 0.17 ± 0.111
0.396TrpIle: 0.396 ± 0.193
0.679TrpLys: 0.679 ± 0.201
0.962TrpLeu: 0.962 ± 0.23
0.226TrpMet: 0.226 ± 0.122
0.17TrpAsn: 0.17 ± 0.097
0.17TrpPro: 0.17 ± 0.102
0.453TrpGln: 0.453 ± 0.236
0.509TrpArg: 0.509 ± 0.417
0.679TrpSer: 0.679 ± 0.274
0.283TrpThr: 0.283 ± 0.125
0.453TrpVal: 0.453 ± 0.165
0.113TrpTrp: 0.113 ± 0.08
0.622TrpTyr: 0.622 ± 0.214
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.659TyrAla: 2.659 ± 0.472
0.339TyrCys: 0.339 ± 0.125
2.546TyrAsp: 2.546 ± 0.288
2.999TyrGlu: 2.999 ± 0.509
2.999TyrPhe: 2.999 ± 0.387
2.829TyrGly: 2.829 ± 0.405
0.679TyrHis: 0.679 ± 0.169
3.225TyrIle: 3.225 ± 0.529
2.659TyrLys: 2.659 ± 0.544
3.791TyrLeu: 3.791 ± 0.628
1.132TyrMet: 1.132 ± 0.321
2.263TyrAsn: 2.263 ± 0.451
2.942TyrPro: 2.942 ± 0.711
1.415TyrGln: 1.415 ± 0.31
1.584TyrArg: 1.584 ± 0.284
3.055TyrSer: 3.055 ± 0.589
3.112TyrThr: 3.112 ± 0.852
3.282TyrVal: 3.282 ± 0.769
0.566TyrTrp: 0.566 ± 0.245
2.546TyrTyr: 2.546 ± 0.573
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (17675 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski