Amino acid dipeptide frequency for Acidianus two-tailed virus (ATV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
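The counting behind such a table can be sketched as follows. This is a minimal illustration, not the method used by Proteome-pI: it assumes one-letter amino acid codes, overlapping windows of two residues, and that pairs never span two proteins. The function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: every overlapping pair of adjacent residues inside a
    protein counts once; pairs are not formed across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the ATV proteome).
toy = ["MKLLE", "MEEK"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

With real data, the input would be the protein sequences of the proteome, and the resulting values could be compared with the 2.5‰ expected for 400 equally likely dipeptides.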

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
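As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1 in 400 (or 1 in 441 when Xaa is counted). The helper names are hypothetical, and comparing against the uniform expectation is an assumption about how over- and underrepresentation is judged.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def expected_uniform_per_mille(n_residue_types=20):
    """Per mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / n_residue_types ** 2


def representation(value, n_residue_types=20):
    """Classify a per mille value against the uniform expectation (assumption)."""
    expected = expected_uniform_per_mille(n_residue_types)
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values for GluGlu and CysCys are taken from the table.
print(per_mille_to_fraction(10.175))   # GluGlu as a fraction
print(expected_uniform_per_mille())    # 20 amino acids, 400 dipeptides
print(expected_uniform_per_mille(21))  # 441 dipeptides when Xaa is counted
print(representation(10.175), representation(0.104))
```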

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.883 ± 0.244
AlaCys: 0.415 ± 0.126
AlaAsp: 2.128 ± 0.356
AlaGlu: 4.88 ± 0.751
AlaPhe: 2.596 ± 0.429
AlaGly: 2.699 ± 0.38
AlaHis: 0.623 ± 0.16
AlaIle: 5.399 ± 0.6
AlaLys: 5.97 ± 0.712
AlaLeu: 6.385 ± 0.687
AlaMet: 1.194 ± 0.263
AlaAsn: 3.271 ± 0.536
AlaPro: 2.596 ± 0.3
AlaGln: 2.492 ± 0.409
AlaArg: 2.336 ± 0.424
AlaSer: 4.413 ± 0.446
AlaThr: 3.322 ± 0.41
AlaVal: 4.257 ± 0.385
AlaTrp: 0.519 ± 0.203
AlaTyr: 3.374 ± 0.745
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.26 ± 0.123
CysCys: 0.104 ± 0.104
CysAsp: 0.363 ± 0.124
CysGlu: 0.415 ± 0.141
CysPhe: 0.208 ± 0.129
CysGly: 0.363 ± 0.205
CysHis: 0.104 ± 0.089
CysIle: 0.208 ± 0.097
CysLys: 0.311 ± 0.139
CysLeu: 0.311 ± 0.119
CysMet: 0.052 ± 0.051
CysAsn: 0.156 ± 0.087
CysPro: 0.779 ± 0.26
CysGln: 0.104 ± 0.114
CysArg: 0.363 ± 0.129
CysSer: 0.26 ± 0.106
CysThr: 0.363 ± 0.159
CysVal: 0.156 ± 0.089
CysTrp: 0.052 ± 0.055
CysTyr: 0.467 ± 0.138
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.855 ± 0.658
AspCys: 0.208 ± 0.102
AspAsp: 2.232 ± 0.412
AspGlu: 4.205 ± 0.524
AspPhe: 2.232 ± 0.369
AspGly: 2.077 ± 0.391
AspHis: 0.467 ± 0.156
AspIle: 3.842 ± 0.65
AspLys: 3.426 ± 0.84
AspLeu: 5.243 ± 0.924
AspMet: 0.675 ± 0.179
AspAsn: 1.609 ± 0.278
AspPro: 1.713 ± 0.306
AspGln: 1.609 ± 0.295
AspArg: 1.921 ± 0.387
AspSer: 1.661 ± 0.268
AspThr: 2.544 ± 0.405
AspVal: 3.842 ± 0.708
AspTrp: 0.311 ± 0.121
AspTyr: 2.648 ± 0.346
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.932 ± 0.634
GluCys: 0.311 ± 0.118
GluAsp: 3.011 ± 0.419
GluGlu: 10.175 ± 2.101
GluPhe: 2.803 ± 0.469
GluGly: 4.361 ± 1.409
GluHis: 1.09 ± 0.742
GluIle: 5.191 ± 0.534
GluLys: 7.268 ± 0.876
GluLeu: 5.866 ± 0.711
GluMet: 1.35 ± 0.379
GluAsn: 4.101 ± 0.551
GluPro: 1.869 ± 0.341
GluGln: 3.426 ± 0.81
GluArg: 2.232 ± 0.387
GluSer: 4.257 ± 1.326
GluThr: 3.219 ± 0.573
GluVal: 3.79 ± 0.522
GluTrp: 0.519 ± 0.174
GluTyr: 3.063 ± 0.468
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.544 ± 0.401
PheCys: 0.26 ± 0.164
PheAsp: 2.492 ± 0.449
PheGlu: 1.921 ± 0.285
PhePhe: 1.557 ± 0.421
PheGly: 1.817 ± 0.332
PheHis: 0.623 ± 0.271
PheIle: 3.322 ± 0.503
PheLys: 3.115 ± 0.528
PheLeu: 4.932 ± 0.79
PheMet: 1.402 ± 0.213
PheAsn: 2.855 ± 0.525
PhePro: 2.077 ± 0.409
PheGln: 0.779 ± 0.174
PheArg: 1.661 ± 0.243
PheSer: 3.271 ± 0.518
PheThr: 2.855 ± 0.692
PheVal: 3.686 ± 0.494
PheTrp: 0.311 ± 0.183
PheTyr: 2.077 ± 0.369
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.648 ± 0.48
GlyCys: 0.26 ± 0.096
GlyAsp: 2.544 ± 0.379
GlyGlu: 4.413 ± 1.45
GlyPhe: 3.115 ± 0.483
GlyGly: 3.011 ± 0.68
GlyHis: 0.571 ± 0.195
GlyIle: 4.257 ± 0.584
GlyLys: 4.361 ± 0.589
GlyLeu: 4.88 ± 0.591
GlyMet: 1.194 ± 0.224
GlyAsn: 2.648 ± 0.625
GlyPro: 0.831 ± 0.201
GlyGln: 2.284 ± 0.661
GlyArg: 1.557 ± 0.329
GlySer: 4.309 ± 0.948
GlyThr: 2.959 ± 0.479
GlyVal: 3.322 ± 0.558
GlyTrp: 0.675 ± 0.252
GlyTyr: 2.648 ± 0.351
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.623 ± 0.181
HisCys: 0.156 ± 0.093
HisAsp: 0.727 ± 0.189
HisGlu: 1.09 ± 0.45
HisPhe: 0.675 ± 0.213
HisGly: 1.194 ± 0.368
HisHis: 0.623 ± 0.362
HisIle: 1.09 ± 0.231
HisLys: 1.038 ± 0.34
HisLeu: 0.986 ± 0.36
HisMet: 0.363 ± 0.128
HisAsn: 0.623 ± 0.26
HisPro: 0.571 ± 0.183
HisGln: 0.311 ± 0.133
HisArg: 0.727 ± 0.257
HisSer: 0.727 ± 0.273
HisThr: 0.519 ± 0.172
HisVal: 0.623 ± 0.16
HisTrp: 0.0 ± 0.0
HisTyr: 0.883 ± 0.206
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.918 ± 0.51
IleCys: 0.363 ± 0.127
IleAsp: 4.049 ± 0.582
IleGlu: 4.516 ± 0.511
IlePhe: 3.115 ± 0.404
IleGly: 3.271 ± 0.498
IleHis: 1.194 ± 0.301
IleIle: 4.88 ± 0.661
IleLys: 5.243 ± 0.784
IleLeu: 6.904 ± 0.68
IleMet: 1.609 ± 0.37
IleAsn: 3.271 ± 0.43
IlePro: 4.205 ± 0.665
IleGln: 2.44 ± 0.401
IleArg: 3.374 ± 0.52
IleSer: 5.295 ± 0.686
IleThr: 4.568 ± 0.471
IleVal: 5.555 ± 0.55
IleTrp: 0.727 ± 0.201
IleTyr: 3.634 ± 0.616
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.361 ± 0.691
LysCys: 0.363 ± 0.163
LysAsp: 3.738 ± 0.821
LysGlu: 6.333 ± 0.871
LysPhe: 2.128 ± 0.395
LysGly: 4.049 ± 0.703
LysHis: 1.142 ± 0.274
LysIle: 5.762 ± 0.782
LysLys: 6.645 ± 0.923
LysLeu: 7.527 ± 0.9
LysMet: 2.128 ± 0.369
LysAsn: 4.828 ± 0.609
LysPro: 3.374 ± 0.527
LysGln: 3.893 ± 0.702
LysArg: 3.167 ± 0.52
LysSer: 3.426 ± 0.375
LysThr: 5.036 ± 0.766
LysVal: 5.451 ± 0.902
LysTrp: 0.727 ± 0.188
LysTyr: 3.997 ± 0.512
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.683 ± 0.77
LeuCys: 0.467 ± 0.161
LeuAsp: 4.724 ± 0.633
LeuGlu: 4.88 ± 0.636
LeuPhe: 4.568 ± 0.819
LeuGly: 4.516 ± 0.504
LeuHis: 1.817 ± 0.38
LeuIle: 6.178 ± 0.837
LeuLys: 7.735 ± 0.867
LeuLeu: 9.76 ± 0.889
LeuMet: 2.18 ± 0.394
LeuAsn: 5.139 ± 0.477
LeuPro: 5.71 ± 0.437
LeuGln: 3.271 ± 0.57
LeuArg: 4.932 ± 0.906
LeuSer: 7.008 ± 0.745
LeuThr: 4.153 ± 0.487
LeuVal: 6.281 ± 0.717
LeuTrp: 0.883 ± 0.256
LeuTyr: 4.309 ± 0.684
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.402 ± 0.305
MetCys: 0.104 ± 0.086
MetAsp: 0.831 ± 0.239
MetGlu: 1.402 ± 0.28
MetPhe: 1.09 ± 0.261
MetGly: 0.883 ± 0.208
MetHis: 0.26 ± 0.104
MetIle: 1.402 ± 0.315
MetLys: 1.765 ± 0.363
MetLeu: 2.18 ± 0.39
MetMet: 0.363 ± 0.142
MetAsn: 1.298 ± 0.292
MetPro: 1.246 ± 0.263
MetGln: 1.194 ± 0.261
MetArg: 0.675 ± 0.253
MetSer: 1.557 ± 0.223
MetThr: 1.194 ± 0.282
MetVal: 1.713 ± 0.347
MetTrp: 0.104 ± 0.059
MetTyr: 0.727 ± 0.21
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.049 ± 0.704
AsnCys: 0.311 ± 0.139
AsnAsp: 2.025 ± 0.471
AsnGlu: 3.686 ± 0.768
AsnPhe: 1.609 ± 0.286
AsnGly: 3.634 ± 0.727
AsnHis: 0.467 ± 0.175
AsnIle: 4.309 ± 0.546
AsnLys: 3.374 ± 0.472
AsnLeu: 4.984 ± 0.55
AsnMet: 1.09 ± 0.204
AsnAsn: 3.686 ± 0.618
AsnPro: 3.115 ± 0.564
AsnGln: 1.869 ± 0.549
AsnArg: 1.246 ± 0.29
AsnSer: 3.53 ± 0.604
AsnThr: 3.893 ± 0.839
AsnVal: 4.465 ± 0.674
AsnTrp: 0.415 ± 0.136
AsnTyr: 1.713 ± 0.309
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.284 ± 0.386
ProCys: 0.052 ± 0.047
ProAsp: 2.077 ± 0.45
ProGlu: 2.907 ± 0.48
ProPhe: 2.699 ± 0.678
ProGly: 2.648 ± 0.419
ProHis: 0.363 ± 0.138
ProIle: 4.309 ± 0.819
ProLys: 3.426 ± 0.495
ProLeu: 3.79 ± 0.511
ProMet: 0.831 ± 0.268
ProAsn: 1.973 ± 0.303
ProPro: 5.866 ± 1.811
ProGln: 2.025 ± 0.362
ProArg: 1.609 ± 0.343
ProSer: 4.828 ± 0.862
ProThr: 4.516 ± 1.333
ProVal: 3.011 ± 0.462
ProTrp: 0.831 ± 0.26
ProTyr: 2.492 ± 0.582
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.077 ± 0.491
GlnCys: 0.156 ± 0.072
GlnAsp: 1.038 ± 0.264
GlnGlu: 3.426 ± 0.944
GlnPhe: 1.142 ± 0.232
GlnGly: 1.869 ± 0.452
GlnHis: 0.26 ± 0.121
GlnIle: 3.115 ± 0.392
GlnLys: 2.648 ± 0.42
GlnLeu: 4.88 ± 0.649
GlnMet: 0.934 ± 0.24
GlnAsn: 3.582 ± 0.56
GlnPro: 2.232 ± 0.398
GlnGln: 3.478 ± 0.773
GlnArg: 1.35 ± 0.326
GlnSer: 2.18 ± 0.429
GlnThr: 1.921 ± 0.43
GlnVal: 2.128 ± 0.344
GlnTrp: 0.052 ± 0.055
GlnTyr: 1.869 ± 0.36
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.817 ± 0.316
ArgCys: 0.052 ± 0.043
ArgAsp: 2.232 ± 0.5
ArgGlu: 2.699 ± 0.5
ArgPhe: 1.869 ± 0.339
ArgGly: 1.921 ± 0.379
ArgHis: 0.623 ± 0.19
ArgIle: 3.374 ± 0.577
ArgLys: 3.686 ± 0.587
ArgLeu: 4.049 ± 0.562
ArgMet: 0.519 ± 0.191
ArgAsn: 1.505 ± 0.364
ArgPro: 1.194 ± 0.33
ArgGln: 1.454 ± 0.24
ArgArg: 2.336 ± 0.544
ArgSer: 2.44 ± 0.317
ArgThr: 1.921 ± 0.494
ArgVal: 2.077 ± 0.381
ArgTrp: 0.363 ± 0.135
ArgTyr: 1.609 ± 0.289
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.997 ± 0.611
SerCys: 0.519 ± 0.205
SerAsp: 2.336 ± 0.358
SerGlu: 4.828 ± 1.475
SerPhe: 2.855 ± 0.461
SerGly: 4.62 ± 1.08
SerHis: 0.831 ± 0.316
SerIle: 4.932 ± 0.536
SerLys: 4.62 ± 0.525
SerLeu: 5.503 ± 0.592
SerMet: 1.973 ± 0.338
SerAsn: 3.063 ± 0.472
SerPro: 3.738 ± 0.578
SerGln: 3.53 ± 0.653
SerArg: 1.973 ± 0.398
SerSer: 6.022 ± 1.15
SerThr: 5.762 ± 1.093
SerVal: 4.568 ± 0.586
SerTrp: 0.519 ± 0.177
SerTyr: 2.492 ± 0.573
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.478 ± 0.502
ThrCys: 0.311 ± 0.123
ThrAsp: 2.751 ± 0.567
ThrGlu: 3.219 ± 0.45
ThrPhe: 3.219 ± 0.685
ThrGly: 2.803 ± 0.463
ThrHis: 1.194 ± 0.479
ThrIle: 3.842 ± 0.559
ThrLys: 3.53 ± 0.634
ThrLeu: 6.074 ± 0.636
ThrMet: 0.986 ± 0.242
ThrAsn: 3.426 ± 0.672
ThrPro: 5.399 ± 1.586
ThrGln: 2.336 ± 0.653
ThrArg: 1.973 ± 0.345
ThrSer: 4.568 ± 0.631
ThrThr: 4.672 ± 1.07
ThrVal: 4.568 ± 0.876
ThrTrp: 0.415 ± 0.122
ThrTyr: 2.232 ± 0.369
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.257 ± 0.476
ValCys: 0.26 ± 0.142
ValAsp: 3.063 ± 0.531
ValGlu: 4.205 ± 0.475
ValPhe: 3.063 ± 0.656
ValGly: 3.842 ± 0.596
ValHis: 0.571 ± 0.151
ValIle: 5.087 ± 0.517
ValLys: 5.866 ± 1.048
ValLeu: 6.749 ± 0.711
ValMet: 1.038 ± 0.247
ValAsn: 3.893 ± 0.567
ValPro: 2.907 ± 0.445
ValGln: 1.817 ± 0.33
ValArg: 2.44 ± 0.471
ValSer: 5.347 ± 0.674
ValThr: 4.361 ± 0.894
ValVal: 4.672 ± 0.587
ValTrp: 0.467 ± 0.168
ValTyr: 3.686 ± 0.721
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.467 ± 0.177
TrpCys: 0.26 ± 0.112
TrpAsp: 0.311 ± 0.146
TrpGlu: 0.675 ± 0.2
TrpPhe: 0.311 ± 0.114
TrpGly: 0.363 ± 0.136
TrpHis: 0.156 ± 0.096
TrpIle: 0.415 ± 0.142
TrpLys: 0.779 ± 0.239
TrpLeu: 1.038 ± 0.3
TrpMet: 0.26 ± 0.144
TrpAsn: 0.104 ± 0.073
TrpPro: 0.156 ± 0.097
TrpGln: 0.519 ± 0.179
TrpArg: 0.415 ± 0.221
TrpSer: 0.623 ± 0.218
TrpThr: 0.415 ± 0.171
TrpVal: 0.519 ± 0.17
TrpTrp: 0.208 ± 0.106
TrpTyr: 0.779 ± 0.222
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.219 ± 0.47
TyrCys: 0.467 ± 0.177
TyrAsp: 2.648 ± 0.381
TyrGlu: 2.959 ± 0.442
TyrPhe: 2.751 ± 0.449
TyrGly: 2.544 ± 0.334
TyrHis: 0.571 ± 0.217
TyrIle: 3.219 ± 0.582
TyrLys: 3.271 ± 0.503
TyrLeu: 4.153 ± 0.633
TyrMet: 1.194 ± 0.234
TyrAsn: 2.44 ± 0.371
TyrPro: 2.959 ± 0.617
TyrGln: 1.505 ± 0.287
TyrArg: 1.454 ± 0.398
TyrSer: 2.959 ± 0.516
TyrThr: 2.803 ± 0.582
TyrVal: 2.907 ± 0.676
TyrTrp: 0.675 ± 0.209
TyrTyr: 2.492 ± 0.471
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (19264 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
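A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the Proteome-pI implementation: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each round draws len(proteins) whole proteins with
    replacement; n_rounds=100 mirrors the "x100" in the note.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, one-letter codes).
toy = ["MKLLE", "MEEK", "LLLL"]
print(pair_per_mille(toy, "LL"), bootstrap_error(toy, "LL"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins in the proteome.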

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski