Amino acid dipeptide frequency for Acidianus filamentous virus 2 (isolate Italy/Pozzuoli) (AFV-2)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
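The counting described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the code used by the database: the function name `dipeptide_per_mille`, the toy sequences, and the choice to normalize by the total number of dipeptides counted are all hypothetical, and pairs containing non-standard letters are counted in the total but not reported.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues (one-letter codes)
# Uniform expectation: 1/400 = 0.25 % = 2.5 per mille
UNIFORM_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(proteins):
    """Return the frequency (in per mille) of each of the 400 standard dipeptides.

    Overlapping pairs are counted inside each protein; pairs never span
    two different proteins. Normalizing by the total number of pairs is
    an assumption of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    pairs = [a + b for a, b in product(AMINO_ACIDS, repeat=2)]
    if total == 0:
        return {pair: 0.0 for pair in pairs}
    return {pair: 1000.0 * counts[pair] / total for pair in pairs}


# Hypothetical toy input, not taken from the AFV-2 proteome.
toy_proteins = ["MKLLSS", "ALLV"]
freqs = dipeptide_per_mille(toy_proteins)

# Dipeptides above the uniform expectation (these would be the "red" cells).
over = sorted(pair for pair, value in freqs.items() if value > UNIFORM_PER_MILLE)
```

With such a tiny input every observed dipeptide is far above the uniform expectation; on a real proteome the comparison against 2.5‰ becomes more informative.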

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.
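As a small worked example of the unit conversion, the sketch below parses one table entry, converts it from per mille to a fraction, and compares it with the uniform 2.5‰ baseline. The helper names `parse_entry` and `describe` are hypothetical, and using the uniform baseline as the over/under threshold is an assumption based on the description above, not a confirmed detail of how the page colors its cells.

```python
import re

UNIFORM_PER_MILLE = 1000.0 / 400  # 0.25 % expressed in per mille (2.5)

# Matches e.g. "LeuLeu: 10.759 ± 1.363"; an optional leading copy of the
# value (as in "10.759LeuLeu: 10.759 ± 1.363") is tolerated and ignored.
ENTRY = re.compile(
    r"^(?:\d+(?:\.\d+)?)?([A-Z][a-z]{2})([A-Z][a-z]{2}): "
    r"(\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)$"
)


def parse_entry(line):
    """Split one entry into (first residue, second residue, value, error)."""
    match = ENTRY.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized entry: {line!r}")
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


def describe(line):
    """Convert per mille to a fraction and label the entry against 2.5 per mille."""
    first, second, value, error = parse_entry(line)
    if value > UNIFORM_PER_MILLE:
        label = "over"
    elif value < UNIFORM_PER_MILLE:
        label = "under"
    else:
        label = "as expected"
    return {
        "dipeptide": first + second,
        "fraction": value * 1e-3,        # per mille -> fraction
        "fraction_error": error * 1e-3,
        "label": label,
    }


leu_leu = describe("LeuLeu: 10.759 ± 1.363")
ala_cys = describe("AlaCys: 0.329 ± 0.201")
```

Here `leu_leu["fraction"]` is about 0.0108, i.e. roughly 1.1% of all dipeptides, which is several times the uniform expectation.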

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue; within each group the second residue follows this order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.329 ± 0.201
AlaAsp: 2.525 ± 0.528
AlaGlu: 1.757 ± 0.364
AlaPhe: 3.623 ± 0.818
AlaGly: 1.647 ± 0.434
AlaHis: 0.549 ± 0.316
AlaIle: 6.258 ± 0.908
AlaLys: 4.172 ± 0.966
AlaLeu: 5.709 ± 0.949
AlaMet: 0.988 ± 0.299
AlaAsn: 3.623 ± 0.6
AlaPro: 2.745 ± 0.505
AlaGln: 1.098 ± 0.362
AlaArg: 1.976 ± 0.345
AlaSer: 5.16 ± 1.087
AlaThr: 4.391 ± 0.689
AlaVal: 4.172 ± 0.571
AlaTrp: 0.659 ± 0.253
AlaTyr: 2.745 ± 0.722
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.439 ± 0.21
CysCys: 0.549 ± 0.516
CysAsp: 1.098 ± 0.345
CysGlu: 0.22 ± 0.152
CysPhe: 0.439 ± 0.23
CysGly: 0.659 ± 0.258
CysHis: 0.439 ± 0.213
CysIle: 0.768 ± 0.352
CysLys: 0.549 ± 0.305
CysLeu: 0.329 ± 0.168
CysMet: 0.439 ± 0.198
CysAsn: 0.549 ± 0.31
CysPro: 0.439 ± 0.219
CysGln: 0.22 ± 0.152
CysArg: 0.549 ± 0.29
CysSer: 0.768 ± 0.254
CysThr: 0.659 ± 0.333
CysVal: 0.439 ± 0.198
CysTrp: 0.11 ± 0.092
CysTyr: 0.439 ± 0.226
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.952 ± 0.638
AspCys: 0.11 ± 0.11
AspAsp: 4.611 ± 0.758
AspGlu: 3.293 ± 0.652
AspPhe: 1.208 ± 0.387
AspGly: 2.745 ± 0.454
AspHis: 0.439 ± 0.221
AspIle: 5.05 ± 0.741
AspLys: 4.062 ± 0.762
AspLeu: 3.184 ± 0.593
AspMet: 1.427 ± 0.395
AspAsn: 4.281 ± 0.672
AspPro: 0.878 ± 0.283
AspGln: 0.439 ± 0.201
AspArg: 0.878 ± 0.263
AspSer: 2.635 ± 0.58
AspThr: 3.074 ± 0.548
AspVal: 5.379 ± 0.851
AspTrp: 0.439 ± 0.214
AspTyr: 2.305 ± 0.453
AspXaa: 0.0 ± 0.0

Glu
GluAla: 1.427 ± 0.316
GluCys: 0.329 ± 0.202
GluAsp: 1.757 ± 0.364
GluGlu: 3.842 ± 1.303
GluPhe: 2.745 ± 0.593
GluGly: 1.208 ± 0.304
GluHis: 0.549 ± 0.217
GluIle: 5.599 ± 1.011
GluLys: 3.623 ± 0.672
GluLeu: 5.16 ± 0.866
GluMet: 1.647 ± 0.478
GluAsn: 2.086 ± 0.498
GluPro: 0.878 ± 0.343
GluGln: 1.208 ± 0.356
GluArg: 1.537 ± 0.393
GluSer: 2.854 ± 0.584
GluThr: 1.647 ± 0.391
GluVal: 4.281 ± 0.828
GluTrp: 0.549 ± 0.255
GluTyr: 3.733 ± 0.601
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.854 ± 0.684
PheCys: 0.878 ± 0.471
PheAsp: 2.305 ± 0.549
PheGlu: 2.086 ± 0.47
PhePhe: 1.757 ± 0.458
PheGly: 2.415 ± 0.451
PheHis: 1.647 ± 0.439
PheIle: 3.513 ± 0.644
PheLys: 2.305 ± 0.681
PheLeu: 2.745 ± 0.523
PheMet: 1.208 ± 0.346
PheAsn: 2.086 ± 0.405
PhePro: 1.537 ± 0.346
PheGln: 1.098 ± 0.335
PheArg: 1.537 ± 0.38
PheSer: 3.733 ± 0.599
PheThr: 4.721 ± 0.857
PheVal: 3.623 ± 0.742
PheTrp: 0.22 ± 0.172
PheTyr: 2.525 ± 0.554
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.305 ± 0.566
GlyCys: 0.22 ± 0.129
GlyAsp: 3.513 ± 0.661
GlyGlu: 3.293 ± 0.573
GlyPhe: 2.305 ± 0.465
GlyGly: 3.184 ± 0.599
GlyHis: 0.549 ± 0.22
GlyIle: 4.94 ± 0.664
GlyLys: 2.635 ± 0.555
GlyLeu: 2.525 ± 0.603
GlyMet: 0.878 ± 0.303
GlyAsn: 4.172 ± 0.743
GlyPro: 0.22 ± 0.152
GlyGln: 0.988 ± 0.321
GlyArg: 0.878 ± 0.277
GlySer: 3.623 ± 0.736
GlyThr: 3.184 ± 0.613
GlyVal: 4.391 ± 0.791
GlyTrp: 0.439 ± 0.181
GlyTyr: 3.293 ± 0.687
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.208 ± 0.364
HisCys: 0.329 ± 0.179
HisAsp: 0.549 ± 0.228
HisGlu: 0.659 ± 0.316
HisPhe: 0.878 ± 0.349
HisGly: 1.098 ± 0.342
HisHis: 1.208 ± 0.488
HisIle: 1.427 ± 0.324
HisLys: 0.988 ± 0.386
HisLeu: 1.208 ± 0.417
HisMet: 0.22 ± 0.152
HisAsn: 1.098 ± 0.35
HisPro: 0.329 ± 0.192
HisGln: 0.22 ± 0.16
HisArg: 0.329 ± 0.177
HisSer: 0.878 ± 0.318
HisThr: 0.878 ± 0.301
HisVal: 0.768 ± 0.274
HisTrp: 0.0 ± 0.0
HisTyr: 1.317 ± 0.537
HisXaa: 0.0 ± 0.0

Ile
IleAla: 7.026 ± 0.767
IleCys: 1.427 ± 0.426
IleAsp: 5.05 ± 0.755
IleGlu: 4.611 ± 0.756
IlePhe: 4.172 ± 0.735
IleGly: 3.842 ± 0.56
IleHis: 1.647 ± 0.521
IleIle: 6.916 ± 1.09
IleLys: 3.623 ± 0.666
IleLeu: 4.391 ± 0.735
IleMet: 1.866 ± 0.536
IleAsn: 5.16 ± 0.739
IlePro: 4.501 ± 0.617
IleGln: 2.305 ± 0.528
IleArg: 2.745 ± 0.554
IleSer: 6.587 ± 0.833
IleThr: 6.587 ± 1.03
IleVal: 8.124 ± 0.905
IleTrp: 1.317 ± 0.323
IleTyr: 3.403 ± 0.471
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.305 ± 0.438
LysCys: 0.549 ± 0.316
LysAsp: 1.976 ± 0.467
LysGlu: 3.733 ± 0.768
LysPhe: 2.854 ± 0.67
LysGly: 2.964 ± 0.64
LysHis: 0.659 ± 0.258
LysIle: 4.83 ± 0.775
LysLys: 5.928 ± 1.098
LysLeu: 4.94 ± 0.716
LysMet: 2.415 ± 0.57
LysAsn: 3.733 ± 0.596
LysPro: 2.415 ± 0.629
LysGln: 1.427 ± 0.402
LysArg: 2.196 ± 0.41
LysSer: 4.172 ± 0.742
LysThr: 2.305 ± 0.478
LysVal: 4.611 ± 0.677
LysTrp: 0.439 ± 0.189
LysTyr: 4.501 ± 0.951
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 3.623 ± 0.701
LeuCys: 1.647 ± 0.412
LeuAsp: 3.184 ± 0.617
LeuGlu: 3.074 ± 0.577
LeuPhe: 3.184 ± 0.593
LeuGly: 3.074 ± 0.706
LeuHis: 1.317 ± 0.335
LeuIle: 5.16 ± 0.81
LeuLys: 5.379 ± 0.928
LeuLeu: 10.759 ± 1.363
LeuMet: 2.305 ± 0.565
LeuAsn: 3.403 ± 0.571
LeuPro: 4.611 ± 0.639
LeuGln: 2.964 ± 0.484
LeuArg: 3.184 ± 0.617
LeuSer: 9.99 ± 1.262
LeuThr: 5.709 ± 0.678
LeuVal: 5.928 ± 0.73
LeuTrp: 0.768 ± 0.285
LeuTyr: 4.391 ± 0.73
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.208 ± 0.493
MetCys: 0.11 ± 0.104
MetAsp: 0.329 ± 0.195
MetGlu: 0.22 ± 0.152
MetPhe: 0.878 ± 0.324
MetGly: 0.988 ± 0.379
MetHis: 0.22 ± 0.149
MetIle: 1.976 ± 0.456
MetLys: 2.635 ± 0.575
MetLeu: 2.086 ± 0.571
MetMet: 1.098 ± 0.394
MetAsn: 1.427 ± 0.406
MetPro: 1.427 ± 0.447
MetGln: 0.878 ± 0.333
MetArg: 1.866 ± 0.513
MetSer: 2.964 ± 0.543
MetThr: 2.745 ± 0.539
MetVal: 1.317 ± 0.316
MetTrp: 0.329 ± 0.192
MetTyr: 1.317 ± 0.41
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 5.27 ± 0.828
AsnCys: 0.988 ± 0.375
AsnAsp: 4.172 ± 0.606
AsnGlu: 4.062 ± 0.647
AsnPhe: 2.745 ± 0.435
AsnGly: 3.074 ± 0.607
AsnHis: 0.878 ± 0.364
AsnIle: 4.501 ± 0.711
AsnLys: 3.403 ± 0.592
AsnLeu: 4.721 ± 0.624
AsnMet: 1.098 ± 0.352
AsnAsn: 3.952 ± 0.689
AsnPro: 1.866 ± 0.442
AsnGln: 1.317 ± 0.313
AsnArg: 0.878 ± 0.379
AsnSer: 3.513 ± 0.752
AsnThr: 4.83 ± 0.994
AsnVal: 6.806 ± 0.898
AsnTrp: 1.208 ± 0.332
AsnTyr: 3.513 ± 0.591
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.196 ± 0.443
ProCys: 0.22 ± 0.143
ProAsp: 1.208 ± 0.363
ProGlu: 1.427 ± 0.42
ProPhe: 1.757 ± 0.359
ProGly: 0.768 ± 0.303
ProHis: 0.878 ± 0.416
ProIle: 4.501 ± 0.613
ProLys: 1.317 ± 0.48
ProLeu: 3.952 ± 0.626
ProMet: 0.988 ± 0.331
ProAsn: 2.196 ± 0.478
ProPro: 3.184 ± 0.642
ProGln: 1.208 ± 0.292
ProArg: 1.537 ± 0.39
ProSer: 3.733 ± 0.548
ProThr: 3.403 ± 0.714
ProVal: 3.952 ± 0.672
ProTrp: 0.11 ± 0.101
ProTyr: 2.086 ± 0.522
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 0.768 ± 0.273
GlnCys: 0.768 ± 0.319
GlnAsp: 0.329 ± 0.178
GlnGlu: 1.317 ± 0.351
GlnPhe: 1.098 ± 0.299
GlnGly: 1.317 ± 0.405
GlnHis: 0.22 ± 0.254
GlnIle: 2.854 ± 0.6
GlnLys: 0.988 ± 0.334
GlnLeu: 2.635 ± 0.592
GlnMet: 0.22 ± 0.178
GlnAsn: 2.964 ± 0.529
GlnPro: 0.878 ± 0.52
GlnGln: 2.196 ± 0.776
GlnArg: 0.988 ± 0.345
GlnSer: 3.403 ± 0.544
GlnThr: 0.878 ± 0.268
GlnVal: 2.415 ± 0.611
GlnTrp: 0.768 ± 0.263
GlnTyr: 2.415 ± 0.556
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.537 ± 0.482
ArgCys: 0.0 ± 0.0
ArgAsp: 1.537 ± 0.43
ArgGlu: 2.415 ± 0.476
ArgPhe: 1.647 ± 0.416
ArgGly: 1.537 ± 0.512
ArgHis: 0.659 ± 0.26
ArgIle: 2.415 ± 0.544
ArgLys: 3.403 ± 0.662
ArgLeu: 2.415 ± 0.421
ArgMet: 0.768 ± 0.284
ArgAsn: 1.976 ± 0.504
ArgPro: 0.549 ± 0.24
ArgGln: 1.537 ± 0.391
ArgArg: 1.537 ± 0.451
ArgSer: 1.647 ± 0.549
ArgThr: 1.537 ± 0.437
ArgVal: 2.086 ± 0.509
ArgTrp: 0.22 ± 0.178
ArgTyr: 1.757 ± 0.547
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.501 ± 0.654
SerCys: 0.22 ± 0.146
SerAsp: 4.281 ± 0.657
SerGlu: 3.293 ± 0.692
SerPhe: 4.501 ± 0.799
SerGly: 5.489 ± 0.744
SerHis: 0.988 ± 0.257
SerIle: 5.599 ± 0.712
SerLys: 4.281 ± 0.828
SerLeu: 9.112 ± 1.052
SerMet: 3.403 ± 0.548
SerAsn: 5.379 ± 0.775
SerPro: 4.391 ± 0.681
SerGln: 3.513 ± 0.794
SerArg: 1.976 ± 0.438
SerSer: 11.198 ± 1.497
SerThr: 6.148 ± 0.926
SerVal: 6.587 ± 0.783
SerTrp: 0.329 ± 0.179
SerTyr: 3.842 ± 0.629
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.391 ± 0.789
ThrCys: 0.11 ± 0.104
ThrAsp: 3.074 ± 0.591
ThrGlu: 1.757 ± 0.414
ThrPhe: 3.184 ± 0.587
ThrGly: 4.611 ± 0.786
ThrHis: 0.659 ± 0.265
ThrIle: 7.246 ± 1.039
ThrLys: 2.415 ± 0.514
ThrLeu: 6.477 ± 0.935
ThrMet: 1.976 ± 0.443
ThrAsn: 3.293 ± 0.59
ThrPro: 4.501 ± 0.527
ThrGln: 2.086 ± 0.443
ThrArg: 2.196 ± 0.464
ThrSer: 7.904 ± 1.197
ThrThr: 9.002 ± 1.342
ThrVal: 5.818 ± 0.831
ThrTrp: 0.768 ± 0.312
ThrTyr: 2.854 ± 0.513
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.16 ± 0.808
ValCys: 0.659 ± 0.256
ValAsp: 5.27 ± 0.791
ValGlu: 2.964 ± 0.422
ValPhe: 3.184 ± 0.589
ValGly: 3.842 ± 0.584
ValHis: 1.098 ± 0.392
ValIle: 5.928 ± 0.819
ValLys: 4.501 ± 0.836
ValLeu: 6.587 ± 0.982
ValMet: 1.647 ± 0.477
ValAsn: 6.697 ± 0.797
ValPro: 3.842 ± 0.531
ValGln: 2.745 ± 0.514
ValArg: 2.086 ± 0.477
ValSer: 8.453 ± 0.719
ValThr: 8.124 ± 1.106
ValVal: 8.343 ± 1.023
ValTrp: 0.22 ± 0.14
ValTyr: 4.172 ± 0.487
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.549 ± 0.238
TrpCys: 0.329 ± 0.185
TrpAsp: 0.22 ± 0.152
TrpGlu: 0.22 ± 0.148
TrpPhe: 0.22 ± 0.148
TrpGly: 0.659 ± 0.222
TrpHis: 0.329 ± 0.169
TrpIle: 0.659 ± 0.253
TrpLys: 0.439 ± 0.207
TrpLeu: 1.098 ± 0.36
TrpMet: 0.22 ± 0.157
TrpAsn: 0.768 ± 0.285
TrpPro: 0.0 ± 0.0
TrpGln: 0.22 ± 0.167
TrpArg: 0.659 ± 0.216
TrpSer: 0.988 ± 0.276
TrpThr: 0.439 ± 0.209
TrpVal: 0.549 ± 0.241
TrpTrp: 0.11 ± 0.095
TrpTyr: 0.768 ± 0.309
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.403 ± 0.682
TyrCys: 0.329 ± 0.195
TyrAsp: 3.403 ± 0.587
TyrGlu: 2.415 ± 0.416
TyrPhe: 2.415 ± 0.418
TyrGly: 2.415 ± 0.581
TyrHis: 0.659 ± 0.331
TyrIle: 5.16 ± 0.677
TyrLys: 2.305 ± 0.504
TyrLeu: 3.623 ± 0.615
TyrMet: 1.098 ± 0.36
TyrAsn: 3.952 ± 0.662
TyrPro: 1.427 ± 0.423
TyrGln: 1.757 ± 0.347
TyrArg: 1.647 ± 0.569
TyrSer: 4.94 ± 1.122
TyrThr: 4.062 ± 0.857
TyrVal: 5.928 ± 0.857
TyrTrp: 0.439 ± 0.211
TyrTyr: 3.184 ± 0.669
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 52 proteins (9110 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski