Amino acid dipeptide frequency for Acidianus rod-shaped virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
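The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_permille`, the toy sequences, the choice to count overlapping pairs within each protein only, and the normalization by the total number of pairs are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); any other symbol is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumptions of this sketch: overlapping pairs are counted inside each
    protein (never across protein boundaries) and frequencies are normalized
    by the total number of pairs observed.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


toy_proteins = ["MKLLA", "LLKM"]          # made-up sequences, not from this proteome
freqs = dipeptide_permille(toy_proteins)
uniform_permille = 1000.0 / 400           # 2.5 per mille for 400 equally likely dipeptides
print(len(freqs), round(freqs["LL"], 3), uniform_permille)
```

Comparing each value with `uniform_permille` gives a simple over/under-representation flag, comparable in spirit to the red/blue marking described above.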

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
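As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical; the LeuLeu value (9.024‰) is taken from the table.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # expectation if all 400 dipeptides were equally likely


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_vs_uniform(value_permille):
    """Ratio of an observed frequency to the uniform expectation (both in per mille)."""
    return value_permille / UNIFORM_PERMILLE


leu_leu = 9.024  # LeuLeu frequency in per mille, from the table
print(permille_to_fraction(leu_leu))  # fraction of all dipeptides
print(fold_vs_uniform(leu_leu))       # values above 1 mean overrepresented
```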

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is the frequency ± error in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 2.082 ± 0.743 | 0.278 ± 0.181 | 4.304 ± 0.897 | 3.054 ± 0.849 | 2.777 ± 0.569 | 2.777 ± 0.567 | 0.972 ± 0.384 | 7.08 ± 1.096 | 5.414 ± 0.875 | 6.803 ± 0.883 | 1.111 ± 0.44 | 3.193 ± 0.675 | 2.221 ± 0.547 | 3.193 ± 0.761 | 1.805 ± 0.4 | 4.026 ± 0.8 | 5.276 ± 0.802 | 3.887 ± 0.728 | 1.111 ± 0.482 | 3.193 ± 0.535 | 0.0 ± 0.0
Cys | 0.555 ± 0.274 | 0.0 ± 0.0 | 0.694 ± 0.421 | 1.388 ± 0.524 | 0.555 ± 0.254 | 1.388 ± 0.544 | 0.139 ± 0.135 | 1.249 ± 0.527 | 0.139 ± 0.151 | 1.111 ± 0.461 | 0.278 ± 0.173 | 0.694 ± 0.292 | 0.139 ± 0.144 | 0.416 ± 0.215 | 0.278 ± 0.237 | 0.972 ± 0.373 | 0.0 ± 0.0 | 0.416 ± 0.261 | 0.416 ± 0.241 | 0.278 ± 0.216 | 0.0 ± 0.0
Asp | 4.859 ± 0.674 | 0.833 ± 0.463 | 2.915 ± 0.864 | 3.748 ± 0.641 | 3.332 ± 0.741 | 2.36 ± 0.607 | 0.278 ± 0.173 | 6.109 ± 1.204 | 3.054 ± 0.676 | 4.581 ± 0.583 | 0.416 ± 0.253 | 2.638 ± 0.588 | 2.221 ± 0.476 | 2.36 ± 0.572 | 1.527 ± 0.479 | 2.082 ± 0.38 | 2.36 ± 0.62 | 4.026 ± 0.633 | 0.972 ± 0.385 | 1.944 ± 0.632 | 0.0 ± 0.0
Glu | 1.388 ± 0.409 | 0.555 ± 0.298 | 3.193 ± 0.78 | 3.193 ± 1.112 | 2.36 ± 0.519 | 2.221 ± 0.656 | 0.833 ± 0.36 | 6.109 ± 1.107 | 5.692 ± 1.11 | 4.581 ± 1.026 | 1.666 ± 0.566 | 3.471 ± 0.675 | 1.388 ± 0.527 | 2.221 ± 0.625 | 1.527 ± 0.489 | 2.499 ± 0.511 | 3.61 ± 0.692 | 2.777 ± 0.577 | 0.416 ± 0.284 | 3.332 ± 0.875 | 0.0 ± 0.0
Phe | 3.054 ± 0.837 | 1.666 ± 0.67 | 1.805 ± 0.415 | 2.777 ± 0.511 | 3.332 ± 0.979 | 3.471 ± 0.589 | 0.416 ± 0.238 | 3.748 ± 0.779 | 3.61 ± 0.628 | 4.859 ± 0.773 | 1.805 ± 0.439 | 2.638 ± 0.655 | 1.944 ± 0.512 | 2.082 ± 0.399 | 2.499 ± 0.648 | 4.443 ± 1.106 | 4.581 ± 0.781 | 2.915 ± 0.67 | 0.0 ± 0.0 | 2.36 ± 0.627 | 0.0 ± 0.0
Gly | 3.332 ± 0.684 | 0.555 ± 0.262 | 2.36 ± 0.513 | 3.054 ± 0.809 | 2.638 ± 0.58 | 1.666 ± 0.377 | 1.527 ± 0.544 | 3.748 ± 0.56 | 2.499 ± 0.56 | 4.304 ± 0.741 | 1.111 ± 0.402 | 2.36 ± 0.609 | 1.666 ± 0.42 | 1.527 ± 0.468 | 1.249 ± 0.443 | 3.471 ± 0.725 | 2.638 ± 0.655 | 4.443 ± 0.751 | 0.555 ± 0.233 | 2.915 ± 0.697 | 0.0 ± 0.0
His | 0.555 ± 0.307 | 0.139 ± 0.132 | 0.694 ± 0.309 | 0.694 ± 0.292 | 0.833 ± 0.5 | 1.249 ± 0.377 | 0.278 ± 0.192 | 2.082 ± 0.726 | 0.833 ± 0.261 | 1.805 ± 0.534 | 0.416 ± 0.223 | 0.694 ± 0.284 | 0.555 ± 0.393 | 0.833 ± 0.384 | 0.972 ± 0.433 | 0.972 ± 0.322 | 0.278 ± 0.227 | 1.249 ± 0.316 | 0.139 ± 0.133 | 0.833 ± 0.247 | 0.0 ± 0.0
Ile | 6.247 ± 1.291 | 0.972 ± 0.395 | 4.859 ± 0.889 | 3.471 ± 0.878 | 3.193 ± 0.611 | 3.887 ± 0.563 | 2.221 ± 0.629 | 5.276 ± 0.935 | 4.859 ± 1.014 | 5.414 ± 0.899 | 0.972 ± 0.388 | 4.998 ± 0.911 | 4.998 ± 0.741 | 3.61 ± 0.687 | 2.499 ± 0.587 | 8.191 ± 0.916 | 4.72 ± 0.887 | 5.414 ± 0.812 | 0.972 ± 0.312 | 4.026 ± 0.679 | 0.0 ± 0.0
Lys | 3.332 ± 0.661 | 0.416 ± 0.246 | 2.36 ± 0.495 | 3.61 ± 1.138 | 1.666 ± 0.538 | 3.193 ± 0.462 | 0.555 ± 0.244 | 6.386 ± 1.208 | 4.443 ± 0.863 | 6.803 ± 1.086 | 1.805 ± 0.441 | 2.638 ± 0.658 | 2.082 ± 0.414 | 2.082 ± 0.612 | 2.36 ± 0.824 | 4.72 ± 0.929 | 3.61 ± 0.557 | 4.443 ± 0.796 | 0.555 ± 0.255 | 4.026 ± 0.576 | 0.0 ± 0.0
Leu | 4.998 ± 0.93 | 1.249 ± 0.435 | 4.998 ± 0.971 | 4.72 ± 0.784 | 5.414 ± 0.744 | 4.581 ± 1.225 | 1.111 ± 0.336 | 6.386 ± 1.11 | 6.386 ± 0.942 | 9.024 ± 1.447 | 2.638 ± 0.535 | 4.72 ± 0.564 | 3.887 ± 0.553 | 5.414 ± 1.082 | 3.61 ± 0.937 | 8.33 ± 1.208 | 5.137 ± 1.135 | 4.026 ± 0.733 | 0.139 ± 0.118 | 5.97 ± 1.06 | 0.0 ± 0.0
Met | 1.527 ± 0.436 | 0.416 ± 0.209 | 0.972 ± 0.42 | 1.249 ± 0.45 | 1.111 ± 0.278 | 1.666 ± 0.476 | 0.278 ± 0.205 | 0.833 ± 0.38 | 1.249 ± 0.326 | 2.36 ± 0.537 | 0.833 ± 0.319 | 0.833 ± 0.298 | 1.388 ± 0.545 | 0.694 ± 0.3 | 0.555 ± 0.246 | 2.221 ± 0.545 | 1.111 ± 0.339 | 1.805 ± 0.487 | 0.278 ± 0.212 | 1.111 ± 0.286 | 0.0 ± 0.0
Asn | 4.026 ± 0.564 | 1.111 ± 0.459 | 2.915 ± 0.521 | 2.36 ± 0.57 | 4.443 ± 0.511 | 3.61 ± 1.124 | 0.416 ± 0.252 | 5.831 ± 0.591 | 1.527 ± 0.504 | 5.276 ± 1.146 | 0.833 ± 0.32 | 4.165 ± 0.909 | 2.082 ± 0.593 | 1.527 ± 0.413 | 2.36 ± 0.673 | 3.332 ± 0.717 | 3.193 ± 0.765 | 5.137 ± 0.95 | 0.555 ± 0.312 | 2.082 ± 0.661 | 0.0 ± 0.0
Pro | 2.915 ± 0.629 | 0.278 ± 0.215 | 3.054 ± 0.51 | 2.36 ± 0.565 | 1.805 ± 0.597 | 1.111 ± 0.43 | 0.416 ± 0.231 | 2.638 ± 0.617 | 1.944 ± 0.588 | 4.304 ± 0.548 | 0.694 ± 0.469 | 2.221 ± 0.608 | 2.915 ± 0.671 | 1.388 ± 0.312 | 1.666 ± 0.443 | 3.748 ± 0.84 | 4.443 ± 0.971 | 3.054 ± 0.578 | 0.139 ± 0.159 | 1.666 ± 0.431 | 0.0 ± 0.0
Gln | 2.36 ± 0.527 | 0.139 ± 0.135 | 0.833 ± 0.297 | 2.082 ± 0.615 | 1.805 ± 0.472 | 1.111 ± 0.317 | 1.249 ± 0.285 | 4.026 ± 0.631 | 3.471 ± 0.655 | 5.414 ± 1.22 | 0.833 ± 0.283 | 3.887 ± 0.681 | 1.388 ± 0.358 | 3.61 ± 0.908 | 1.666 ± 0.292 | 3.332 ± 0.521 | 3.471 ± 0.653 | 2.777 ± 0.712 | 1.111 ± 0.335 | 2.638 ± 0.549 | 0.0 ± 0.0
Arg | 2.221 ± 0.428 | 0.416 ± 0.245 | 2.082 ± 0.554 | 1.527 ± 0.506 | 2.638 ± 0.53 | 0.833 ± 0.349 | 1.111 ± 0.429 | 2.777 ± 0.723 | 2.915 ± 0.68 | 2.915 ± 0.592 | 0.555 ± 0.245 | 2.499 ± 0.549 | 0.833 ± 0.261 | 1.666 ± 0.467 | 1.527 ± 0.566 | 2.499 ± 0.496 | 1.111 ± 0.307 | 2.082 ± 0.62 | 0.139 ± 0.159 | 2.36 ± 0.563 | 0.0 ± 0.0
Ser | 4.859 ± 1.021 | 0.278 ± 0.173 | 4.72 ± 0.876 | 3.887 ± 0.844 | 3.887 ± 0.863 | 4.72 ± 0.879 | 1.527 ± 0.507 | 4.72 ± 0.864 | 2.915 ± 0.628 | 6.942 ± 1.533 | 2.221 ± 0.508 | 4.165 ± 0.667 | 3.471 ± 0.6 | 5.553 ± 0.959 | 2.499 ± 0.763 | 6.247 ± 1.908 | 4.026 ± 0.669 | 4.443 ± 0.876 | 0.833 ± 0.443 | 2.915 ± 0.609 | 0.0 ± 0.0
Thr | 5.414 ± 0.813 | 0.278 ± 0.197 | 3.193 ± 0.637 | 3.471 ± 0.717 | 4.026 ± 0.551 | 2.638 ± 0.537 | 0.139 ± 0.124 | 4.304 ± 0.902 | 2.777 ± 0.57 | 4.998 ± 0.839 | 1.388 ± 0.526 | 2.915 ± 0.703 | 4.165 ± 0.734 | 2.777 ± 0.616 | 1.805 ± 0.681 | 2.777 ± 0.57 | 3.471 ± 0.88 | 4.581 ± 0.769 | 0.416 ± 0.213 | 3.054 ± 0.716 | 0.0 ± 0.0
Val | 4.998 ± 0.962 | 0.694 ± 0.317 | 2.777 ± 0.475 | 4.026 ± 0.765 | 4.72 ± 0.971 | 2.36 ± 0.497 | 1.249 ± 0.426 | 3.471 ± 0.767 | 4.026 ± 0.855 | 5.831 ± 0.697 | 0.972 ± 0.384 | 3.61 ± 0.656 | 2.777 ± 0.617 | 2.499 ± 0.576 | 2.221 ± 0.499 | 6.803 ± 1.051 | 2.915 ± 0.675 | 4.72 ± 0.788 | 0.278 ± 0.248 | 5.276 ± 1.03 | 0.0 ± 0.0
Trp | 0.833 ± 0.404 | 0.278 ± 0.188 | 0.555 ± 0.304 | 0.555 ± 0.221 | 0.416 ± 0.17 | 0.0 ± 0.0 | 0.139 ± 0.159 | 0.555 ± 0.228 | 1.249 ± 0.362 | 0.278 ± 0.231 | 0.139 ± 0.168 | 1.111 ± 0.387 | 0.0 ± 0.0 | 0.555 ± 0.262 | 0.139 ± 0.16 | 0.694 ± 0.281 | 0.278 ± 0.19 | 0.833 ± 0.331 | 0.0 ± 0.0 | 0.694 ± 0.298 | 0.0 ± 0.0
Tyr | 4.998 ± 1.084 | 0.555 ± 0.319 | 3.332 ± 0.827 | 1.805 ± 0.42 | 3.054 ± 0.532 | 2.915 ± 0.524 | 1.249 ± 0.402 | 3.471 ± 0.616 | 2.082 ± 0.446 | 5.137 ± 0.977 | 1.666 ± 0.392 | 3.332 ± 0.744 | 2.638 ± 0.68 | 3.193 ± 0.59 | 1.944 ± 0.605 | 3.471 ± 0.82 | 2.36 ± 0.543 | 3.193 ± 0.666 | 0.278 ± 0.215 | 2.777 ± 0.564 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 38 proteins (7204 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
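A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported as the error. The function names, the toy sequences, and the use of the standard deviation as the error measure are assumptions of this example; the source does not state which statistic is reported.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Each replicate draws len(proteins) whole proteins with replacement,
    so residues from the same protein always stay together.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    return statistics.stdev(estimates)


toy_proteins = ["MKLLA", "LLKM", "MAKLL", "KKLM"]  # made-up sequences
print(pair_permille(toy_proteins, "LL"), bootstrap_error(toy_proteins, "LL"))
```

Resampling proteins rather than individual residues keeps each sequence intact, which is what "at the protein level" suggests.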

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski