Amino acid dipeptide frequency for Upolu virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
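As an illustration of how such frequencies can be computed, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter alphabet, the choice to count overlapping pairs only within each protein, and the toy sequences are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption of this sketch: pairs are counted within each protein only,
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Normalize: upper-case and replace non-standard residues with X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Sliding window of width 2: positions (0, 1), (1, 2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 21 x 21 = 441 pairs so that absent dipeptides appear as 0.0.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input, not the Upolu virus proteome.
toy = ["MAAKL", "AAKLL"]
freqs = dipeptide_per_mille(toy)
expected = 1000.0 / 400  # uniform expectation over the 20 x 20 standard pairs
print(len(freqs), expected, freqs["AA"])
```

Returning all 441 keys, including those with zero counts, mirrors the layout of the table, where dipeptides that never occur are listed as 0.0.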

All values are given in per mille (‰); to convert them to fractions, multiply by 10⁻³.
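The unit conversion and the comparison with the uniform expectation can be sketched as follows. The four values are copied from the table on this page; the helper names `to_fraction` and `classify` are hypothetical, and using exactly 2.5‰ (1/400) as the threshold for over- and underrepresentation is an assumption, since the page does not state how the colors are assigned.

```python
# Values copied from the table on this page (per mille).
table_per_mille = {
    "AlaAla": 4.434,
    "LeuLeu": 7.981,
    "CysCys": 0.296,
    "TrpTrp": 0.0,
}

# Uniform expectation for 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def to_fraction(per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3


def classify(per_mille, expected=EXPECTED_PER_MILLE):
    """Label a value relative to the uniform expectation (assumed threshold)."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "as expected"


for name, value in table_per_mille.items():
    print(name, to_fraction(value), classify(value))
```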

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.434 ± 1.415
AlaCys: 2.069 ± 0.888
AlaAsp: 2.069 ± 0.329
AlaGlu: 4.434 ± 0.954
AlaPhe: 2.365 ± 0.664
AlaGly: 2.66 ± 0.883
AlaHis: 0.591 ± 0.352
AlaIle: 2.956 ± 0.703
AlaLys: 4.138 ± 1.005
AlaLeu: 6.208 ± 2.138
AlaMet: 1.478 ± 0.745
AlaAsn: 2.365 ± 0.484
AlaPro: 3.843 ± 0.629
AlaGln: 2.365 ± 0.715
AlaArg: 2.365 ± 0.917
AlaSer: 2.956 ± 1.267
AlaThr: 5.616 ± 0.942
AlaVal: 2.365 ± 0.847
AlaTrp: 0.591 ± 0.322
AlaTyr: 2.069 ± 0.738
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.591 ± 0.322
CysCys: 0.296 ± 0.28
CysAsp: 0.887 ± 0.554
CysGlu: 0.296 ± 0.28
CysPhe: 0.887 ± 0.535
CysGly: 0.887 ± 0.348
CysHis: 0.0 ± 0.0
CysIle: 2.069 ± 0.597
CysLys: 1.478 ± 0.468
CysLeu: 1.182 ± 0.229
CysMet: 0.591 ± 0.366
CysAsn: 1.182 ± 0.774
CysPro: 0.887 ± 0.444
CysGln: 0.887 ± 0.573
CysArg: 1.182 ± 0.242
CysSer: 1.478 ± 0.702
CysThr: 1.478 ± 0.613
CysVal: 0.887 ± 0.551
CysTrp: 0.591 ± 0.367
CysTyr: 2.365 ± 0.893
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.66 ± 0.596
AspCys: 1.182 ± 0.506
AspAsp: 2.365 ± 0.966
AspGlu: 3.843 ± 0.475
AspPhe: 1.478 ± 0.398
AspGly: 1.478 ± 0.495
AspHis: 2.365 ± 1.139
AspIle: 3.252 ± 0.67
AspLys: 1.182 ± 0.506
AspLeu: 6.208 ± 0.524
AspMet: 1.182 ± 0.339
AspAsn: 2.069 ± 0.811
AspPro: 2.956 ± 0.795
AspGln: 2.069 ± 0.771
AspArg: 2.66 ± 1.096
AspSer: 4.73 ± 0.847
AspThr: 3.547 ± 0.934
AspVal: 1.478 ± 0.48
AspTrp: 1.182 ± 0.857
AspTyr: 1.182 ± 0.411
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.547 ± 1.125
GluCys: 1.774 ± 0.339
GluAsp: 4.434 ± 1.095
GluGlu: 6.208 ± 0.54
GluPhe: 3.843 ± 0.688
GluGly: 4.73 ± 0.606
GluHis: 1.774 ± 0.5
GluIle: 4.434 ± 0.648
GluLys: 4.138 ± 1.153
GluLeu: 8.868 ± 1.259
GluMet: 0.296 ± 0.235
GluAsn: 2.956 ± 1.5
GluPro: 1.182 ± 0.65
GluGln: 1.478 ± 0.336
GluArg: 3.547 ± 0.517
GluSer: 4.138 ± 0.718
GluThr: 6.208 ± 1.02
GluVal: 5.025 ± 0.708
GluTrp: 2.069 ± 0.838
GluTyr: 1.478 ± 0.611
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.182 ± 0.48
PheCys: 1.182 ± 0.48
PheAsp: 2.069 ± 0.299
PheGlu: 1.478 ± 0.575
PhePhe: 3.252 ± 1.03
PheGly: 1.182 ± 0.575
PheHis: 1.478 ± 0.779
PheIle: 1.478 ± 0.475
PheLys: 4.434 ± 0.541
PheLeu: 5.616 ± 0.552
PheMet: 0.591 ± 0.543
PheAsn: 2.365 ± 1.333
PhePro: 1.182 ± 0.339
PheGln: 1.478 ± 0.611
PheArg: 0.887 ± 0.369
PheSer: 3.547 ± 0.981
PheThr: 2.069 ± 0.286
PheVal: 3.547 ± 0.626
PheTrp: 0.0 ± 0.0
PheTyr: 1.182 ± 0.577
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.365 ± 0.57
GlyCys: 0.887 ± 0.344
GlyAsp: 0.887 ± 0.577
GlyGlu: 6.503 ± 1.436
GlyPhe: 1.774 ± 0.472
GlyGly: 4.138 ± 1.148
GlyHis: 1.182 ± 0.634
GlyIle: 2.956 ± 1.016
GlyLys: 2.365 ± 0.837
GlyLeu: 5.321 ± 0.638
GlyMet: 1.478 ± 0.453
GlyAsn: 2.365 ± 0.638
GlyPro: 2.956 ± 1.035
GlyGln: 2.66 ± 0.648
GlyArg: 3.252 ± 0.731
GlySer: 3.843 ± 1.124
GlyThr: 2.365 ± 0.627
GlyVal: 4.434 ± 0.907
GlyTrp: 1.182 ± 0.671
GlyTyr: 2.365 ± 1.118
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.774 ± 0.508
HisCys: 0.296 ± 0.281
HisAsp: 0.591 ± 0.543
HisGlu: 1.182 ± 0.938
HisPhe: 1.478 ± 0.589
HisGly: 0.887 ± 0.348
HisHis: 0.887 ± 0.236
HisIle: 0.591 ± 0.288
HisLys: 1.182 ± 0.526
HisLeu: 4.138 ± 1.368
HisMet: 1.478 ± 1.127
HisAsn: 0.887 ± 0.588
HisPro: 0.591 ± 0.322
HisGln: 1.478 ± 0.547
HisArg: 1.774 ± 0.556
HisSer: 1.478 ± 0.625
HisThr: 1.774 ± 0.698
HisVal: 1.478 ± 0.584
HisTrp: 0.0 ± 0.0
HisTyr: 1.182 ± 0.525
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.434 ± 0.507
IleCys: 1.182 ± 0.525
IleAsp: 3.843 ± 1.149
IleGlu: 3.547 ± 0.68
IlePhe: 2.069 ± 0.434
IleGly: 3.547 ± 0.391
IleHis: 2.069 ± 0.651
IleIle: 3.547 ± 0.837
IleLys: 5.025 ± 1.069
IleLeu: 4.434 ± 0.464
IleMet: 2.069 ± 0.763
IleAsn: 1.774 ± 0.301
IlePro: 2.365 ± 0.222
IleGln: 3.252 ± 0.84
IleArg: 3.252 ± 0.826
IleSer: 3.547 ± 0.881
IleThr: 3.547 ± 0.61
IleVal: 3.252 ± 0.968
IleTrp: 0.887 ± 0.65
IleTyr: 2.365 ± 0.407
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.66 ± 1.189
LysCys: 0.887 ± 0.52
LysAsp: 3.252 ± 1.112
LysGlu: 5.616 ± 1.315
LysPhe: 4.73 ± 1.101
LysGly: 3.252 ± 0.233
LysHis: 1.774 ± 0.5
LysIle: 3.252 ± 1.262
LysLys: 5.616 ± 1.607
LysLeu: 4.434 ± 1.681
LysMet: 1.182 ± 0.512
LysAsn: 2.365 ± 0.691
LysPro: 4.138 ± 1.363
LysGln: 1.478 ± 0.509
LysArg: 6.208 ± 1.523
LysSer: 5.321 ± 0.994
LysThr: 4.138 ± 0.733
LysVal: 5.025 ± 1.964
LysTrp: 1.182 ± 0.521
LysTyr: 3.547 ± 0.901
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.503 ± 1.669
LeuCys: 2.66 ± 1.054
LeuAsp: 5.616 ± 0.776
LeuGlu: 7.094 ± 1.271
LeuPhe: 2.069 ± 0.46
LeuGly: 5.321 ± 0.48
LeuHis: 2.069 ± 0.772
LeuIle: 6.799 ± 1.213
LeuLys: 6.503 ± 1.639
LeuLeu: 7.981 ± 2.21
LeuMet: 1.774 ± 0.448
LeuAsn: 5.025 ± 0.609
LeuPro: 4.73 ± 0.922
LeuGln: 4.73 ± 0.499
LeuArg: 2.956 ± 0.86
LeuSer: 8.572 ± 1.973
LeuThr: 2.956 ± 1.352
LeuVal: 6.799 ± 0.938
LeuTrp: 0.887 ± 0.352
LeuTyr: 2.365 ± 0.965
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.069 ± 0.657
MetCys: 0.591 ± 0.524
MetAsp: 3.547 ± 0.705
MetGlu: 3.252 ± 0.571
MetPhe: 1.478 ± 0.925
MetGly: 1.774 ± 0.792
MetHis: 0.887 ± 0.388
MetIle: 0.591 ± 0.371
MetLys: 1.478 ± 0.396
MetLeu: 2.365 ± 0.964
MetMet: 0.0 ± 0.0
MetAsn: 0.591 ± 0.543
MetPro: 1.182 ± 0.521
MetGln: 0.591 ± 0.322
MetArg: 1.182 ± 0.787
MetSer: 3.252 ± 0.723
MetThr: 0.887 ± 0.319
MetVal: 1.182 ± 0.369
MetTrp: 0.887 ± 0.815
MetTyr: 1.478 ± 0.613
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.547 ± 1.015
AsnCys: 0.887 ± 0.597
AsnAsp: 0.887 ± 0.597
AsnGlu: 1.478 ± 0.398
AsnPhe: 2.365 ± 0.5
AsnGly: 1.774 ± 0.677
AsnHis: 0.591 ± 0.313
AsnIle: 3.252 ± 1.096
AsnLys: 2.956 ± 0.747
AsnLeu: 3.252 ± 0.655
AsnMet: 2.365 ± 0.532
AsnAsn: 2.069 ± 0.906
AsnPro: 2.66 ± 1.08
AsnGln: 1.478 ± 0.597
AsnArg: 2.069 ± 0.677
AsnSer: 1.182 ± 0.572
AsnThr: 4.434 ± 1.023
AsnVal: 2.66 ± 0.746
AsnTrp: 1.182 ± 0.531
AsnTyr: 0.591 ± 0.561
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.956 ± 0.76
ProCys: 0.0 ± 0.0
ProAsp: 2.365 ± 0.504
ProGlu: 3.252 ± 1.209
ProPhe: 0.887 ± 0.352
ProGly: 2.365 ± 0.638
ProHis: 1.182 ± 0.415
ProIle: 3.547 ± 0.707
ProLys: 2.66 ± 0.479
ProLeu: 4.73 ± 1.383
ProMet: 0.887 ± 0.815
ProAsn: 2.069 ± 0.603
ProPro: 2.069 ± 0.856
ProGln: 1.774 ± 0.997
ProArg: 2.66 ± 0.97
ProSer: 3.843 ± 1.081
ProThr: 2.66 ± 0.757
ProVal: 3.547 ± 1.205
ProTrp: 0.296 ± 0.28
ProTyr: 2.365 ± 0.978
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.365 ± 1.025
GlnCys: 0.591 ± 0.313
GlnAsp: 2.66 ± 1.312
GlnGlu: 4.434 ± 0.442
GlnPhe: 0.887 ± 0.578
GlnGly: 1.774 ± 0.509
GlnHis: 0.591 ± 0.322
GlnIle: 1.478 ± 0.951
GlnLys: 2.956 ± 0.629
GlnLeu: 2.365 ± 0.926
GlnMet: 2.956 ± 0.425
GlnAsn: 1.182 ± 0.466
GlnPro: 1.182 ± 0.525
GlnGln: 0.887 ± 0.343
GlnArg: 2.956 ± 0.719
GlnSer: 2.365 ± 0.814
GlnThr: 2.069 ± 0.55
GlnVal: 2.365 ± 0.63
GlnTrp: 0.887 ± 0.326
GlnTyr: 1.774 ± 0.782
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.138 ± 0.543
ArgCys: 1.478 ± 0.646
ArgAsp: 3.252 ± 0.388
ArgGlu: 2.069 ± 0.931
ArgPhe: 3.547 ± 0.966
ArgGly: 3.843 ± 0.611
ArgHis: 1.478 ± 0.475
ArgIle: 4.73 ± 1.108
ArgLys: 3.547 ± 0.858
ArgLeu: 3.843 ± 0.782
ArgMet: 1.478 ± 0.479
ArgAsn: 2.069 ± 0.715
ArgPro: 2.956 ± 0.887
ArgGln: 0.887 ± 0.59
ArgArg: 3.843 ± 0.509
ArgSer: 3.547 ± 0.902
ArgThr: 2.365 ± 1.159
ArgVal: 2.365 ± 0.659
ArgTrp: 0.887 ± 0.411
ArgTyr: 1.478 ± 0.625
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.843 ± 1.297
SerCys: 1.478 ± 0.581
SerAsp: 2.365 ± 0.532
SerGlu: 4.138 ± 0.732
SerPhe: 2.069 ± 1.031
SerGly: 2.956 ± 0.801
SerHis: 1.182 ± 0.617
SerIle: 5.025 ± 1.558
SerLys: 7.685 ± 0.949
SerLeu: 6.503 ± 1.123
SerMet: 2.66 ± 0.596
SerAsn: 2.956 ± 0.833
SerPro: 2.956 ± 0.513
SerGln: 3.252 ± 1.074
SerArg: 4.73 ± 1.199
SerSer: 7.39 ± 1.745
SerThr: 4.434 ± 0.822
SerVal: 4.138 ± 0.726
SerTrp: 0.591 ± 0.371
SerTyr: 3.252 ± 1.151
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.365 ± 0.322
ThrCys: 0.0 ± 0.0
ThrAsp: 3.252 ± 1.02
ThrGlu: 3.547 ± 0.661
ThrPhe: 0.887 ± 0.326
ThrGly: 5.025 ± 0.905
ThrHis: 1.478 ± 0.174
ThrIle: 5.025 ± 0.853
ThrLys: 5.321 ± 0.663
ThrLeu: 4.434 ± 1.482
ThrMet: 1.774 ± 0.869
ThrAsn: 2.956 ± 1.231
ThrPro: 2.956 ± 0.669
ThrGln: 2.66 ± 0.561
ThrArg: 2.365 ± 1.128
ThrSer: 5.025 ± 0.688
ThrThr: 4.138 ± 1.071
ThrVal: 3.547 ± 0.524
ThrTrp: 1.182 ± 0.54
ThrTyr: 2.069 ± 0.329
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.956 ± 0.762
ValCys: 1.182 ± 0.774
ValAsp: 2.66 ± 1.123
ValGlu: 5.321 ± 1.012
ValPhe: 2.66 ± 0.316
ValGly: 4.434 ± 1.257
ValHis: 2.069 ± 0.534
ValIle: 2.956 ± 1.218
ValLys: 3.252 ± 0.532
ValLeu: 7.094 ± 1.179
ValMet: 2.66 ± 0.993
ValAsn: 1.182 ± 0.503
ValPro: 3.843 ± 0.882
ValGln: 3.252 ± 0.961
ValArg: 3.547 ± 0.747
ValSer: 2.956 ± 1.184
ValThr: 1.478 ± 0.692
ValVal: 4.73 ± 0.793
ValTrp: 0.296 ± 0.235
ValTyr: 2.365 ± 1.035
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.887 ± 0.348
TrpCys: 0.296 ± 0.28
TrpAsp: 0.591 ± 0.291
TrpGlu: 0.887 ± 0.517
TrpPhe: 0.0 ± 0.0
TrpGly: 0.887 ± 0.344
TrpHis: 0.296 ± 0.28
TrpIle: 0.591 ± 0.543
TrpLys: 2.365 ± 0.72
TrpLeu: 0.887 ± 0.344
TrpMet: 0.591 ± 0.363
TrpAsn: 0.887 ± 0.319
TrpPro: 0.0 ± 0.0
TrpGln: 1.182 ± 0.663
TrpArg: 0.296 ± 0.262
TrpSer: 1.774 ± 0.63
TrpThr: 1.182 ± 0.506
TrpVal: 1.182 ± 0.852
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.182 ± 0.545
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.66 ± 0.776
TyrCys: 1.182 ± 0.643
TyrAsp: 1.478 ± 0.56
TyrGlu: 3.547 ± 0.99
TyrPhe: 1.478 ± 0.906
TyrGly: 2.365 ± 0.363
TyrHis: 1.182 ± 0.368
TyrIle: 1.478 ± 0.547
TyrLys: 1.774 ± 0.581
TyrLeu: 3.547 ± 1.083
TyrMet: 1.478 ± 0.545
TyrAsn: 2.365 ± 0.523
TyrPro: 1.478 ± 0.523
TyrGln: 0.887 ± 0.489
TyrArg: 2.069 ± 0.597
TyrSer: 2.66 ± 0.947
TyrThr: 2.66 ± 0.571
TyrVal: 0.887 ± 0.348
TyrTrp: 1.182 ± 0.319
TyrTyr: 0.296 ± 0.235
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 6 proteins (3384 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
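A protein-level bootstrap of this kind can be sketched as follows. This is an illustrative reading of the note, not the documented Proteome-pI procedure: the function names, the toy sequences, the fixed random seed, and the use of the standard deviation of the resampled estimates as the reported error are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping pairs within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement; residues are never shuffled.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences in one-letter code, not the Upolu virus proteome.
toy = ["MAAKLAA", "MKLLKA", "MAAAA", "MKKKL"]
print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much a dipeptide's frequency depends on which proteins happen to be in the proteome. With only six proteins, as here, such estimates are necessarily coarse.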

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021. doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski