Amino acid dipepetide frequency for Rio Grande virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.817AlaAla: 5.817 ± 2.231
1.77AlaCys: 1.77 ± 0.371
2.023AlaAsp: 2.023 ± 0.529
4.047AlaGlu: 4.047 ± 0.743
3.794AlaPhe: 3.794 ± 1.096
2.529AlaGly: 2.529 ± 0.673
1.265AlaHis: 1.265 ± 0.659
2.529AlaIle: 2.529 ± 0.367
3.794AlaLys: 3.794 ± 1.328
7.081AlaLeu: 7.081 ± 0.313
1.517AlaMet: 1.517 ± 0.647
2.782AlaAsn: 2.782 ± 0.904
1.77AlaPro: 1.77 ± 0.292
2.023AlaGln: 2.023 ± 0.883
3.541AlaArg: 3.541 ± 0.165
5.311AlaSer: 5.311 ± 1.499
3.288AlaThr: 3.288 ± 0.94
2.782AlaVal: 2.782 ± 1.312
0.253AlaTrp: 0.253 ± 0.169
2.023AlaTyr: 2.023 ± 1.282
0.0AlaXaa: 0.0 ± 0.0
Cys
2.023CysAla: 2.023 ± 0.529
0.759CysCys: 0.759 ± 0.553
0.759CysAsp: 0.759 ± 0.203
1.012CysGlu: 1.012 ± 0.583
1.517CysPhe: 1.517 ± 0.704
1.012CysGly: 1.012 ± 0.948
0.759CysHis: 0.759 ± 0.468
1.265CysIle: 1.265 ± 0.307
2.023CysLys: 2.023 ± 0.567
1.517CysLeu: 1.517 ± 0.328
0.253CysMet: 0.253 ± 0.169
1.012CysAsn: 1.012 ± 0.346
0.759CysPro: 0.759 ± 0.203
2.023CysGln: 2.023 ± 1.166
1.77CysArg: 1.77 ± 1.659
2.529CysSer: 2.529 ± 0.962
1.012CysThr: 1.012 ± 0.346
2.529CysVal: 2.529 ± 1.705
0.253CysTrp: 0.253 ± 0.237
0.759CysTyr: 0.759 ± 0.352
0.0CysXaa: 0.0 ± 0.0
Asp
1.265AspAla: 1.265 ± 0.356
1.517AspCys: 1.517 ± 0.503
5.311AspAsp: 5.311 ± 0.897
4.299AspGlu: 4.299 ± 1.141
2.023AspPhe: 2.023 ± 1.354
2.023AspGly: 2.023 ± 0.501
1.77AspHis: 1.77 ± 0.836
5.058AspIle: 5.058 ± 1.246
3.794AspLys: 3.794 ± 1.14
7.334AspLeu: 7.334 ± 3.051
1.265AspMet: 1.265 ± 0.721
2.529AspAsn: 2.529 ± 0.742
3.035AspPro: 3.035 ± 0.247
2.276AspGln: 2.276 ± 0.566
2.529AspArg: 2.529 ± 0.994
3.794AspSer: 3.794 ± 0.461
1.77AspThr: 1.77 ± 0.836
3.035AspVal: 3.035 ± 0.729
0.506AspTrp: 0.506 ± 0.142
1.012AspTyr: 1.012 ± 0.283
0.0AspXaa: 0.0 ± 0.0
Glu
6.576GluAla: 6.576 ± 2.816
1.517GluCys: 1.517 ± 0.704
3.541GluAsp: 3.541 ± 0.681
7.587GluGlu: 7.587 ± 0.603
4.047GluPhe: 4.047 ± 1.393
4.299GluGly: 4.299 ± 0.068
1.012GluHis: 1.012 ± 0.283
5.311GluIle: 5.311 ± 0.296
2.782GluLys: 2.782 ± 0.7
7.334GluLeu: 7.334 ± 1.422
2.782GluMet: 2.782 ± 0.7
3.541GluAsn: 3.541 ± 1.278
1.265GluPro: 1.265 ± 0.436
1.517GluGln: 1.517 ± 0.425
4.047GluArg: 4.047 ± 0.906
4.299GluSer: 4.299 ± 0.367
3.541GluThr: 3.541 ± 0.966
3.541GluVal: 3.541 ± 0.884
1.012GluTrp: 1.012 ± 0.419
1.77GluTyr: 1.77 ± 0.624
0.0GluXaa: 0.0 ± 0.0
Phe
2.276PheAla: 2.276 ± 1.561
1.265PheCys: 1.265 ± 0.481
3.288PheAsp: 3.288 ± 0.535
2.276PheGlu: 2.276 ± 0.547
2.276PhePhe: 2.276 ± 0.783
0.759PheGly: 0.759 ± 0.508
0.253PheHis: 0.253 ± 0.169
1.77PheIle: 1.77 ± 0.433
3.541PheLys: 3.541 ± 0.681
6.07PheLeu: 6.07 ± 0.984
0.759PheMet: 0.759 ± 0.352
3.541PheAsn: 3.541 ± 0.486
1.517PhePro: 1.517 ± 1.106
2.023PheGln: 2.023 ± 0.501
2.529PheArg: 2.529 ± 0.871
5.817PheSer: 5.817 ± 1.031
2.782PheThr: 2.782 ± 0.288
3.794PheVal: 3.794 ± 0.789
0.506PheTrp: 0.506 ± 0.142
0.759PheTyr: 0.759 ± 0.203
0.0PheXaa: 0.0 ± 0.0
Gly
4.299GlyAla: 4.299 ± 1.224
1.265GlyCys: 1.265 ± 0.481
2.276GlyAsp: 2.276 ± 0.28
2.782GlyGlu: 2.782 ± 1.42
4.552GlyPhe: 4.552 ± 0.617
5.817GlyGly: 5.817 ± 0.976
1.77GlyHis: 1.77 ± 0.433
2.276GlyIle: 2.276 ± 0.446
3.541GlyLys: 3.541 ± 0.681
4.299GlyLeu: 4.299 ± 0.656
1.517GlyMet: 1.517 ± 0.349
1.77GlyAsn: 1.77 ± 0.552
3.288GlyPro: 3.288 ± 0.841
1.265GlyGln: 1.265 ± 0.817
2.782GlyArg: 2.782 ± 1.602
6.07GlySer: 6.07 ± 1.473
3.541GlyThr: 3.541 ± 0.742
4.552GlyVal: 4.552 ± 1.507
0.506GlyTrp: 0.506 ± 0.474
1.77GlyTyr: 1.77 ± 0.338
0.0GlyXaa: 0.0 ± 0.0
His
1.012HisAla: 1.012 ± 0.583
0.506HisCys: 0.506 ± 0.142
1.77HisAsp: 1.77 ± 0.552
1.77HisGlu: 1.77 ± 0.933
1.517HisPhe: 1.517 ± 0.328
1.77HisGly: 1.77 ± 0.836
0.506HisHis: 0.506 ± 0.474
1.77HisIle: 1.77 ± 0.982
1.012HisLys: 1.012 ± 0.583
2.782HisLeu: 2.782 ± 0.702
0.506HisMet: 0.506 ± 0.142
0.0HisAsn: 0.0 ± 0.0
1.012HisPro: 1.012 ± 0.321
1.012HisGln: 1.012 ± 0.283
1.012HisArg: 1.012 ± 0.677
2.782HisSer: 2.782 ± 0.374
1.265HisThr: 1.265 ± 0.817
1.012HisVal: 1.012 ± 0.677
0.0HisTrp: 0.0 ± 0.0
1.012HisTyr: 1.012 ± 1.067
0.0HisXaa: 0.0 ± 0.0
Ile
4.805IleAla: 4.805 ± 0.479
1.517IleCys: 1.517 ± 0.328
3.541IleAsp: 3.541 ± 0.681
3.541IleGlu: 3.541 ± 1.328
2.529IlePhe: 2.529 ± 0.302
3.794IleGly: 3.794 ± 1.172
1.517IleHis: 1.517 ± 0.328
4.299IleIle: 4.299 ± 0.723
5.817IleLys: 5.817 ± 1.464
6.07IleLeu: 6.07 ± 0.552
0.759IleMet: 0.759 ± 0.468
2.276IleAsn: 2.276 ± 0.459
1.77IlePro: 1.77 ± 0.542
2.529IleGln: 2.529 ± 0.994
5.564IleArg: 5.564 ± 1.55
5.058IleSer: 5.058 ± 1.164
3.035IleThr: 3.035 ± 0.736
4.299IleVal: 4.299 ± 0.693
0.506IleTrp: 0.506 ± 0.142
1.012IleTyr: 1.012 ± 0.677
0.0IleXaa: 0.0 ± 0.0
Lys
3.288LysAla: 3.288 ± 0.535
1.517LysCys: 1.517 ± 1.053
2.782LysAsp: 2.782 ± 0.374
7.587LysGlu: 7.587 ± 1.298
2.276LysPhe: 2.276 ± 0.61
3.794LysGly: 3.794 ± 0.799
1.517LysHis: 1.517 ± 0.828
5.311LysIle: 5.311 ± 0.989
3.035LysLys: 3.035 ± 0.492
5.817LysLeu: 5.817 ± 1.175
2.782LysMet: 2.782 ± 0.85
3.035LysAsn: 3.035 ± 0.662
2.529LysPro: 2.529 ± 1.059
2.023LysGln: 2.023 ± 0.388
2.023LysArg: 2.023 ± 0.713
3.035LysSer: 3.035 ± 0.769
3.288LysThr: 3.288 ± 0.546
5.564LysVal: 5.564 ± 0.528
1.517LysTrp: 1.517 ± 0.278
2.782LysTyr: 2.782 ± 0.886
0.0LysXaa: 0.0 ± 0.0
Leu
4.552LeuAla: 4.552 ± 1.471
2.529LeuCys: 2.529 ± 0.367
5.564LeuAsp: 5.564 ± 1.31
5.311LeuGlu: 5.311 ± 1.198
4.552LeuPhe: 4.552 ± 0.914
4.047LeuGly: 4.047 ± 1.345
2.023LeuHis: 2.023 ± 0.501
6.323LeuIle: 6.323 ± 0.339
6.576LeuLys: 6.576 ± 1.091
4.805LeuLeu: 4.805 ± 1.134
3.035LeuMet: 3.035 ± 0.648
2.782LeuAsn: 2.782 ± 0.734
3.288LeuPro: 3.288 ± 0.833
1.77LeuGln: 1.77 ± 0.542
6.323LeuArg: 6.323 ± 0.701
11.128LeuSer: 11.128 ± 0.362
5.058LeuThr: 5.058 ± 1.41
5.058LeuVal: 5.058 ± 1.736
0.253LeuTrp: 0.253 ± 0.237
1.517LeuTyr: 1.517 ± 0.67
0.0LeuXaa: 0.0 ± 0.0
Met
0.759MetAla: 0.759 ± 0.203
0.253MetCys: 0.253 ± 0.237
3.035MetAsp: 3.035 ± 0.423
2.529MetGlu: 2.529 ± 2.146
1.012MetPhe: 1.012 ± 0.346
2.276MetGly: 2.276 ± 0.906
1.012MetHis: 1.012 ± 0.521
2.782MetIle: 2.782 ± 0.831
1.77MetLys: 1.77 ± 0.338
1.517MetLeu: 1.517 ± 1.106
1.77MetMet: 1.77 ± 0.641
1.265MetAsn: 1.265 ± 0.481
0.506MetPro: 0.506 ± 0.859
1.012MetGln: 1.012 ± 0.283
1.77MetArg: 1.77 ± 0.386
2.276MetSer: 2.276 ± 0.851
2.782MetThr: 2.782 ± 1.516
0.506MetVal: 0.506 ± 0.142
0.253MetTrp: 0.253 ± 0.169
1.012MetTyr: 1.012 ± 0.346
0.0MetXaa: 0.0 ± 0.0
Asn
1.265AsnAla: 1.265 ± 0.436
1.012AsnCys: 1.012 ± 0.283
2.023AsnAsp: 2.023 ± 0.422
3.288AsnGlu: 3.288 ± 0.833
2.023AsnPhe: 2.023 ± 0.388
4.299AsnGly: 4.299 ± 1.724
0.759AsnHis: 0.759 ± 0.508
1.265AsnIle: 1.265 ± 0.481
3.035AsnLys: 3.035 ± 0.662
3.035AsnLeu: 3.035 ± 0.772
1.012AsnMet: 1.012 ± 0.433
1.012AsnAsn: 1.012 ± 0.346
3.288AsnPro: 3.288 ± 0.841
2.023AsnGln: 2.023 ± 0.567
2.782AsnArg: 2.782 ± 0.66
2.782AsnSer: 2.782 ± 1.248
1.012AsnThr: 1.012 ± 0.346
2.023AsnVal: 2.023 ± 1.073
0.506AsnTrp: 0.506 ± 0.339
2.529AsnTyr: 2.529 ± 1.391
0.0AsnXaa: 0.0 ± 0.0
Pro
1.265ProAla: 1.265 ± 0.344
0.506ProCys: 0.506 ± 0.142
2.023ProAsp: 2.023 ± 0.713
4.299ProGlu: 4.299 ± 1.118
2.023ProPhe: 2.023 ± 0.883
2.276ProGly: 2.276 ± 0.304
1.265ProHis: 1.265 ± 0.481
1.012ProIle: 1.012 ± 1.029
2.529ProLys: 2.529 ± 0.829
3.035ProLeu: 3.035 ± 0.864
1.012ProMet: 1.012 ± 0.56
1.265ProAsn: 1.265 ± 0.307
1.517ProPro: 1.517 ± 1.573
1.77ProGln: 1.77 ± 0.715
1.517ProArg: 1.517 ± 0.513
3.541ProSer: 3.541 ± 1.351
1.265ProThr: 1.265 ± 0.398
3.035ProVal: 3.035 ± 0.864
1.012ProTrp: 1.012 ± 0.521
1.517ProTyr: 1.517 ± 0.67
0.0ProXaa: 0.0 ± 0.0
Gln
2.023GlnAla: 2.023 ± 0.577
1.77GlnCys: 1.77 ± 0.371
1.265GlnAsp: 1.265 ± 0.481
3.288GlnGlu: 3.288 ± 0.629
0.759GlnPhe: 0.759 ± 0.352
2.782GlnGly: 2.782 ± 0.516
1.012GlnHis: 1.012 ± 0.346
3.541GlnIle: 3.541 ± 0.865
1.265GlnLys: 1.265 ± 0.506
1.265GlnLeu: 1.265 ± 1.142
1.012GlnMet: 1.012 ± 0.283
1.265GlnAsn: 1.265 ± 0.307
1.77GlnPro: 1.77 ± 0.386
2.276GlnGln: 2.276 ± 0.644
1.517GlnArg: 1.517 ± 0.937
3.288GlnSer: 3.288 ± 0.841
1.265GlnThr: 1.265 ± 0.398
2.276GlnVal: 2.276 ± 0.783
0.253GlnTrp: 0.253 ± 0.451
0.253GlnTyr: 0.253 ± 0.169
0.0GlnXaa: 0.0 ± 0.0
Arg
5.564ArgAla: 5.564 ± 1.325
0.506ArgCys: 0.506 ± 0.474
4.299ArgAsp: 4.299 ± 1.909
4.047ArgGlu: 4.047 ± 0.625
1.265ArgPhe: 1.265 ± 0.672
3.541ArgGly: 3.541 ± 1.105
0.759ArgHis: 0.759 ± 0.657
4.299ArgIle: 4.299 ± 1.523
3.035ArgLys: 3.035 ± 0.623
2.782ArgLeu: 2.782 ± 0.892
1.77ArgMet: 1.77 ± 0.641
2.276ArgAsn: 2.276 ± 0.793
1.77ArgPro: 1.77 ± 0.552
1.77ArgGln: 1.77 ± 0.433
3.035ArgArg: 3.035 ± 1.035
6.07ArgSer: 6.07 ± 2.276
3.035ArgThr: 3.035 ± 0.648
3.288ArgVal: 3.288 ± 0.477
0.506ArgTrp: 0.506 ± 0.142
1.012ArgTyr: 1.012 ± 0.346
0.0ArgXaa: 0.0 ± 0.0
Ser
5.564SerAla: 5.564 ± 0.856
2.782SerCys: 2.782 ± 1.87
4.552SerAsp: 4.552 ± 1.992
4.552SerGlu: 4.552 ± 0.283
5.564SerPhe: 5.564 ± 0.814
4.552SerGly: 4.552 ± 1.008
2.023SerHis: 2.023 ± 0.501
5.817SerIle: 5.817 ± 2.205
7.081SerLys: 7.081 ± 1.148
11.634SerLeu: 11.634 ± 1.692
3.035SerMet: 3.035 ± 0.889
3.288SerAsn: 3.288 ± 0.216
3.541SerPro: 3.541 ± 1.75
2.782SerGln: 2.782 ± 0.499
4.047SerArg: 4.047 ± 0.949
8.852SerSer: 8.852 ± 1.837
2.529SerThr: 2.529 ± 0.367
4.299SerVal: 4.299 ± 0.582
2.529SerTrp: 2.529 ± 0.74
2.782SerTyr: 2.782 ± 0.288
0.0SerXaa: 0.0 ± 0.0
Thr
4.047ThrAla: 4.047 ± 0.71
2.276ThrCys: 2.276 ± 0.793
2.782ThrAsp: 2.782 ± 0.702
3.794ThrGlu: 3.794 ± 0.723
1.265ThrPhe: 1.265 ± 0.307
4.299ThrGly: 4.299 ± 1.4
1.012ThrHis: 1.012 ± 0.583
2.529ThrIle: 2.529 ± 0.742
3.541ThrLys: 3.541 ± 0.901
4.047ThrLeu: 4.047 ± 0.625
0.506ThrMet: 0.506 ± 0.339
2.023ThrAsn: 2.023 ± 0.713
1.265ThrPro: 1.265 ± 0.398
1.012ThrGln: 1.012 ± 0.62
2.276ThrArg: 2.276 ± 0.499
4.552ThrSer: 4.552 ± 1.428
2.529ThrThr: 2.529 ± 0.388
2.782ThrVal: 2.782 ± 0.702
0.253ThrTrp: 0.253 ± 0.169
0.759ThrTyr: 0.759 ± 0.381
0.0ThrXaa: 0.0 ± 0.0
Val
2.276ValAla: 2.276 ± 0.731
1.265ValCys: 1.265 ± 0.817
2.023ValAsp: 2.023 ± 0.566
4.047ValGlu: 4.047 ± 1.108
3.288ValPhe: 3.288 ± 0.805
3.794ValGly: 3.794 ± 0.98
2.276ValHis: 2.276 ± 1.101
4.299ValIle: 4.299 ± 0.928
4.805ValLys: 4.805 ± 1.282
3.541ValLeu: 3.541 ± 1.61
1.77ValMet: 1.77 ± 1.185
2.782ValAsn: 2.782 ± 1.592
2.023ValPro: 2.023 ± 1.256
1.77ValGln: 1.77 ± 0.567
3.794ValArg: 3.794 ± 0.075
8.093ValSer: 8.093 ± 1.484
2.529ValThr: 2.529 ± 1.284
3.541ValVal: 3.541 ± 0.9
1.012ValTrp: 1.012 ± 0.346
2.023ValTyr: 2.023 ± 0.422
0.0ValXaa: 0.0 ± 0.0
Trp
0.253TrpAla: 0.253 ± 0.169
0.506TrpCys: 0.506 ± 0.142
1.265TrpAsp: 1.265 ± 0.506
0.0TrpGlu: 0.0 ± 0.0
0.506TrpPhe: 0.506 ± 0.142
0.759TrpGly: 0.759 ± 0.352
0.0TrpHis: 0.0 ± 0.0
0.759TrpIle: 0.759 ± 0.553
0.253TrpLys: 0.253 ± 0.169
0.253TrpLeu: 0.253 ± 0.169
1.012TrpMet: 1.012 ± 0.283
0.506TrpAsn: 0.506 ± 0.142
0.759TrpPro: 0.759 ± 1.087
0.253TrpGln: 0.253 ± 0.451
0.759TrpArg: 0.759 ± 0.508
1.265TrpSer: 1.265 ± 0.817
1.012TrpThr: 1.012 ± 0.321
1.265TrpVal: 1.265 ± 0.356
0.253TrpTrp: 0.253 ± 0.169
0.759TrpTyr: 0.759 ± 0.203
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.012TyrAla: 1.012 ± 0.583
0.506TyrCys: 0.506 ± 0.474
2.023TyrAsp: 2.023 ± 0.69
1.265TyrGlu: 1.265 ± 0.307
1.012TyrPhe: 1.012 ± 0.677
1.517TyrGly: 1.517 ± 0.503
1.517TyrHis: 1.517 ± 0.67
2.023TyrIle: 2.023 ± 1.24
2.276TyrLys: 2.276 ± 0.85
2.276TyrLeu: 2.276 ± 0.61
1.77TyrMet: 1.77 ± 0.706
2.276TyrAsn: 2.276 ± 0.28
1.012TyrPro: 1.012 ± 0.483
1.012TyrGln: 1.012 ± 1.183
1.012TyrArg: 1.012 ± 0.346
1.265TyrSer: 1.265 ± 0.436
1.012TyrThr: 1.012 ± 0.346
1.77TyrVal: 1.77 ± 0.542
0.506TyrTrp: 0.506 ± 0.142
1.77TyrTyr: 1.77 ± 1.496
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3955 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski