Amino acid dipepetide frequency for Utinga virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.301AlaAla: 2.301 ± 5.084
1.278AlaCys: 1.278 ± 0.518
2.046AlaAsp: 2.046 ± 0.68
3.835AlaGlu: 3.835 ± 4.697
3.835AlaPhe: 3.835 ± 2.406
0.256AlaGly: 0.256 ± 1.07
0.767AlaHis: 0.767 ± 0.211
3.58AlaIle: 3.58 ± 0.966
4.347AlaLys: 4.347 ± 0.821
3.58AlaLeu: 3.58 ± 0.958
1.023AlaMet: 1.023 ± 0.87
1.534AlaAsn: 1.534 ± 0.64
0.511AlaPro: 0.511 ± 0.158
1.534AlaGln: 1.534 ± 0.423
1.79AlaArg: 1.79 ± 2.946
2.301AlaSer: 2.301 ± 0.634
1.79AlaThr: 1.79 ± 1.803
2.046AlaVal: 2.046 ± 1.755
0.511AlaTrp: 0.511 ± 0.158
1.534AlaTyr: 1.534 ± 0.835
0.0AlaXaa: 0.0 ± 0.0
Cys
1.023CysAla: 1.023 ± 0.639
0.256CysCys: 0.256 ± 0.237
0.511CysAsp: 0.511 ± 0.32
1.79CysGlu: 1.79 ± 1.309
1.534CysPhe: 1.534 ± 0.741
1.79CysGly: 1.79 ± 1.309
0.256CysHis: 0.256 ± 0.237
3.324CysIle: 3.324 ± 1.403
2.301CysLys: 2.301 ± 1.438
3.58CysLeu: 3.58 ± 1.34
0.511CysMet: 0.511 ± 0.32
3.324CysAsn: 3.324 ± 2.039
1.023CysPro: 1.023 ± 0.316
0.511CysGln: 0.511 ± 0.158
1.278CysArg: 1.278 ± 0.337
1.79CysSer: 1.79 ± 0.67
1.79CysThr: 1.79 ± 0.713
1.023CysVal: 1.023 ± 0.95
0.0CysTrp: 0.0 ± 0.0
1.79CysTyr: 1.79 ± 0.971
0.0CysXaa: 0.0 ± 0.0
Asp
2.046AspAla: 2.046 ± 0.653
1.278AspCys: 1.278 ± 0.518
3.324AspAsp: 3.324 ± 1.436
4.347AspGlu: 4.347 ± 0.821
3.068AspPhe: 3.068 ± 0.904
2.813AspGly: 2.813 ± 1.547
0.511AspHis: 0.511 ± 0.158
7.926AspIle: 7.926 ± 2.21
3.068AspLys: 3.068 ± 1.019
8.949AspLeu: 8.949 ± 2.613
2.046AspMet: 2.046 ± 0.541
4.091AspAsn: 4.091 ± 0.309
1.278AspPro: 1.278 ± 0.984
1.534AspGln: 1.534 ± 0.734
1.79AspArg: 1.79 ± 0.543
1.79AspSer: 1.79 ± 0.543
2.046AspThr: 2.046 ± 0.541
2.046AspVal: 2.046 ± 1.802
0.0AspTrp: 0.0 ± 0.0
2.557AspTyr: 2.557 ± 0.675
0.0AspXaa: 0.0 ± 0.0
Glu
2.557GluAla: 2.557 ± 1.638
1.023GluCys: 1.023 ± 0.34
3.068GluAsp: 3.068 ± 1.281
6.648GluGlu: 6.648 ± 1.8
3.835GluPhe: 3.835 ± 1.471
1.278GluGly: 1.278 ± 0.518
1.79GluHis: 1.79 ± 0.971
7.415GluIle: 7.415 ± 0.364
5.881GluLys: 5.881 ± 2.135
4.347GluLeu: 4.347 ± 1.168
3.324GluMet: 3.324 ± 1.402
3.068GluAsn: 3.068 ± 0.846
2.046GluPro: 2.046 ± 0.953
2.046GluGln: 2.046 ± 0.608
3.068GluArg: 3.068 ± 1.587
4.347GluSer: 4.347 ± 1.257
2.557GluThr: 2.557 ± 1.34
3.58GluVal: 3.58 ± 0.195
0.511GluTrp: 0.511 ± 0.475
2.557GluTyr: 2.557 ± 1.27
0.0GluXaa: 0.0 ± 0.0
Phe
1.79PheAla: 1.79 ± 0.676
2.046PheCys: 2.046 ± 1.278
3.324PheAsp: 3.324 ± 0.914
4.347PheGlu: 4.347 ± 1.81
2.557PhePhe: 2.557 ± 1.65
3.324PheGly: 3.324 ± 1.473
1.023PheHis: 1.023 ± 0.34
4.602PheIle: 4.602 ± 1.213
4.602PheLys: 4.602 ± 1.423
5.369PheLeu: 5.369 ± 1.997
0.767PheMet: 0.767 ± 0.899
2.813PheAsn: 2.813 ± 0.512
2.046PhePro: 2.046 ± 2.892
1.278PheGln: 1.278 ± 0.825
3.324PheArg: 3.324 ± 1.427
3.835PheSer: 3.835 ± 0.546
2.813PheThr: 2.813 ± 0.887
2.557PheVal: 2.557 ± 0.502
0.256PheTrp: 0.256 ± 0.16
2.301PheTyr: 2.301 ± 0.634
0.0PheXaa: 0.0 ± 0.0
Gly
1.278GlyAla: 1.278 ± 1.983
2.046GlyCys: 2.046 ± 1.044
3.835GlyAsp: 3.835 ± 1.337
2.301GlyGlu: 2.301 ± 0.634
2.301GlyPhe: 2.301 ± 0.887
1.023GlyGly: 1.023 ± 0.87
0.256GlyHis: 0.256 ± 0.16
4.602GlyIle: 4.602 ± 1.643
3.068GlyLys: 3.068 ± 0.949
3.068GlyLeu: 3.068 ± 0.343
1.278GlyMet: 1.278 ± 1.965
3.068GlyAsn: 3.068 ± 0.846
1.534GlyPro: 1.534 ± 0.474
2.046GlyGln: 2.046 ± 0.633
0.511GlyArg: 0.511 ± 0.32
2.813GlySer: 2.813 ± 1.256
1.79GlyThr: 1.79 ± 0.713
0.256GlyVal: 0.256 ± 0.16
0.767GlyTrp: 0.767 ± 0.37
1.534GlyTyr: 1.534 ± 0.734
0.0GlyXaa: 0.0 ± 0.0
His
0.767HisAla: 0.767 ± 0.712
0.256HisCys: 0.256 ± 0.237
1.278HisAsp: 1.278 ± 0.518
0.767HisGlu: 0.767 ± 0.479
1.534HisPhe: 1.534 ± 0.81
1.79HisGly: 1.79 ± 0.796
0.767HisHis: 0.767 ± 0.211
1.023HisIle: 1.023 ± 0.95
2.301HisLys: 2.301 ± 0.825
1.278HisLeu: 1.278 ± 0.337
0.511HisMet: 0.511 ± 0.158
3.068HisAsn: 3.068 ± 0.818
0.767HisPro: 0.767 ± 0.211
0.511HisGln: 0.511 ± 0.158
1.79HisArg: 1.79 ± 1.964
1.534HisSer: 1.534 ± 0.959
1.278HisThr: 1.278 ± 0.518
0.511HisVal: 0.511 ± 0.32
0.511HisTrp: 0.511 ± 0.32
0.767HisTyr: 0.767 ± 0.211
0.0HisXaa: 0.0 ± 0.0
Ile
5.881IleAla: 5.881 ± 1.55
2.813IleCys: 2.813 ± 1.256
5.625IleAsp: 5.625 ± 1.761
6.392IleGlu: 6.392 ± 0.559
4.347IlePhe: 4.347 ± 1.704
3.835IleGly: 3.835 ± 1.553
3.068IleHis: 3.068 ± 1.019
8.693IleIle: 8.693 ± 1.939
8.949IleLys: 8.949 ± 0.825
9.972IleLeu: 9.972 ± 2.749
1.278IleMet: 1.278 ± 0.487
7.926IleAsn: 7.926 ± 3.611
3.068IlePro: 3.068 ± 0.343
2.557IleGln: 2.557 ± 0.791
2.813IleArg: 2.813 ± 0.512
7.159IleSer: 7.159 ± 1.931
5.881IleThr: 5.881 ± 1.599
3.835IleVal: 3.835 ± 0.6
0.767IleTrp: 0.767 ± 0.479
3.58IleTyr: 3.58 ± 1.941
0.0IleXaa: 0.0 ± 0.0
Lys
2.301LysAla: 2.301 ± 0.532
2.557LysCys: 2.557 ± 1.035
3.58LysAsp: 3.58 ± 1.426
7.159LysGlu: 7.159 ± 1.887
5.625LysPhe: 5.625 ± 1.496
3.068LysGly: 3.068 ± 0.343
0.511LysHis: 0.511 ± 0.32
6.392LysIle: 6.392 ± 1.687
5.114LysLys: 5.114 ± 1.948
5.881LysLeu: 5.881 ± 1.55
4.602LysMet: 4.602 ± 1.231
6.137LysAsn: 6.137 ± 0.686
2.557LysPro: 2.557 ± 0.751
2.557LysGln: 2.557 ± 0.791
3.068LysArg: 3.068 ± 1.019
6.648LysSer: 6.648 ± 1.783
8.693LysThr: 8.693 ± 0.965
4.091LysVal: 4.091 ± 0.679
1.023LysTrp: 1.023 ± 0.34
3.835LysTyr: 3.835 ± 1.553
0.0LysXaa: 0.0 ± 0.0
Leu
3.835LeuAla: 3.835 ± 2.456
2.813LeuCys: 2.813 ± 1.909
6.648LeuAsp: 6.648 ± 2.1
6.392LeuGlu: 6.392 ± 1.687
4.858LeuPhe: 4.858 ± 1.288
3.068LeuGly: 3.068 ± 1.805
1.534LeuHis: 1.534 ± 1.026
7.415LeuIle: 7.415 ± 2.201
6.904LeuLys: 6.904 ± 1.036
7.415LeuLeu: 7.415 ± 1.956
1.534LeuMet: 1.534 ± 0.959
6.904LeuAsn: 6.904 ± 1.805
3.068LeuPro: 3.068 ± 0.846
1.534LeuGln: 1.534 ± 0.64
3.835LeuArg: 3.835 ± 2.475
9.46LeuSer: 9.46 ± 1.718
5.625LeuThr: 5.625 ± 1.024
2.557LeuVal: 2.557 ± 0.502
0.256LeuTrp: 0.256 ± 0.16
2.813LeuTyr: 2.813 ± 0.788
0.0LeuXaa: 0.0 ± 0.0
Met
1.023MetAla: 1.023 ± 0.924
0.767MetCys: 0.767 ± 0.479
2.046MetAsp: 2.046 ± 1.802
1.534MetGlu: 1.534 ± 0.64
1.023MetPhe: 1.023 ± 0.87
0.767MetGly: 0.767 ± 0.211
1.278MetHis: 1.278 ± 0.819
1.278MetIle: 1.278 ± 0.487
3.58MetLys: 3.58 ± 0.195
2.301MetLeu: 2.301 ± 0.669
0.511MetMet: 0.511 ± 0.32
1.023MetAsn: 1.023 ± 0.34
2.046MetPro: 2.046 ± 0.541
1.023MetGln: 1.023 ± 0.34
1.79MetArg: 1.79 ± 1.938
2.046MetSer: 2.046 ± 0.541
2.813MetThr: 2.813 ± 0.748
1.023MetVal: 1.023 ± 0.316
0.511MetTrp: 0.511 ± 0.158
0.511MetTyr: 0.511 ± 1.012
0.0MetXaa: 0.0 ± 0.0
Asn
2.557AsnAla: 2.557 ± 0.497
1.278AsnCys: 1.278 ± 1.187
5.369AsnAsp: 5.369 ± 0.997
2.557AsnGlu: 2.557 ± 0.675
3.324AsnPhe: 3.324 ± 0.667
2.301AsnGly: 2.301 ± 0.669
2.301AsnHis: 2.301 ± 0.629
6.137AsnIle: 6.137 ± 2.374
5.114AsnLys: 5.114 ± 1.804
6.392AsnLeu: 6.392 ± 2.08
2.046AsnMet: 2.046 ± 0.68
4.858AsnAsn: 4.858 ± 0.405
3.324AsnPro: 3.324 ± 1.402
2.557AsnGln: 2.557 ± 0.497
1.79AsnArg: 1.79 ± 0.796
4.347AsnSer: 4.347 ± 1.257
5.369AsnThr: 5.369 ± 1.065
2.557AsnVal: 2.557 ± 5.031
0.767AsnTrp: 0.767 ± 0.479
2.813AsnTyr: 2.813 ± 1.127
0.0AsnXaa: 0.0 ± 0.0
Pro
1.79ProAla: 1.79 ± 0.676
0.511ProCys: 0.511 ± 0.475
2.301ProAsp: 2.301 ± 0.532
3.068ProGlu: 3.068 ± 0.904
0.767ProPhe: 0.767 ± 0.479
1.023ProGly: 1.023 ± 0.639
1.278ProHis: 1.278 ± 0.518
4.602ProIle: 4.602 ± 2.246
2.557ProLys: 2.557 ± 1.035
2.557ProLeu: 2.557 ± 1.937
0.511ProMet: 0.511 ± 0.158
1.534ProAsn: 1.534 ± 0.81
0.256ProPro: 0.256 ± 0.16
1.278ProGln: 1.278 ± 1.938
0.256ProArg: 0.256 ± 0.16
2.301ProSer: 2.301 ± 0.825
0.767ProThr: 0.767 ± 0.712
2.301ProVal: 2.301 ± 0.943
0.767ProTrp: 0.767 ± 0.479
1.534ProTyr: 1.534 ± 0.423
0.0ProXaa: 0.0 ± 0.0
Gln
1.534GlnAla: 1.534 ± 1.026
1.278GlnCys: 1.278 ± 0.836
1.79GlnAsp: 1.79 ± 0.796
1.278GlnGlu: 1.278 ± 0.337
1.278GlnPhe: 1.278 ± 1.983
1.023GlnGly: 1.023 ± 0.34
0.511GlnHis: 0.511 ± 0.475
3.324GlnIle: 3.324 ± 0.96
2.813GlnLys: 2.813 ± 0.887
2.557GlnLeu: 2.557 ± 0.751
0.767GlnMet: 0.767 ± 0.939
1.534GlnAsn: 1.534 ± 0.423
0.256GlnPro: 0.256 ± 0.237
1.534GlnGln: 1.534 ± 0.474
2.301GlnArg: 2.301 ± 1.945
2.046GlnSer: 2.046 ± 0.769
2.301GlnThr: 2.301 ± 0.629
1.534GlnVal: 1.534 ± 0.423
1.023GlnTrp: 1.023 ± 0.87
0.511GlnTyr: 0.511 ± 0.32
0.0GlnXaa: 0.0 ± 0.0
Arg
1.534ArgAla: 1.534 ± 0.734
1.278ArgCys: 1.278 ± 0.518
2.301ArgAsp: 2.301 ± 0.943
2.301ArgGlu: 2.301 ± 1.111
2.046ArgPhe: 2.046 ± 0.633
0.511ArgGly: 0.511 ± 0.475
1.534ArgHis: 1.534 ± 0.959
4.347ArgIle: 4.347 ± 2.716
2.813ArgLys: 2.813 ± 1.637
3.068ArgLeu: 3.068 ± 0.904
1.023ArgMet: 1.023 ± 0.316
2.301ArgAsn: 2.301 ± 0.825
0.767ArgPro: 0.767 ± 1.043
1.79ArgGln: 1.79 ± 4.078
0.511ArgArg: 0.511 ± 0.32
4.091ArgSer: 4.091 ± 1.05
2.301ArgThr: 2.301 ± 1.672
1.79ArgVal: 1.79 ± 1.938
0.256ArgTrp: 0.256 ± 1.07
1.534ArgTyr: 1.534 ± 1.026
0.0ArgXaa: 0.0 ± 0.0
Ser
3.068SerAla: 3.068 ± 0.818
3.068SerCys: 3.068 ± 1.805
2.557SerAsp: 2.557 ± 0.751
4.347SerGlu: 4.347 ± 1.154
2.813SerPhe: 2.813 ± 0.788
3.324SerGly: 3.324 ± 1.275
1.534SerHis: 1.534 ± 0.423
9.46SerIle: 9.46 ± 2.546
7.159SerLys: 7.159 ± 2.378
5.369SerLeu: 5.369 ± 1.629
2.813SerMet: 2.813 ± 0.512
3.324SerAsn: 3.324 ± 0.567
2.557SerPro: 2.557 ± 0.721
2.813SerGln: 2.813 ± 0.881
3.324SerArg: 3.324 ± 1.751
2.557SerSer: 2.557 ± 0.751
6.137SerThr: 6.137 ± 0.682
3.068SerVal: 3.068 ± 1.019
1.023SerTrp: 1.023 ± 0.87
2.046SerTyr: 2.046 ± 0.633
0.0SerXaa: 0.0 ± 0.0
Thr
2.046ThrAla: 2.046 ± 0.608
2.301ThrCys: 2.301 ± 1.438
3.324ThrAsp: 3.324 ± 1.163
2.046ThrGlu: 2.046 ± 0.886
4.602ThrPhe: 4.602 ± 5.641
4.858ThrGly: 4.858 ± 1.133
1.023ThrHis: 1.023 ± 0.639
5.881ThrIle: 5.881 ± 2.736
5.625ThrLys: 5.625 ± 1.583
3.068ThrLeu: 3.068 ± 1.527
1.278ThrMet: 1.278 ± 0.799
4.091ThrAsn: 4.091 ± 0.816
2.301ThrPro: 2.301 ± 0.824
1.278ThrGln: 1.278 ± 0.487
1.534ThrArg: 1.534 ± 0.474
5.369ThrSer: 5.369 ± 1.516
3.068ThrThr: 3.068 ± 0.949
3.068ThrVal: 3.068 ± 0.949
0.767ThrTrp: 0.767 ± 0.939
2.557ThrTyr: 2.557 ± 0.791
0.0ThrXaa: 0.0 ± 0.0
Val
1.79ValAla: 1.79 ± 0.713
1.534ValCys: 1.534 ± 0.741
1.278ValAsp: 1.278 ± 0.825
2.046ValGlu: 2.046 ± 0.953
2.301ValPhe: 2.301 ± 1.688
1.79ValGly: 1.79 ± 0.713
1.534ValHis: 1.534 ± 0.741
3.324ValIle: 3.324 ± 0.567
4.347ValLys: 4.347 ± 1.257
3.58ValLeu: 3.58 ± 2.47
0.767ValMet: 0.767 ± 0.65
3.58ValAsn: 3.58 ± 2.745
1.023ValPro: 1.023 ± 3.145
1.534ValGln: 1.534 ± 0.474
1.534ValArg: 1.534 ± 0.734
3.835ValSer: 3.835 ± 1.012
1.278ValThr: 1.278 ± 0.799
2.046ValVal: 2.046 ± 2.958
0.0ValTrp: 0.0 ± 0.0
2.046ValTyr: 2.046 ± 0.953
0.0ValXaa: 0.0 ± 0.0
Trp
0.511TrpAla: 0.511 ± 1.012
0.0TrpCys: 0.0 ± 0.0
0.511TrpAsp: 0.511 ± 0.32
0.256TrpGlu: 0.256 ± 0.16
1.278TrpPhe: 1.278 ± 0.337
0.256TrpGly: 0.256 ± 0.237
0.256TrpHis: 0.256 ± 0.237
0.767TrpIle: 0.767 ± 0.211
0.511TrpLys: 0.511 ± 0.32
1.534TrpLeu: 1.534 ± 0.423
0.256TrpMet: 0.256 ± 1.07
0.767TrpAsn: 0.767 ± 0.977
0.511TrpPro: 0.511 ± 0.475
0.511TrpGln: 0.511 ± 0.32
0.511TrpArg: 0.511 ± 0.158
1.278TrpSer: 1.278 ± 0.984
0.256TrpThr: 0.256 ± 0.237
0.256TrpVal: 0.256 ± 0.16
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.767TyrAla: 0.767 ± 1.043
1.023TyrCys: 1.023 ± 0.316
1.79TyrAsp: 1.79 ± 0.796
1.534TyrGlu: 1.534 ± 0.423
2.301TyrPhe: 2.301 ± 0.825
1.278TyrGly: 1.278 ± 0.968
1.278TyrHis: 1.278 ± 0.518
4.347TyrIle: 4.347 ± 1.45
4.347TyrLys: 4.347 ± 2.641
4.602TyrLeu: 4.602 ± 1.15
1.79TyrMet: 1.79 ± 0.796
2.813TyrAsn: 2.813 ± 0.748
1.023TyrPro: 1.023 ± 0.602
0.767TyrGln: 0.767 ± 0.479
1.278TyrArg: 1.278 ± 0.487
2.813TyrSer: 2.813 ± 0.788
1.534TyrThr: 1.534 ± 0.64
1.023TyrVal: 1.023 ± 0.639
0.511TyrTrp: 0.511 ± 0.158
0.511TyrTyr: 0.511 ± 0.475
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3912 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski