Amino acid dipepetide frequency for Alethinophid 1 reptarenavirus (isolate AlRrV1/Boa/USA/BC/2009) (Golden Gate virus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.126AlaAla: 3.126 ± 2.128
1.25AlaCys: 1.25 ± 0.743
1.563AlaAsp: 1.563 ± 0.461
2.813AlaGlu: 2.813 ± 0.94
1.563AlaPhe: 1.563 ± 0.876
2.501AlaGly: 2.501 ± 0.378
1.25AlaHis: 1.25 ± 0.285
4.064AlaIle: 4.064 ± 1.39
1.876AlaLys: 1.876 ± 0.579
5.939AlaLeu: 5.939 ± 3.087
0.938AlaMet: 0.938 ± 0.221
1.876AlaAsn: 1.876 ± 0.927
2.501AlaPro: 2.501 ± 1.048
0.625AlaGln: 0.625 ± 0.568
1.563AlaArg: 1.563 ± 0.381
3.126AlaSer: 3.126 ± 1.225
1.563AlaThr: 1.563 ± 0.461
3.126AlaVal: 3.126 ± 0.814
0.0AlaTrp: 0.0 ± 0.0
0.938AlaTyr: 0.938 ± 0.452
0.0AlaXaa: 0.0 ± 0.0
Cys
1.25CysAla: 1.25 ± 0.528
1.25CysCys: 1.25 ± 0.359
0.938CysAsp: 0.938 ± 0.307
1.563CysGlu: 1.563 ± 0.481
0.938CysPhe: 0.938 ± 0.711
0.625CysGly: 0.625 ± 0.48
0.625CysHis: 0.625 ± 0.323
1.25CysIle: 1.25 ± 0.285
2.501CysLys: 2.501 ± 0.591
4.689CysLeu: 4.689 ± 0.909
0.313CysMet: 0.313 ± 0.53
0.625CysAsn: 0.625 ± 0.301
1.25CysPro: 1.25 ± 0.924
0.625CysGln: 0.625 ± 0.714
0.938CysArg: 0.938 ± 0.711
1.563CysSer: 1.563 ± 1.505
0.313CysThr: 0.313 ± 0.151
1.876CysVal: 1.876 ± 1.302
0.0CysTrp: 0.0 ± 0.0
0.938CysTyr: 0.938 ± 0.475
0.0CysXaa: 0.0 ± 0.0
Asp
2.501AspAla: 2.501 ± 0.225
1.563AspCys: 1.563 ± 0.876
2.501AspAsp: 2.501 ± 0.669
4.064AspGlu: 4.064 ± 0.903
3.751AspPhe: 3.751 ± 1.078
2.813AspGly: 2.813 ± 0.704
1.876AspHis: 1.876 ± 0.847
3.439AspIle: 3.439 ± 1.759
4.064AspLys: 4.064 ± 0.657
9.065AspLeu: 9.065 ± 1.715
1.563AspMet: 1.563 ± 0.628
0.938AspAsn: 0.938 ± 0.877
2.501AspPro: 2.501 ± 1.154
1.25AspGln: 1.25 ± 0.273
2.188AspArg: 2.188 ± 1.055
4.689AspSer: 4.689 ± 0.661
3.126AspThr: 3.126 ± 0.763
3.439AspVal: 3.439 ± 0.89
1.25AspTrp: 1.25 ± 0.603
1.563AspTyr: 1.563 ± 0.458
0.0AspXaa: 0.0 ± 0.0
Glu
3.751GluAla: 3.751 ± 0.766
0.938GluCys: 0.938 ± 0.61
4.376GluAsp: 4.376 ± 1.302
5.939GluGlu: 5.939 ± 1.196
4.376GluPhe: 4.376 ± 1.302
2.188GluGly: 2.188 ± 0.244
0.625GluHis: 0.625 ± 0.799
6.252GluIle: 6.252 ± 0.795
3.126GluLys: 3.126 ± 1.087
8.44GluLeu: 8.44 ± 1.198
3.439GluMet: 3.439 ± 0.573
2.501GluAsn: 2.501 ± 0.887
4.689GluPro: 4.689 ± 1.235
1.25GluGln: 1.25 ± 0.603
4.064GluArg: 4.064 ± 1.219
5.627GluSer: 5.627 ± 1.055
5.002GluThr: 5.002 ± 0.786
3.126GluVal: 3.126 ± 0.454
0.313GluTrp: 0.313 ± 0.151
1.876GluTyr: 1.876 ± 0.786
0.0GluXaa: 0.0 ± 0.0
Phe
1.25PheAla: 1.25 ± 0.273
0.938PheCys: 0.938 ± 0.61
2.501PheAsp: 2.501 ± 1.205
2.501PheGlu: 2.501 ± 0.546
2.813PhePhe: 2.813 ± 0.215
3.439PheGly: 3.439 ± 1.204
1.25PheHis: 1.25 ± 0.359
1.876PheIle: 1.876 ± 0.518
5.002PheLys: 5.002 ± 0.863
4.689PheLeu: 4.689 ± 1.874
2.501PheMet: 2.501 ± 0.225
2.501PheAsn: 2.501 ± 0.673
2.501PhePro: 2.501 ± 0.794
1.563PheGln: 1.563 ± 0.381
2.501PheArg: 2.501 ± 0.225
4.376PheSer: 4.376 ± 1.089
1.876PheThr: 1.876 ± 0.968
1.876PheVal: 1.876 ± 1.232
0.0PheTrp: 0.0 ± 0.0
1.25PheTyr: 1.25 ± 0.781
0.0PheXaa: 0.0 ± 0.0
Gly
1.25GlyAla: 1.25 ± 0.273
1.563GlyCys: 1.563 ± 0.511
1.876GlyAsp: 1.876 ± 0.442
5.314GlyGlu: 5.314 ± 1.612
2.501GlyPhe: 2.501 ± 0.855
2.813GlyGly: 2.813 ± 1.27
0.938GlyHis: 0.938 ± 0.711
3.126GlyIle: 3.126 ± 0.735
2.188GlyLys: 2.188 ± 0.719
8.128GlyLeu: 8.128 ± 0.696
2.188GlyMet: 2.188 ± 0.794
0.938GlyAsn: 0.938 ± 0.307
4.064GlyPro: 4.064 ± 0.461
0.938GlyGln: 0.938 ± 0.452
3.439GlyArg: 3.439 ± 0.89
3.751GlySer: 3.751 ± 1.362
4.376GlyThr: 4.376 ± 1.089
0.938GlyVal: 0.938 ± 0.452
1.876GlyTrp: 1.876 ± 0.146
1.563GlyTyr: 1.563 ± 0.169
0.0GlyXaa: 0.0 ± 0.0
His
0.313HisAla: 0.313 ± 0.366
0.313HisCys: 0.313 ± 0.366
0.938HisAsp: 0.938 ± 0.423
2.501HisGlu: 2.501 ± 1.059
0.313HisPhe: 0.313 ± 0.151
1.25HisGly: 1.25 ± 0.285
1.25HisHis: 1.25 ± 0.359
1.25HisIle: 1.25 ± 0.285
0.625HisLys: 0.625 ± 0.301
1.25HisLeu: 1.25 ± 0.603
0.625HisMet: 0.625 ± 0.301
1.25HisAsn: 1.25 ± 1.107
0.625HisPro: 0.625 ± 0.732
0.625HisGln: 0.625 ± 0.323
0.938HisArg: 0.938 ± 0.452
2.188HisSer: 2.188 ± 0.946
0.625HisThr: 0.625 ± 0.301
0.938HisVal: 0.938 ± 1.199
0.0HisTrp: 0.0 ± 0.0
0.938HisTyr: 0.938 ± 0.452
0.0HisXaa: 0.0 ± 0.0
Ile
3.439IleAla: 3.439 ± 0.829
2.501IleCys: 2.501 ± 0.57
4.376IleAsp: 4.376 ± 0.634
4.064IleGlu: 4.064 ± 1.07
4.689IlePhe: 4.689 ± 2.432
3.439IleGly: 3.439 ± 0.994
1.25IleHis: 1.25 ± 0.603
2.501IleIle: 2.501 ± 0.663
4.064IleLys: 4.064 ± 1.161
6.877IleLeu: 6.877 ± 1.109
2.501IleMet: 2.501 ± 0.528
4.064IleAsn: 4.064 ± 0.804
1.25IlePro: 1.25 ± 0.285
2.188IleGln: 2.188 ± 0.716
2.501IleArg: 2.501 ± 1.414
7.19IleSer: 7.19 ± 0.124
5.314IleThr: 5.314 ± 0.929
3.439IleVal: 3.439 ± 1.349
0.0IleTrp: 0.0 ± 0.0
1.25IleTyr: 1.25 ± 0.273
0.0IleXaa: 0.0 ± 0.0
Lys
4.376LysAla: 4.376 ± 2.047
0.625LysCys: 0.625 ± 0.301
4.064LysAsp: 4.064 ± 1.919
4.689LysGlu: 4.689 ± 1.08
3.439LysPhe: 3.439 ± 0.417
5.627LysGly: 5.627 ± 0.684
1.25LysHis: 1.25 ± 0.524
3.751LysIle: 3.751 ± 0.855
6.252LysLys: 6.252 ± 2.413
8.44LysLeu: 8.44 ± 1.534
1.25LysMet: 1.25 ± 0.573
4.689LysAsn: 4.689 ± 1.272
3.751LysPro: 3.751 ± 1.023
2.501LysGln: 2.501 ± 0.794
4.064LysArg: 4.064 ± 1.219
4.689LysSer: 4.689 ± 1.444
4.064LysThr: 4.064 ± 0.563
5.939LysVal: 5.939 ± 0.821
1.25LysTrp: 1.25 ± 0.273
2.501LysTyr: 2.501 ± 0.794
0.0LysXaa: 0.0 ± 0.0
Leu
4.376LeuAla: 4.376 ± 0.89
1.876LeuCys: 1.876 ± 0.613
6.252LeuAsp: 6.252 ± 2.21
6.565LeuGlu: 6.565 ± 2.42
6.252LeuPhe: 6.252 ± 0.378
5.002LeuGly: 5.002 ± 1.49
1.563LeuHis: 1.563 ± 0.169
7.502LeuIle: 7.502 ± 1.957
11.254LeuLys: 11.254 ± 2.889
11.254LeuLeu: 11.254 ± 1.648
3.439LeuMet: 3.439 ± 0.413
7.815LeuAsn: 7.815 ± 0.652
3.126LeuPro: 3.126 ± 1.234
5.002LeuGln: 5.002 ± 1.22
5.627LeuArg: 5.627 ± 0.841
9.378LeuSer: 9.378 ± 0.686
4.376LeuThr: 4.376 ± 0.437
7.19LeuVal: 7.19 ± 1.408
1.563LeuTrp: 1.563 ± 0.593
3.126LeuTyr: 3.126 ± 0.296
0.0LeuXaa: 0.0 ± 0.0
Met
0.938MetAla: 0.938 ± 0.618
0.625MetCys: 0.625 ± 0.799
2.188MetAsp: 2.188 ± 0.473
1.876MetGlu: 1.876 ± 0.695
2.188MetPhe: 2.188 ± 0.794
1.25MetGly: 1.25 ± 0.359
0.313MetHis: 0.313 ± 0.4
1.876MetIle: 1.876 ± 0.613
2.501MetLys: 2.501 ± 0.42
2.813MetLeu: 2.813 ± 0.426
1.563MetMet: 1.563 ± 1.498
1.876MetAsn: 1.876 ± 0.904
0.625MetPro: 0.625 ± 0.262
0.625MetGln: 0.625 ± 0.323
1.563MetArg: 1.563 ± 0.481
3.439MetSer: 3.439 ± 0.535
1.25MetThr: 1.25 ± 0.528
3.126MetVal: 3.126 ± 0.668
0.0MetTrp: 0.0 ± 0.0
0.313MetTyr: 0.313 ± 0.151
0.0MetXaa: 0.0 ± 0.0
Asn
1.25AsnAla: 1.25 ± 0.883
0.938AsnCys: 0.938 ± 0.452
3.439AsnAsp: 3.439 ± 0.89
3.439AsnGlu: 3.439 ± 0.417
2.188AsnPhe: 2.188 ± 0.712
2.501AsnGly: 2.501 ± 0.42
0.938AsnHis: 0.938 ± 0.452
3.751AsnIle: 3.751 ± 1.175
6.877AsnLys: 6.877 ± 1.788
3.126AsnLeu: 3.126 ± 0.454
0.625AsnMet: 0.625 ± 0.323
3.126AsnAsn: 3.126 ± 0.668
2.813AsnPro: 2.813 ± 0.572
1.25AsnGln: 1.25 ± 0.981
1.563AsnArg: 1.563 ± 0.58
3.751AsnSer: 3.751 ± 0.622
2.501AsnThr: 2.501 ± 0.42
2.813AsnVal: 2.813 ± 0.783
0.938AsnTrp: 0.938 ± 0.452
1.25AsnTyr: 1.25 ± 1.599
0.0AsnXaa: 0.0 ± 0.0
Pro
2.501ProAla: 2.501 ± 1.645
1.563ProCys: 1.563 ± 1.207
3.126ProAsp: 3.126 ± 1.225
3.439ProGlu: 3.439 ± 0.949
1.563ProPhe: 1.563 ± 0.169
1.25ProGly: 1.25 ± 0.285
0.313ProHis: 0.313 ± 0.366
1.876ProIle: 1.876 ± 0.613
4.376ProLys: 4.376 ± 0.981
4.376ProLeu: 4.376 ± 1.264
1.25ProMet: 1.25 ± 0.914
1.876ProAsn: 1.876 ± 0.442
1.563ProPro: 1.563 ± 0.876
1.25ProGln: 1.25 ± 0.603
2.188ProArg: 2.188 ± 0.716
3.751ProSer: 3.751 ± 1.078
2.813ProThr: 2.813 ± 0.444
2.188ProVal: 2.188 ± 0.612
0.0ProTrp: 0.0 ± 0.0
1.563ProTyr: 1.563 ± 0.461
0.0ProXaa: 0.0 ± 0.0
Gln
0.625GlnAla: 0.625 ± 0.301
0.938GlnCys: 0.938 ± 0.221
1.876GlnAsp: 1.876 ± 0.146
1.25GlnGlu: 1.25 ± 0.603
1.25GlnPhe: 1.25 ± 0.285
3.439GlnGly: 3.439 ± 0.535
0.313GlnHis: 0.313 ± 0.366
3.126GlnIle: 3.126 ± 0.296
2.188GlnLys: 2.188 ± 0.651
2.501GlnLeu: 2.501 ± 0.42
0.313GlnMet: 0.313 ± 0.151
0.625GlnAsn: 0.625 ± 0.301
0.313GlnPro: 0.313 ± 0.151
1.563GlnGln: 1.563 ± 0.458
1.563GlnArg: 1.563 ± 0.876
1.876GlnSer: 1.876 ± 0.904
0.938GlnThr: 0.938 ± 0.221
0.938GlnVal: 0.938 ± 0.221
0.313GlnTrp: 0.313 ± 0.4
0.625GlnTyr: 0.625 ± 0.301
0.0GlnXaa: 0.0 ± 0.0
Arg
0.938ArgAla: 0.938 ± 0.452
0.625ArgCys: 0.625 ± 0.48
3.439ArgAsp: 3.439 ± 0.487
2.501ArgGlu: 2.501 ± 0.673
0.938ArgPhe: 0.938 ± 0.221
1.876ArgGly: 1.876 ± 0.442
0.313ArgHis: 0.313 ± 0.151
3.126ArgIle: 3.126 ± 0.296
2.813ArgLys: 2.813 ± 0.215
7.502ArgLeu: 7.502 ± 1.095
1.876ArgMet: 1.876 ± 0.512
1.876ArgAsn: 1.876 ± 0.518
1.563ArgPro: 1.563 ± 0.461
1.876ArgGln: 1.876 ± 0.579
2.501ArgArg: 2.501 ± 0.225
4.064ArgSer: 4.064 ± 1.161
4.064ArgThr: 4.064 ± 0.657
3.439ArgVal: 3.439 ± 0.413
0.0ArgTrp: 0.0 ± 0.0
1.25ArgTyr: 1.25 ± 0.603
0.0ArgXaa: 0.0 ± 0.0
Ser
2.813SerAla: 2.813 ± 0.444
2.501SerCys: 2.501 ± 1.471
6.252SerAsp: 6.252 ± 0.632
7.19SerGlu: 7.19 ± 0.463
3.439SerPhe: 3.439 ± 0.994
3.751SerGly: 3.751 ± 1.215
1.876SerHis: 1.876 ± 0.613
5.314SerIle: 5.314 ± 1.027
4.689SerLys: 4.689 ± 0.98
7.502SerLeu: 7.502 ± 1.412
1.563SerMet: 1.563 ± 0.458
5.002SerAsn: 5.002 ± 1.49
3.126SerPro: 3.126 ± 1.394
1.876SerGln: 1.876 ± 0.55
2.501SerArg: 2.501 ± 0.546
5.627SerSer: 5.627 ± 0.853
5.627SerThr: 5.627 ± 1.162
4.689SerVal: 4.689 ± 0.559
1.563SerTrp: 1.563 ± 0.169
2.501SerTyr: 2.501 ± 0.851
0.0SerXaa: 0.0 ± 0.0
Thr
4.064ThrAla: 4.064 ± 1.439
1.25ThrCys: 1.25 ± 0.359
3.439ThrAsp: 3.439 ± 0.709
2.813ThrGlu: 2.813 ± 0.572
1.876ThrPhe: 1.876 ± 0.695
3.439ThrGly: 3.439 ± 0.99
0.938ThrHis: 0.938 ± 0.307
5.002ThrIle: 5.002 ± 0.701
5.314ThrLys: 5.314 ± 1.283
5.939ThrLeu: 5.939 ± 0.812
2.813ThrMet: 2.813 ± 1.072
1.876ThrAsn: 1.876 ± 0.146
2.813ThrPro: 2.813 ± 0.646
0.313ThrGln: 0.313 ± 0.366
2.188ThrArg: 2.188 ± 1.055
5.002ThrSer: 5.002 ± 0.615
2.501ThrThr: 2.501 ± 0.692
2.188ThrVal: 2.188 ± 1.136
0.938ThrTrp: 0.938 ± 0.307
1.25ThrTyr: 1.25 ± 0.603
0.0ThrXaa: 0.0 ± 0.0
Val
2.188ValAla: 2.188 ± 0.244
2.188ValCys: 2.188 ± 1.157
2.813ValAsp: 2.813 ± 0.993
5.002ValGlu: 5.002 ± 1.122
1.563ValPhe: 1.563 ± 0.461
3.439ValGly: 3.439 ± 0.278
0.625ValHis: 0.625 ± 0.568
4.689ValIle: 4.689 ± 1.837
3.751ValLys: 3.751 ± 1.853
6.252ValLeu: 6.252 ± 1.336
0.625ValMet: 0.625 ± 0.301
4.376ValAsn: 4.376 ± 1.412
3.126ValPro: 3.126 ± 0.338
0.625ValGln: 0.625 ± 0.301
3.439ValArg: 3.439 ± 0.822
2.813ValSer: 2.813 ± 0.475
3.751ValThr: 3.751 ± 0.912
5.627ValVal: 5.627 ± 1.165
0.0ValTrp: 0.0 ± 0.0
1.563ValTyr: 1.563 ± 0.611
0.0ValXaa: 0.0 ± 0.0
Trp
0.938TrpAla: 0.938 ± 0.452
0.313TrpCys: 0.313 ± 0.53
0.0TrpAsp: 0.0 ± 0.0
0.938TrpGlu: 0.938 ± 0.221
0.938TrpPhe: 0.938 ± 0.452
1.25TrpGly: 1.25 ± 0.646
0.313TrpHis: 0.313 ± 0.151
1.25TrpIle: 1.25 ± 0.359
1.25TrpLys: 1.25 ± 0.273
1.25TrpLeu: 1.25 ± 0.603
0.313TrpMet: 0.313 ± 0.151
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.313TrpThr: 0.313 ± 0.151
0.938TrpVal: 0.938 ± 0.423
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.313TyrAla: 0.313 ± 0.366
0.313TyrCys: 0.313 ± 0.151
2.188TyrAsp: 2.188 ± 1.055
3.439TyrGlu: 3.439 ± 0.994
0.625TyrPhe: 0.625 ± 0.301
1.876TyrGly: 1.876 ± 0.613
0.938TyrHis: 0.938 ± 0.423
1.876TyrIle: 1.876 ± 0.442
2.813TyrLys: 2.813 ± 0.646
2.501TyrLeu: 2.501 ± 0.851
0.938TyrMet: 0.938 ± 0.423
1.25TyrAsn: 1.25 ± 0.781
0.625TyrPro: 0.625 ± 0.262
0.625TyrGln: 0.625 ± 0.262
1.25TyrArg: 1.25 ± 0.603
2.501TyrSer: 2.501 ± 0.851
1.563TyrThr: 1.563 ± 1.166
0.625TyrVal: 0.625 ± 0.301
0.0TyrTrp: 0.0 ± 0.0
1.25TyrTyr: 1.25 ± 0.359
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3200 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski