CN112752266B - Joint spectrum access and power control method in D2D haptic communication - Google Patents
Joint spectrum access and power control method in D2D haptic communication Download PDFInfo
- Publication number
- CN112752266B CN112752266B CN202011587066.7A CN202011587066A CN112752266B CN 112752266 B CN112752266 B CN 112752266B CN 202011587066 A CN202011587066 A CN 202011587066A CN 112752266 B CN112752266 B CN 112752266B
- Authority
- CN
- China
- Prior art keywords
- destination
- error rate
- source
- packet error
- source end
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000004891 communication Methods 0.000 title claims abstract description 56
- 230000006854 communication Effects 0.000 title claims abstract description 55
- 238000000034 method Methods 0.000 title claims abstract description 27
- 238000001228 spectrum Methods 0.000 title claims abstract description 25
- 230000007246 mechanism Effects 0.000 claims abstract description 33
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 25
- 230000005540 biological transmission Effects 0.000 claims abstract description 20
- 238000005457 optimization Methods 0.000 claims abstract description 13
- 230000002860 competitive effect Effects 0.000 claims abstract description 12
- 230000002441 reversible effect Effects 0.000 claims abstract description 11
- 230000001360 synchronised effect Effects 0.000 claims abstract description 6
- 239000013598 vector Substances 0.000 claims description 13
- 230000006870 function Effects 0.000 claims description 8
- 238000004364 calculation method Methods 0.000 claims description 5
- 230000003993 interaction Effects 0.000 claims description 3
- 239000002131 composite material Substances 0.000 claims description 2
- -1 hydrogen Chemical class 0.000 claims 1
- 239000001257 hydrogen Substances 0.000 claims 1
- 238000013468 resource allocation Methods 0.000 description 7
- 238000010586 diagram Methods 0.000 description 5
- 230000007423 decrease Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000004088 simulation Methods 0.000 description 2
- 230000009471 action Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000013475 authorization Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000002457 bidirectional effect Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005562 fading Methods 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000035807 sensation Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W16/00—Network planning, e.g. coverage or traffic planning tools; Network deployment, e.g. resource partitioning or cells structures
- H04W16/14—Spectrum sharing arrangements between different networks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/14—Network analysis or design
- H04L41/145—Network analysis or design involving simulating, designing, planning or modelling of a network
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W28/00—Network traffic management; Network resource management
- H04W28/02—Traffic management, e.g. flow control or congestion control
- H04W28/0231—Traffic management, e.g. flow control or congestion control based on communication conditions
- H04W28/0242—Determining whether packet losses are due to overload or to deterioration of radio communication conditions
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W74/00—Wireless channel access
- H04W74/08—Non-scheduled access, e.g. ALOHA
- H04W74/0833—Random access procedures, e.g. with 4-step access
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W76/00—Connection management
- H04W76/10—Connection setup
- H04W76/14—Direct-mode setup
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D30/00—Reducing energy consumption in communication networks
- Y02D30/70—Reducing energy consumption in communication networks in wireless communication networks
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Mobile Radio Communication Systems (AREA)
Abstract
A joint spectrum access and power control method in D2D haptic communication relates to the technical field of wireless communication. The invention randomly accesses a resource block to each D2D pair according to a frequency spectrum access mechanism based on competition; calculating the packet error rate of a D2D forward link under a competitive access mechanism, namely from a source end to a destination end; calculating the packet error rate of a D2D reverse link under a competitive access mechanism, namely from a destination end to a source end; calculating the packet error rate of the D2D haptic communication closed loop; establishing an optimization problem, namely minimizing the total packet error rate of the system; modeling an optimization problem as a game; and solving the game Nash equilibrium by adopting a synchronous logarithmic linear learning algorithm. Interference among users can be coordinated through reasonable power control, resource scheduling delay can be effectively unloaded, and the error probability of data packet transmission is reduced, so that the delay and reliability requirements of D2D tactile communication are met.
Description
Technical Field
The invention relates to the technical field of wireless communication, in particular to a combined spectrum access and power control method for D2D haptic communication.
Background
With the development of wireless communication technology, people no longer satisfy traditional audio-video communication, but hope to obtain more immersive user experience. Based on this, haptic communication technology has been developed. The haptic communication mainly transmits human haptic signals (such as strength, torque, speed, and the like), and can provide multi-dimensional sensing information for a user. Therefore, the haptic communication has a wide application prospect, such as virtual reality, automatic driving, smart medical treatment, and the like.
The haptic communication can be divided into remote haptic communication and local haptic communication according to the distance between the control terminal and the operation terminal. For tele-tactile communication, it is necessary to transmit signals by means of a network infrastructure (network domain); for local tactile communication, the control end and the operation end can directly communicate in a device-to-device (D2D) mode, so that the spectrum utilization efficiency is effectively improved, and the transmission delay is reduced.
However, implementing D2D haptic communication is more challenging than traditional D2D audiovisual communication. Specifically, the audio/video stream has a high throughput requirement, so that only rich bandwidth resources are allocated, and the audio/video stream is generally unidirectional. The difference is that the transmission of the haptic flow is bidirectional transmission, and the transmission comprises that the control end sends a control signal to the operation end on a forward link, and the operation end feeds back the haptic signal to the control end on a reverse link, so that a closed control loop is formed. Furthermore, to ensure the immersive experience of the user, the entire control loop has very strict requirements on latency and reliability, and thus the haptic communication belongs to an ultra-reliable low-latency communication (URLLC) scenario.
Radio resource allocation, including spectrum access and power control, is a key step in implementing D2D haptic communication, as it directly impacts the latency and reliability performance of the communication. Currently, there is a small body of literature investigating the problem of haptic communication resource management. The literature [ heated Human-in-the-Loop Mobile Networks: A Radio Resource Allocation permission on Haptical Communications, IEEE trans. Wireless.Commun., vol.17, No.7, pp.4493-4508,2018] proposes a greedy algorithm with low complexity and a greedy Resource Allocation scheme close to the optimal solution, respectively, for two Resource Allocation situations of perceptual coding and symmetric design. The document Fog Computing for 5G Tactile Industrial Internet of Things QoE-Aware Resource Allocation Model, IEEE trans. Ind. Informatt, vol.15, No.5, pp.3085-3092,2019 proposes a dynamic Resource Allocation Model based on user experience to cope with haptic communication applications in the Internet of Things, and implements the Model using Java.
However, the above documents are all based on a request-grant mechanism, i.e. the user needs to make a resource scheduling request to the base station and wait for the base station to grant before using the spectrum resource. Obviously, this process will degrade the end-to-end latency to the point that the latency requirements for haptic communication cannot be met.
Disclosure of Invention
The invention mainly aims at the problem that the existing resource allocation scheme can cause the increase of D2D tactile communication time delay and the reduction of reliability, and provides a combined spectrum access and power control method. The method can effectively eliminate the resource scheduling time delay and reduce the error probability of data packet transmission, thereby meeting the time delay and reliability requirements of D2D tactile communication.
The invention provides a method for combining spectrum access and power control in D2D haptic communication, which comprises the following steps:
the method comprises the following steps: for the local tactile communication scene, the control end and the operation end can directly communicate by means of D2D. Thus, the control node and the operation node form a D2D pair, wherein the control node is the source node and the operation node is the destination node. Suppose there are N D2D pairs and K orthogonal resource blocks (N) in a cell>K) Respectively usingAndand (4) showing. Wherein, D2D uses e for the source end and the destination end of n respectivelynAnd rnRepresenting that all sources and destinations are formed separatelyAndeach D2D pair randomly accesses one resource block according to the contention-based spectrum access mechanism.
Step two: calculating the packet error rate of the D2D forward link under the competitive access mechanism, namely the destination end r nPacket error rate ofThe forward link is from a source end to a destination end; according to short data packet formula under finite block length, packet error rateDepending on the destination rnReceived signal-to-noise ratio ofDestination terminal r under calculation competition access mechanismnReceived signal-to-noise ratio of
Step three: calculating the packet error rate of the D2D reverse link under the competitive access mechanism, namely the source end enPacket error rate ofThe reverse link is from a destination end to a source end; according to the short data packet formula under the finite block length, the packet error rateDependent on source enReceived signal-to-noise ratio ofSource end e under calculation competition access mechanismnReceived signal-to-noise ratio of
Step four: the packet error rate of the D2D haptic communication closed loop is calculated. According to step two and step three, the probability of successful transmission of the D2D haptic communication closed loop is
Thus, the closed loop packet error rate can be approximated as
Step five: and establishing an optimization problem. The total packet error rate of N D2D pairs in the network is
Based on this, under the contention access mechanism, the optimization goal of power control is to minimize the total packet error rate of the system, i.e. the total packet error rate
s.t.
Wherein, P ═ P (P)1,...,pn,...,pN) For N D2D pairs of transmit power vectors,consists of the transmitting power of a source end and a destination end, S is the maximum transmitting power grade which can be adopted by the source end and the destination end, and epsilon thA closed loop packet error rate threshold for each D2D pair.
Step six: the optimization problem is modeled as a game. The optimization problem is a mixed integer nonlinear programming problem and is solved by adopting a game theory. Building (2)The standing game model isWhereinIn order to be a collection of gaming participants,in order to be a policy space, the policy space,is the set of policies for game participant n. u. ofn(pn,p-n) Representing the utility function of a game participant n, where pnFor the transmission power of game participants n, p-nIs the transmission power of the other gaming participants than gaming participant n. Thus, the proposed power control bet can be modeled as
Step seven: and solving game Nash equilibrium. By adopting a synchronous log linear learning (SLL) algorithm, the optimal Nash equilibrium solution can be converged on the premise of no information interaction and coordination mechanism, namely the sum of the packet error rates of all D2D pairs under a competitive access mechanism is minimized. The SLL algorithm is based on the boltzmann exploration strategy, i.e., the probability of a participant selecting a strategy with higher utility is greater than the probability with a strategy with lower utility. Therefore, the boltzmann exploration strategy is considered to be an effective method for getting rid of local optima and finally achieving global optima.
The invention provides a resource access mechanism based on competition. Specifically, a number of orthogonal resource blocks are first reserved for D2D haptic communication. When a data packet needs to be transmitted, the source end or the destination end can randomly access a resource block on a forward link or a reverse link without sending a resource scheduling request and waiting for authorization of a base station. Although the access mechanism can effectively unload the scheduling delay, due to the random access, when two or more users access the same resource block, mutual interference occurs among the users, and the reliability of transmission is affected. Then, interference among users can be coordinated through reasonable power control, so that the time delay and reliability requirements of D2D haptic communication are met simultaneously.
The invention provides a D2D haptic communication-oriented combined spectrum access and power control method, which can effectively unload resource scheduling delay and reduce the error probability of data packet transmission, thereby meeting the delay and reliability requirements of D2D haptic communication. The method comprises the steps of firstly adopting a frequency spectrum access mode based on a competition mechanism to unload resource scheduling time delay, and then reducing the packet error rate of transmission through reasonable power control under the mechanism. Specifically, the power control problem is converted into a non-cooperative game model, and a distributed learning algorithm is adopted to solve Nash equilibrium.
Drawings
FIG. 1 is a diagram of a D2D haptic communication model of the present invention.
Fig. 2 is a diagram illustrating interference analysis between D2D pairs under the contention mechanism of the present invention.
Fig. 3 is a flow chart of the joint spectrum access and power control of the present invention.
Fig. 4a is a schematic diagram showing the comparison between the algorithm of the present invention and the optimal solution for different iteration numbers about the total packet error rate.
Fig. 4b is a graph showing the power selection comparison of different pairs of D2D for different iterations.
Fig. 5a is a schematic diagram of a relationship between packet error rate and packet length under different algorithm environments.
Fig. 5b is a schematic diagram of a relationship between packet error rate and number of resource blocks in different algorithm environments.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
The D2D haptic communication model is further described with reference to FIG. 1. A D2D pair consists of a source and a destination. In haptic communication, the source terminal is typically a haptic device, i.e., human action, sensation can be converted into a haptic signal input through various encoding techniques. The destination is typically a remote control, i.e. it can interact with the environment and feed back tactile signals to the source.
The communication process of D2D haptic communication is described as follows. First, the source peer transmits a control signal to the destination peer over the forward link. The destination then interacts with the environment based on the received control signals and sends haptic feedback signals to the source via the reverse link, forming a closed-loop D2D haptic communication model. In the present invention, the source and destination of multiple D2D are strictly synchronized in time when transmitting.
Referring to fig. 2, the interference generated between the pair of D2D in the contention mechanism is further analyzed. Analyzing the destination r for the forward link example1The interference experienced. Considering the finiteness of the transmitting power and the fading characteristics of the wireless channel, each source end can only affect the destination end within a certain coverage range. For example, the pink shaded area can be regarded as the source end e 3Of the source end e3Destination ends within the coverage area can be affected, but destination ends outside the coverage area cannot be affected. Assume that 3D 2D pairs in the figure access the same resource block simultaneously. Then, the destination terminal r1Will be received by the source e2Without being influenced by the source e3Due to the destination end r1At the source end e2Is not in the source end e3Within the coverage of (c). Similarly, on the reverse link, the source e1Will receive the destination end r3Because of the source end e1At the destination end r3Within the coverage of (c).
The implementation steps of the present invention will be further explained with reference to fig. 3:
the method comprises the following steps: suppose that N D2D pairs and K orthogonal resource blocks coexist in a cell and are used respectivelyAndand (4) showing. Wherein D2D is divided into source end and destination end of nFor use of enAnd rnRepresenting that all sources and destinations are formed separatelyAndeach D2D pair randomly accesses one resource block according to the contention-based spectrum access mechanism.
Step two: calculating the packet error rate of the D2D forward link (source end to destination end) under the competitive access mechanism, namely the destination end rnThe packet error rate of (d). Under a competitive access mechanism, a user randomly accesses one resource block in an equal probability mode. At this time, if D2D accesses resource block k to n, the destination r nAt a received signal-to-noise ratio SNR of
Wherein,as a source end enTransmit power of H (e)n,rn) As a source end enTo the destination end rnChannel gain of σ21/K is the probability that D2D accesses resource block K to i;is a vector of the source transmit power,is in a vector PeAll to the destination end rnHas a signal-to-noise ratio exceeding gammath1The source terminal set of (2); wherein, γth1To the destination end rnIf other sources go to rnSNR of greater than gammath1Then, it will be directed to the destination rnWill not otherwise interfere with the reception of rnThe reception of (2) causes an impact; e.g. of the typemIs DThe source end of the 2D pair m,as a source end emTransmit power of H (e)m,rn) As a source end emTo the destination end rnThe channel gain of (a); e.g. of the typeiThe source end of pair i of D2D,as a source end eiTransmit power of H (e)i,rn) As a source end eiTo the destination end rnThe channel gain of (1).
Since D2D randomly accesses resource block n under the contention mechanism, the received signal-to-noise ratio relative to K resource blocks is
According to the short packet formula under the finite block length, the packet error rate of the D2D forward link (source end to destination end) is
Wherein,is a Gaussian Q function, t is an integral variable, dt is a first derivative of t,l is the packet length, η is the number of information bits transmitted, and
Step three: calculating the packet error rate of the D2D reverse link (from the destination end to the source end) under the competitive access mechanism, namely the source end enThe packet error rate of (d). Under the contention mechanism, the received signal-to-noise ratio of D2D on K resource blocks to n is
Wherein,for the destination end rnTransmit power of H (e)n,rn) As a source end enTo the destination end rnChannel gain of σ21/K is the probability that D2D accesses resource block K to i;for the vector of transmit powers of the destination,is in a vector PrDown, all to the source end enHas a signal-to-noise ratio exceeding gammath2In which γth2To the source end enA signal-to-noise ratio threshold causing interference; r ismFor the destination of D2D pair m,to a destination end rmTransmit power of H (e)n,rm) As a source end enTo the destination end rmThe channel gain of (a); r isiFor the destination of D2D pair i,to a destination end riTransmit power of H (e)n,ri) As a source end enTo the destination end riThe channel gain of (1).
According to the short packet formula under the finite block length, the packet error rate of the reverse link (destination end to source end) of D2D is
Wherein,is a Gaussian Q function, t is an integral variable, dt is a first derivative of t,l is the packet length, η is the number of information bits transmitted, and
step four: the packet error rate of the D2D haptic communication closed loop is calculated. According to step two and step three, the probability of successful transmission of the D2D haptic communication closed loop is
Due to the fact thatAndis substantially 10-3~10-5Of order of magnitude, therefore, the closed loop packet error rate can be approximated as
Step five: and establishing an optimization problem. The total packet error rate of N D2D pairs in the network is
Based on this, under the contention access mechanism, the optimization goal of power control is to minimize the total packet error rate of the system, i.e. the total packet error rate
s.t.
Wherein, P ═ P (P)1,...,pn,...,pN) For N D2D pairs of transmit power vectors,consists of the transmitting power of a source end and a destination end, S is the maximum transmitting power grade which can be adopted by the source end and the destination end, and epsilonthA closed loop packet error rate threshold for each D2D pair.
Step six: the optimization problem is modeled as a game. The optimization problem is a mixed integer nonlinear programming problem and is solved by adopting a game theory. The game model isWhereinIn order to be a collection of gaming participants,in order to be a policy space, the policy space,is the set of policies for game participant n. Due to the fact thatTherefore, it is not only easy to useCan be expressed as
un(pn,p-n) Representing the utility function of a game participant n, where pnFor the transmission power of game participants n, p-nIs the transmission power of the other gaming participants than gaming participant n. In this model, un(pn,p-n) Is defined as
Wherein r isiFor the destination of D2D pair i,to a destination end riThe packet error rate of (2); e.g. of the typejBeing the source end of D2D pair j, As a source end ejThe packet error rate of (d);as source end enSet of destinations affected when transmitting at transmit power S, i.e.WhereinIn the same way, the method for preparing the composite material,is as the destination terminal rnSet of source terminals affected when transmitting at a transmit power SI.e. byWhereinTherefore, unThe physical meaning is the sum of the packet error rates of all the sources and destinations affected by the game participant n.
Thus, the proposed power control game can be modeled as
Step seven: and solving game Nash equilibrium. By adopting a synchronous log linear learning (SLL) algorithm, the optimal Nash equilibrium solution can be converged on the premise of no information interaction and coordination mechanism, namely the sum of the packet error rates of all D2D pairs under a competitive access mechanism is minimized. The algorithm is based on the boltzmann exploration strategy, i.e. the probability of a participant selecting a strategy with higher utility is greater than the probability of a strategy with lower utility. Therefore, the boltzmann exploration strategy is considered to be an effective method for getting rid of local optima and finally achieving global optima. The proposed algorithm is as follows:
algorithm 1 power control algorithm based on SLL
Initialization: setting the initial iteration time t as 0, and initializing power vectors p (0) of all game participants as p1(0),...,pN(0) Therein of And is provided withIn addition, a binary indication variable is set
All game participants simultaneously perform the following steps:
Repeat
t=t+1
Wherein p' is the strategy taken by the game participant n at time t;representation collectionNumber of middle elements, δnThe exploration rate for the gaming participant n. Furthermore, if pn(t)≠pn(t-1) setting bn(t) ═ 1; otherwise, set bn(t)=0。
End if
If bn(t-1) ═ 1, game participant n updates its policy according to the following rulesWherein, beta is a learning parameter,andthe utility of the betting participant n at times t-1 and t-2 respectively,then, set up bn(t)=0。
End if
Setting p (t) ═ p (t-1)
End if
The effect of the invention will be further described with reference to fig. 4 and 5:
the simulation in the invention is based on software MatlabR2016 a. The simulation parameter parameters are set as follows: the number of the D2D pairs is 10, the number of the resource blocks is 3, the actual quantized power set is {0.1,0.2,0.3}, the noise power is-100 dBm, the distance from the source end to the destination end of the D2D is 15m, and the threshold r of the signal-to-noise ratio isth20dB, 400bits of information bits, and 50channel use of packet length. Thus, the set of policies for each gaming participant is
Fig. 4 analyzes the convergence of the proposed game learning algorithm. In fig. 4a, the proposed algorithm gradually converges to the optimal power selection strategy over a number of iterations. It should be noted that even if the algorithm reaches optimal power selection, it is still possible to jump out of the current scheme to explore unknown areas, as shown by the red circles. Finally, when the number of iterations is large enough, the proposed algorithm will converge. Fig. 4b further illustrates the convergence of the algorithm. We randomly picked 4D 2D pairs from 10D 2D pairs, denoted D2D pair 1, D2D pair 2, D2D pair 3 and D2D pair 4 respectively. When the number of algorithm iterations is large enough, the power selection of each D2D pair will converge to a particular strategy.
Fig. 5 shows the relation between the packet error rate and the packet length and the number of resource blocks, respectively. Wherein "BR" is an optimal response algorithm, i.e. in each iteration of the algorithm, the user selects the strategy that maximizes his own benefit. As shown in fig. 5a, the packet error rate of all algorithms decreases as the packet length increases until it approaches 0. In fig. 5b, as the number of resource blocks increases, the packet error rate of all algorithms is decreasing. This is because, as the number of resource blocks increases, the probability that a user will access the same resource block will decrease. In addition, when K ≧ 10, each user can be assigned a dedicated resource block, so no interference occurs between each other.
Claims (9)
1. A method for combining spectrum access and power control in D2D haptic communication is characterized by comprising the following steps:
the method comprises the following steps: for a local tactile communication scene, a control end and an operation end are directly communicated in a D2D mode, and the control end and the operation end form a D2D pair, wherein the control end is a source end, and the operation end is a destination end; suppose there are N D2D pairs and K orthogonal resource blocks in a cell, where N is>K, respectively usingAndrepresents; wherein, D2D uses e for the source end and the destination end of n respectively nAnd rnMeaning that all sources and destinations constitute e ═ e, respectively1,...,en,...,eNAndeach D2D pair randomly accesses a resource block according to a frequency spectrum access mechanism based on competition;
step two: calculating the packet error rate of the D2D forward link under the competitive access mechanism, namely the destination end rnPacket error rate ofThe forward link is from a source end to a destination end; according to the short data packet formula under the finite block length, the packet error rateDepending on the destination rnReceived signal-to-noise ratio ofCalculating contestDestination terminal r under contention access mechanismnReceived signal-to-noise ratio of
Step three: calculating the packet error rate of the D2D reverse link under the competitive access mechanism, namely the source end enPacket error rate ofThe reverse link is from a destination end to a source end; according to the short data packet formula under the finite block length, the packet error rateDependent on source enReceived signal-to-noise ratio ofSource end e under calculation competition access mechanismnReceived signal-to-noise ratio of
Step four: calculating the packet error rate of the D2D tactile communication closed loop, wherein the successful transmission probability of the D2D tactile communication closed loop is that according to the step two and the step three
Thus, the closed loop packet error rate is approximately
Step five: the optimization problem is established, and the total packet error rate of N D2D pairs in the network is
Based on this, under the contention access mechanism, the optimization goal of power control is to minimize the total packet error rate of the system, i.e. the total packet error rate
s.t.
Wherein, P ═ P (P)1,...,pn,...,pN) For N D2D pairs of transmit power vectors,consists of the transmitting power of a source end and a destination end, S is the maximum transmitting power grade which can be adopted by the source end and the destination end, and epsilonthA closed loop packet error rate threshold for each D2D pair;
step six: modeling the optimization problem as a game and establishing a game model asWhereinIn order to be a collection of gaming participants,in order to be a policy space, the policy space,set of policies for game participant n, un(pn,p-n) Representing the utility function of a game participant n, where pnFor the transmission power of game participants n, p-nFor the transmission power of other game participants than game participant n, the proposed power control game is modeled as
Step seven: and (3) solving game Nash equilibrium, adopting a synchronous logarithmic linear learning algorithm, and converging to an optimal Nash equilibrium solution on the premise of no information interaction and coordination mechanism, namely minimizing the sum of the packet error rates of all D2D pairs under a competitive access mechanism.
2. The method of joint spectrum access and power control in D2D haptic communication of claim 1, wherein transmissions of source and destination ends of D2D are synchronized in time.
3. The method of claim 1, wherein the destination-side packet error rate in step two is determined by combining spectrum access and power control in D2D haptic communication And destination terminal rnReceived signal-to-noise ratio ofThe relationship of (1) is:
4. the method for joint spectrum access and power control in D2D haptic communication according to claim 3, wherein the destination r isnReceived signal-to-noise ratio ofThe calculation method of (2) is as follows: assuming that D2D accesses resource block k for n at this time, the destination rnAt a received signal-to-noise ratio SNR of
Wherein,as a source end enTransmit power of H (e)n,rn) As a source end enTo the destination end rnChannel gain of σ21/K is the probability that D2D accesses resource block K to i;is a vector of the source transmit power,is in a vector PeAll to the destination end rnHas a signal-to-noise ratio exceeding gammath1The source terminal set of (2); wherein, γth1To the destination end rnIf other sources go to rnSNR of greater than gammath1Then, it will be directed to the destination rnWill not otherwise interfere with the reception of rnThe reception of (2) causes an impact; e.g. of the typemThe source end of D2D pair m,as a source end emTransmit power of H (e)m,rn) As a source end emTo the destination end rnThe channel gain of (a); e.g. of the typeiThe source end of pair i of D2D, As a source end eiTransmit power of H (e)i,rn) As a source end eiTo the destination end rnThe channel gain of (a);
since D2D randomly accesses resource block n under the contention mechanism, the received signal-to-noise ratio relative to K resource blocks is
5. The method of claim 1, wherein the source-end packet error rate in step three is determined by combining spectrum access and power control in D2D haptic communicationAnd source end enReceived signal-to-noise ratio ofThe relationship of (1) is:
6. the method of claim 5, wherein the source e is a source e for joint spectrum access and power control in D2D haptic communicationnReceived signal-to-noise ratio ofThe calculation method of (2) is as follows: under the contention mechanism, the received signal-to-noise ratio of D2D on K resource blocks to n is
Wherein,to a destination end rnTransmit power of H (e)n,rn) As a source end enTo the destination end rnChannel gain of σ21/K is the probability that D2D accesses resource block K to i;for the vector of transmit powers of the destination,is in a vector PrDown, all to the source end enHas a signal-to-noise ratio exceeding gammath2In which γth2To the source end e nA signal-to-noise ratio threshold causing interference; r is a radical of hydrogenmFor the destination of D2D versus m,to a destination end rmTransmit power of H (e)n,rm) As a source end enTo the destination end rmThe channel gain of (a); r isiFor the destination of D2D pair i,to a destination end riTransmit power of H (e)n,ri) As a source end enTo the destination end riThe channel gain of (1).
8. The method for combined spectrum access and power control in D2D haptic communication of claim 1, wherein the utility function u of game participant n in step sixn(pn,p-n) Is defined as
Wherein r isiFor the destination of D2D pair i,to a destination end riThe packet error rate of (2); e.g. of the typejBeing the source end of D2D pair j,as a source end ejThe packet error rate of (2);as source end enSet of destinations affected when transmitting at transmit power S, i.e.WhereinIn the same way, the method for preparing the composite material,is as the destination terminal rnSet of source terminals affected when transmitting at the transmission power S, i.e.WhereinTherefore, unThe physical meaning is the sum of the packet error rates of all the sources and destinations affected by the game participant n.
9. The method for combined spectrum access and power control in D2D haptic communication according to claim 1, wherein the power control algorithm based on log-linear learning of synchronization in step seven is as follows:
initialization: setting the initial iteration time t as 0 and initializing all gamesParticipant's power vector p (0) ═ { p { (p) }1(0),...,pN(0) Therein ofAnd isIn addition, a binary indication variable is set
All game participants simultaneously perform the following steps:
t=t+1
Wherein p' is the strategy taken by the game participant n at time t;representation collectionNumber of middle elements, δnAn exploration rate for game participant n; if p isn(t)≠pn(t-1) setting bn(t) ═ 1; otherwise, set bn(t)=0;
Wherein, beta is a learning parameter,andthe utility of the betting participant n at times t-1 and t-2 respectively,then, set up bn(t)=0;
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011587066.7A CN112752266B (en) | 2020-12-28 | 2020-12-28 | Joint spectrum access and power control method in D2D haptic communication |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011587066.7A CN112752266B (en) | 2020-12-28 | 2020-12-28 | Joint spectrum access and power control method in D2D haptic communication |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112752266A CN112752266A (en) | 2021-05-04 |
CN112752266B true CN112752266B (en) | 2022-05-24 |
Family
ID=75646464
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011587066.7A Active CN112752266B (en) | 2020-12-28 | 2020-12-28 | Joint spectrum access and power control method in D2D haptic communication |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112752266B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114979013B (en) * | 2022-05-17 | 2023-12-05 | 南京邮电大学 | Multi-mode service-oriented transmission mode selection and resource allocation method |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103220724A (en) * | 2013-04-25 | 2013-07-24 | 北京邮电大学 | Cellular and dimension to dimension (D2D) user frequency spectrum accessing method in D2D communication mixing system |
CN107094060A (en) * | 2017-04-24 | 2017-08-25 | 东南大学 | Distributed super-intensive heterogeneous network disturbance coordination method based on non-cooperative game |
CN109729528A (en) * | 2018-12-21 | 2019-05-07 | 北京邮电大学 | A kind of D2D resource allocation methods based on the study of multiple agent deeply |
-
2020
- 2020-12-28 CN CN202011587066.7A patent/CN112752266B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103220724A (en) * | 2013-04-25 | 2013-07-24 | 北京邮电大学 | Cellular and dimension to dimension (D2D) user frequency spectrum accessing method in D2D communication mixing system |
CN107094060A (en) * | 2017-04-24 | 2017-08-25 | 东南大学 | Distributed super-intensive heterogeneous network disturbance coordination method based on non-cooperative game |
CN109729528A (en) * | 2018-12-21 | 2019-05-07 | 北京邮电大学 | A kind of D2D resource allocation methods based on the study of multiple agent deeply |
Non-Patent Citations (2)
Title |
---|
Contention-Based Radio Resource Management for URLLC-Oriented D2D Communications;Y. Wu等;《in IEEE Transactions on Vehicular Technology, vol. 69, no. 9, pp. 9960-9971, Sept. 2020, doi: 10.1109/TVT.2020.3003944》;20200622;全文 * |
基于博弈论的D2D通信功率控制算法;刘晓玲等;《信息通信》;20180315(第03期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN112752266A (en) | 2021-05-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2023179010A1 (en) | User packet and resource allocation method and apparatus in noma-mec system | |
CN111800812B (en) | Design method of user access scheme applied to mobile edge computing network of non-orthogonal multiple access | |
Cai et al. | Intelligent cognitive spectrum collaboration: Convergence of spectrum sensing, spectrum access, and coding technology | |
Hu et al. | Throughput analysis of low-latency IoT systems with QoS constraints and finite blocklength codes | |
CN114867030B (en) | Dual-time scale intelligent wireless access network slicing method | |
CN114051222A (en) | Wireless resource allocation and communication optimization method based on federal learning in Internet of vehicles environment | |
CN110121212A (en) | A kind of ascending transmission method towards periodic group URLLC business | |
Dosti et al. | Ultra reliable communication via optimum power allocation for HARQ retransmission schemes | |
CN112752266B (en) | Joint spectrum access and power control method in D2D haptic communication | |
Zafar et al. | An efficient resource optimization scheme for D2D communication | |
Hamid et al. | An Optimized Algorithm for Resource Allocation for D2D in Heterogeneous Networks. | |
CN113891481B (en) | Throughput-oriented cellular network D2D communication dynamic resource allocation method | |
CN108848045A (en) | D2D Communication Jamming management method based on joint interference alignment and power optimization | |
Zhang et al. | Matching-based resource allocation and distributed power control using mean field game in the NOMA-based UAV networks | |
Zhu et al. | Load-aware dynamic mode selection for network-assisted full-duplex cell-free large-scale distributed MIMO systems | |
CN110798285B (en) | Retransmission method of URLLC in large-scale network based on frequency diversity | |
Ji et al. | Optimization of resource allocation for V2X security communication based on multi-agent reinforcement learning | |
CN107302747B (en) | Cooperative video transmission method based on stable matching | |
Al-Abiad et al. | Cross-layer network codes for completion time minimization in device-to-device networks | |
CN111343722B (en) | Cognitive radio-based energy efficiency optimization method in edge calculation | |
CN111885718B (en) | Robust cognitive communication system power distribution method based on cooperative relay | |
Shang et al. | Computation offloading management in vehicular edge network under imperfect CSI | |
Lu et al. | Deep reinforcement learning-based power allocation for ultra reliable low latency communications in vehicular networks | |
Yang et al. | [Retracted] Power Control for Full‐Duplex Device‐to‐Device Underlaid Cellular Networks: A Stackelberg Game Approach | |
Wang et al. | Cooperative game-based cheating in full-duplex relaying-based D2D communication underlaying heterogeneous cellular networks |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |